Generators

generators-window.png

AnkiMorphs provides the following three generators:


To use the generators you have to follow these three steps:

  1. Load input files
  2. Select processing options
  3. Select output options

Loading Files

File Formats

file_formats.png

These are the files that the generators are (mostly) able to read. Any files that don't have these extensions will be ignored.

Please note that the files must be encoded in UTF-8. Using other encodings may lead to parsing errors or crashes.

Selecting Root Folder

generator-select-input.png

Any files that match your selected file formats and are in this folder or sub-folders, will be used by the generators.

Take, for example, the following folders and their files:

english_texts/
    - books/
        - The Wise Man's Fear/
            - The Wise Man's Fear.epub
            - The Wise Man's Fear.txt
    - subs/
        - Game-of-Thrones/
            - season-1/
                - episode_1.srt
        - Lord_of_the_Rings/
            - The_Fellowship_of_the_Ring.vtt

If you were to select the books folder, and you checked the .txt file format, then the generator would only use the The Wise Man's Fear.txt file.

If you were to select the folder english_texts and you checked all the file format options, then the generator would use the files:

  • The Wise Man's Fear.txt
  • episode_1.srt
  • The_Fellowship_of_the_Ring.vtt

After Loading

generator_loaded_files.png

The files that will be used by the generators will be shown in the File column in the tables below, and the generator buttons are now enabled. Next, you need to specify how the generators should process the files.

Processing Files

Morphemizer

generator-morphemizer.png

This is the tool AnkiMorphs uses to split text into morphs.

Preprocess

generator-parsing-options.png

These options are equivalent to those found in Preprocess settings.

Generator Output

When clicking the Generate Freuency File or Generate Study Plan buttons you will be presented with these options:

generator-output-dialog.png

The output file is automatically set to be in the [anki profile folder]/frequency-files/ folder. Any frequency files or study plans that are placed in this folder can be selected in the note filter: morph priority settings.

You can name the file whatever you want as long as it has a .csv extension, e.g. ja-freq.csv.

Minimum Occurrence

Limit the morphs to only those that occur at least x many times.

Comprehension Target

Limit the morphs to only those that occur below the specified comprehension percent. Let's take these morphs as an example:

comprehension-example.png

If your target is 90%, then we get:

The morphs in the fifth and sixth rows would therefore not be included since they have an occurrence sum greater than 360.



Readability Report Generator

The Readability Report Generator can give you insights into how much of the text in a file you are able to read. It produces two different outputs, one with pure numerical values, and one with percentages.

readability-report-generator-numerical.png readability-report-generator-percentages.png

You can click on the column headers to sort the rows based on those values. In the pictures above the rows are sorted based on the value in the File column.



Frequency File Generator

The Frequency File Generator creates a frequency file that is described in the prioritizing section.



Study Plan Generator

study-guide-example.png

Using a study plan is convenient if you want to learn morphs from source materials in a specific sequence, e.g., TV show episodes, book series, etc.

A study plan differs from a regular frequency file in the following ways:

  • It is first sorted by input files, then morph frequency.
  • It has extra columns:
    • Learning status
    • File name

The study plan generator basically does this:

  1. Creates a frequency file for each input file
  2. Combines those frequency files
  3. Removes duplicate morphs

The resulting file can be used in the note filter: morph priority settings like any other frequency file.

Note: that only the data from the Morph-lemma, and Morph-inflection columns are read by AnkiMorphs, so you can delete or modify the other columns if you want.

Changing The File Order

The study plan uses the same file order as that displayed in the currently opened table at the bottom of the window. This provides more flexibility than relying solely on the alphanumeric values of the file names.

If I have this table open as I click the Generate Study Plan button:

study-plan-order-example-1.png

Then the study plan will have the files in this order:

  1. Jigokuraku-03.srt
  2. Jigokuraku-01.srt
  3. Jigokuraku-02.srt

Note: the Total "file" is artificial and won't be included, nor is its data used in any calculations.

With this table open:

study-plan-order-example-2.png

Then the order will be this:

  1. Jigokuraku-03.srt
  2. Jigokuraku-02.srt
  3. Jigokuraku-01.srt