Generate images from font files.
The images that are generated are useful in machine learning training for image applications.
These categories of characters are used:
- simplified Chinese (selected)
- Hànyǔ Pīnyīn vowels with tone marks - diacritics
- traditional Chinese including Zhuyin (selected)
- Chinese characters used in Cantonese only (selected)
- Japanese Kanji (selected)
- Japnese 平仮名-hiragana & 片仮名-katakana
- Korean syllables (selected)
- ascii, Windows-1252, Russian, Greek characters
- Math symbols (selected)
See README.md in char_lists/ for character selection rationales.
char_lists
contains lists of characters, each file contains one category of characters.fonts
will contain font filesscripts
has the main program