The words list with meaning generated using
- word list shared by Pleco
- CC-CEDICT
- cedict-json
- Anki Chinese Vocabulary Generator
- wiktionary, Mandarin Frequency lists
The meanings in HSK list with meaning
taken in following order
- if found in
wiktionary
then get meaning - else take meaning from
CC-CEDICT
- if not found then translate using Google Translate
The Scripts and data contains files that used to create HSK 3.0 lists.
The main.py is used to create HSK 3.0 word list with meaning. The script is not so optimized, it may need improvements.
The following data used to generate list
- 10k Mandarin from wiktionary
- all_cedict.json
- HSK 1.txt to HSK 7-9.txt (view in the folder)
The following data generated (view in the folder)
- HSK 1 to HSK 7-9 with clear meaning .txt files
- tsv list for importing in Anki
- Install Python
- Install following python modules using
pip
pinyin
pycedict
hanziconv
googletrans
pinyin_jyutping_sentence
- The script reads characters/words per line and fetch meaning. Then it write the data .txt files
- Uncomment functions to use the script
# uncomment below and run
# get_meaning()
# some other helper functions
# find_dup()
# get_sound()
# count_field()
- The meaning translated using Google when not found in CC-CEDICT.
- This generated using python program, may contain errors and need improvements.
View License