Giter VIP home page Giter VIP logo

zhongwen's Introduction

zhongwen

大家好~ Curating a collection of Mandarin Chinese vocabulary, idioms (成语), and characters (汉字). Utilizing data from HSK 3.0, RSH, and other frequency lists. Providing information such as meanings, pinyin pronunciation, character decomposition, frequency order, and relevant tags.

csv file name words characters (汉字) total
hsk 3.0 - characters.csv x 11,092
hsk 3.0 - words.csv x 3,000
rsh.csv x 3,000
chengyu_by_theme.csv 2,350
mega_hanzi_compilation.csv x 11,266

Table of Contents

HSK 3.0 (11,092 words and 3,000 characters)
Remembering Simplified Hanzi - RSH (3,000 characters)
Chengyu 成语 - Chinese Idioms ordered by theme
General Standard Chinese Characters 通用规范汉字表 (8,105 characters)
Jun Da's Character frequency list of Modern Chinese List (9,933 characters)
Character frequency list compilation of 11,266 characters
Additional Language Learning Resources

Your GIF

HSK 3.0 汉语水平考试 (Chinese Proficiency Test)

People's Republic of China's standardized test of proficiency in PRC Standard Chinese for non-native speakers.

Both list, includes characters, pinyin, and definitions

  • Characters (recognition) - 3,000
  • Words - 11,092

Remembering Simplified Hanzi (RSH)

3,000 characters

Book 1 and 2.

By James W. Heisig, Timothy W.Richardson. Book 1 of Remembering Simplified Hanzi covers the writing and meaning of the 1,000 most commonly used characters in the simplified Chinese writing system, plus another 500 that are best learned at an early stage. (Book 2 adds another 1,500 characters for a total of 3,000.)

Chengyu 成语

Chengyu are a type of traditional Chinese idiomatic expressions, most of which consist of four characters.

Data source : **成语大全,值得收藏!(Chengyu)

Jun Da's Modern Chinese Character Frequency List

This website provides character frequency lists generated from a large corpus of Chinese texts collected from online sources.

https://lingua.mtsu.edu/chinese-computing/statistics/

General Standard Chinese Characters 通用规范汉字表

The Table of General Standard Chinese Characters is the current standard list of 8,105 Chinese characters published by the government of the People's Republic of China and promulgated in June 2013.

Compilation of 11,266 Hanzi

Used the following sources:

Lists

Dictionaries

Parser/Module/Libraries

Additional Language Learning Resources

zhongwen's People

Contributors

alyssabedard avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.