Giter VIP home page Giter VIP logo

netemvocabulary's Introduction

考研词汇词频排序数据

经统计,在《2024年全国硕士研究生招生考试英语(一)考试大纲词汇表》中要求掌握的词汇共 5530 个,根据四六级、考研英语、专四专八约 200 套试卷文本,按照试卷文本中出现的词频对词汇表进行排序。

排序使用了词形还原策略,所以与实际试卷呈现略有差异。

2444 个单词出现 40 次以上,即平均每做 5 套试卷就能遇到一次的这些单词即为真正的高频词汇

高频词汇的释义经过了人工初步校对,其他单词选取使用频率总和大于 50% 的释义(数据来自 the little dict),可以保证一定的准确性。减轻不必要的机械记忆负担。

每个单词有其他拼写(即考纲当中有多种写法的单词)的,一并列出,以保证原始数据的准确性。目前根据这个数据进行了初步填充。有空再和考纲校对。

netem_full_list.json 里面存储了所有的数据。也已转换成 sql 文件

本仓库数据基于 CC BY-NC-SA 4.0 共享,程序基于 MIT License.

Release 页面下载 PDF 版本。

如果想自行生成,请参阅文档

netemvocabulary's People

Contributors

2022mfan avatar awxiaoxian2020 avatar knitting-wool-ball avatar m2qian avatar prokingdu avatar shy-114514 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

netemvocabulary's Issues

如何补充“异形词”这一列数据?

在考纲当中列出了多种写法的单词,为了保证数据完整性,我计划将这些其他的写法列在“异形词”这一列。

我手头虽然有原始数据,但是未经清洗和整理,有没有线上可以查询英美拼写差异的数据库可供调用?

API开发计划

  • 将所有的硬编码字符串变成常量或配置文件
  • js API 使用 Promise 重新实现
  • doc 美化自动化(使用 JS macro in WPS)
  • 仅使用 sql 维护 JSON

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.