Glenn1Q84's Projects

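hanlp

Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, semantic dependency parsing, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin conversion, Simplified/Traditional Chinese conversion, natural language processing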

ijson

Iterative JSON parser with Pythonic interfaces

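introduction-nlp

Detailed notes on 《自然语言处理入门》 (Introduction to Natural Language Processing), the new book by the author of HanLP! A conscientious piece of work: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it gradually covers the algorithmic principles and engineering implementations of several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.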

keyword_extraction

Keyword extraction for Chinese text implemented in Python, using three methods: TF-IDF, TextRank, and Word2Vec word clustering.

latte

Wow, a cup of Latte! A flexible [simple] Zhihu crawler~

ltp

Language Technology Platform

meddg

A large-scale, high-quality medical dialogue dataset

miningzhidaoqacorpus

ZhiDaoChatCorpus: QA pairs crawled from Baidu Zhidao, containing more than 5,800,000 questions and 9,830,000 answers (9.38 million according to the Chinese description), with 5,800 category tags. The corpus can support a variety of applications, such as chit-chat question answering and logic mining.

ml-note

📙 A gradual write-up of the machine learning algorithms I have studied, explained the way I understand them (with an emphasis on mathematical derivations).

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

network-modules

🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolutions, helpful for understanding the papers more deeply. ⭐⭐⭐

newspaper

News, full-text, and article metadata extraction in Python 3. Advanced docs:

ngram2vec

Four word embedding models implemented in Python, supporting arbitrary context features.

online_pca

courses: A Brief Survey of Approaches for Unconstrained Optimization Problems

opencc

Conversion between Traditional and Simplified Chinese

promptpapers

Must-read papers on prompt-based tuning for pre-trained language models.

proxypool

An Efficient ProxyPool with Getter, Tester and Server
