陈明's Projects
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
Beam search for neural network sequence to sequence (encoder-decoder) models.
Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private server services
TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)
Demo web server app that shows how BERT model trained on SQuAD dataset deals with the machine comprehension task.
一行代码使用BERT生成句向量,BERT做文本分类、文本相似度计算
简单的向量白化改善句向量质量
TensorFlow code and pre-trained models for BERT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
中文公开聊天语料库
基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话
基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model
This repo is unofficial ChatGPT api. It is based on Daniel Gross's WhatsApp GPT
Integrate ChatGPT into your own discord bot
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
Chinese word segmentation algorithm without corpus(无需语料库的中文分词)
多标签文本分类,多标签分类,文本分类, multi-label, classifier, text classification, BERT, seq2seq,attention, multi-label-classification
multi-label,classifier,text classification,多标签文本分类,文本分类,BERT,ALBERT,multi-label-classification
multi-label,classifier,text classification,多标签文本分类,文本分类,BERT,ALBERT,multi-label-classification
multi-label,classifier,text classification,多标签文本分类,文本分类,BERT,ALBERT,multi-label-classification,seq2seq,attention,beam search
中文任务基准测评 datasets, baselines, pre-trained models, corpus and leaderboard
中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)
对话机器人(聊天机器人)设计思考