paineliu Goto Github PK
Type: User
Type: User
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
使用Bert,ERNIE,进行中文文本分类
BERT-based intent and slots detector for chatbots.
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
基于PyTorch的BERT中文文本分类模型(BERT Chinese text classification model implemented by PyTorch)
文本分类
多粒度分词的CQL解析器系统(CQL Analyzer with Multi-lEvel Lexical System, CAMELS)
Instance Segmentation for Chinese Character Stroke Extraction, Datasets and Benchmarks.
GBK汉字拆分表
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。
基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
🇨🇳 功能全面的汉字工具库 (拼音 笔画 偏旁 成语 语音 可视化等) (Chinese character util)
Corpus Query Language Parser
Parse Corpus Query Language (CQL) into a list of JSON queries
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
Data and tools for generating and inspecting OLMo pre-training data.
Chinese version of GPT2 training code, using BERT tokenizer.
Extraction of Chinese Character Components in Digital Ink based on Recognition Scores
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.