Sijun He's Projects
Public repo for HF blog posts
Stanford CS341: Projects for Mining Massive Dataset, MOOC
Simple command-line scripts for document classification
DSPy: The framework for programming—not prompting—foundation models
The ERNIE Bot Python library provides convenient access to the ERNIE Bot API.
Frontend components, documentation and information hosted on the Hugging Face website.
⚡ Building applications with LLMs through composability ⚡
文本智能校对大赛(Chinese Text Correction)的baseline
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Easy-to-use and powerful NLP library with Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including Neural Search, Question Answering, Information Extraction and Sentiment Analysis end-to-end system.
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
PaddleSlim is an open-source library for deep model compression and architecture search.
My Chinese and English Resumes in LaTeX with Font Awesome 5
💎 A text first theme for Jekyll.
Mirror of Apache Spark
An Open Source Machine Learning Framework for Everyone
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Named Entity Recognition on Twitter data