widyhu2020 Goto Github PK
Type: User
Type: User
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
ArduPlane, ArduCopter, ArduRover, ArduSub source
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Let ChatGPT teach your own chatbot in hours with a single GPU!
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Code for CodeT5: a new code-aware pre-trained encoder-decoder model.
CodeXGLUE
Making large AI models cheaper, faster and more accessible
darknet text detect and darknet cnn ocr
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Bash script for installing V2Ray in operating systems such as Debian / CentOS / Fedora / openSUSE that support systemd
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
IPTV 国内+国外 电视台直播源m3u文件, 收集&汇总&本地源脚本
A framework for few-shot evaluation of language models.
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
结合python一起学习自然语言处理 (nlp): 语言模型、HMM、PCFG、Word2vec、完形填空式阅读理解任务、朴素贝叶斯分类器、TFIDF、PCA、SVD
记录本人整理的一些数据集
The official Python library for the OpenAI API
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
PyTorch 1.0 官方文档 中文版,欢迎关注微信公众号:磐创AI
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
SVN support for VS Code
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
A platform for building proxies to bypass network restrictions.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.