kingfener Goto Github PK
Name: king
Type: User
Bio: a man open the new world
Location: Beijing
Name: king
Type: User
Bio: a man open the new world
Location: Beijing
PyTorch implementation of LF-MMI for End-to-end ASR
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developped as a fast prototyping platform for beamforming algorithms in indoor scenarios.
All Algorithms implemented in Python
A Python wrapper for the high-quality vocoder "World"
关于python的面试题
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Reference implementation for DPO (Direct Preference Optimization)
Deep Learning Experiment Management
和 emo 类似的 图片+ 音频 转 视频。 [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
开源免费的简易中文分词系统,PHP分词的上乘之选!
Foundational Models for State-of-the-Art Speech and Text Translation
deep learning based speech enhancement using keras python, make it easy to use
Speech Enhancement Generative Adversarial Network in TensorFlow
各种功能集合:ASR\TTS: Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Icelandic TTS (text-to-speech) service for Android
Python library for processing Chinese text
SoftVC VITS Singing Voice Conversion
Simple library to speed up or slow down speech
视频中的与对象关联的音频分割:Codebase for ECCV18 "The Sound of Pixels"
Towards hot directions in industrial end to end speech recognition
List of speech synthesis papers.
A PyTorch-based Speech Toolkit
Easy-to-Use Speech MOS predictors
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.