kingfener Goto Github PK

followers: 1.0 following: 3.0 repos: 236.0 gists: 1.0

Name: king

Type: User

Bio: a man open the new world

Location: Beijing

king's Projects

pychain

PyTorch implementation of LF-MMI for End-to-end ASR

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developped as a fast prototyping platform for beamforming algorithms in indoor scenarios.

python-wrapper-for-world-vocoder

A Python wrapper for the high-quality vocoder "World"

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

pytorch-tdnn

Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training

pytorch_mlp_for_asr

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

real-time-voice-cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

rlhf-dpo-direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

runx

Deep Learning Experiment Management

sadtalker_emo

和 emo 类似的图片+ 音频转视频。 [CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

scws

开源免费的简易中文分词系统，PHP分词的上乘之选！

seamlessm4t-tts-asr

Foundational Models for State-of-the-Art Speech and Text Translation

sednn

deep learning based speech enhancement using keras python, make it easy to use

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

sherpa-onnx

各种功能集合:ASR\TTS: Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

kingfener Goto Github PK

king's Projects

Recommend Projects

Recommend Topics

Recommend Org