boostpapa Goto Github PK

followers: 0.0 following: 3.0 repos: 33.0 gists: 0.0

Type: User

boostpapa's Projects

academicodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

automatic_speech_recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

bert-vits2

vits2 backbone with bert

bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

bigvsan

Pytorch implementation of BigVSAN

cat

A CRF-based ASR Toolkit

chainer

A flexible framework of neural networks for deep learning

ctc_decoder

A ctc decoder for both online and offline asr model

deepspeech

A PaddlePaddle Speech to Text toolkit.

e2e_lfmmi

This is the implementation of paper CONSISTENT TRAINING AND DECODING FOR END-TO-END SPEECH RECOGNITIONUSING LATTICE-FREE MMI submitted to ICASSP2022

espnet

End-to-End Speech Processing Toolkit

fastertransformer

Transformer related optimization, including BERT, GPT

fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

fish-speech

Brand new TTS solution

gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

kaldi

This is now the official location of the Kaldi project.

kkndme_tianya

天涯 kkndme 神贴聊房价

leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

lhotse

mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

machinelearning

My blogs and code for machine learning. http://cnblogs.com/pinard

mytinystl

Achieve a tiny STL in C++11

natspeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

opennmt-py

Open Source Neural Machine Translation in PyTorch

openseq2seq

Toolkit for efficient experimentation with various sequence-to-sequence models

ppg-vc

PPG-Based Voice Conversion

sherpa

Speech-to-text server framework with next-gen Kaldi

tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

boostpapa Goto Github PK

boostpapa's Projects

Recommend Projects

Recommend Topics

Recommend Org