Giter VIP home page Giter VIP logo

boostpapa's Projects

academicodec icon academicodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

audiocraft icon audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

bigcidian icon bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

cat icon cat

A CRF-based ASR Toolkit

chainer icon chainer

A flexible framework of neural networks for deep learning

ctc_decoder icon ctc_decoder

A ctc decoder for both online and offline asr model

e2e_lfmmi icon e2e_lfmmi

This is the implementation of paper CONSISTENT TRAINING AND DECODING FOR END-TO-END SPEECH RECOGNITIONUSING LATTICE-FREE MMI submitted to ICASSP2022

espnet icon espnet

End-to-End Speech Processing Toolkit

fastspeech2 icon fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

gpt-sovits icon gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

kaldi icon kaldi

This is now the official location of the Kaldi project.

leaderboard icon leaderboard

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

mace icon mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

natspeech icon natspeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

opennmt-py icon opennmt-py

Open Source Neural Machine Translation in PyTorch

openseq2seq icon openseq2seq

Toolkit for efficient experimentation with various sequence-to-sequence models

sherpa icon sherpa

Speech-to-text server framework with next-gen Kaldi

tts icon tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

uis-rnn icon uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.