Giter VIP home page Giter VIP logo

speechprojects's Projects

aeneas icon aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

amodem icon amodem

Audio MODEM Communication Library in Python

asr-evaluation icon asr-evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

audiocraft icon audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

autoeq icon autoeq

Automatic headphone equalization from frequency responses

espnet icon espnet

End-to-End Speech Processing Toolkit

leaf-audio icon leaf-audio

LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.

lmms icon lmms

Cross-platform music production software

loop icon loop

A method to generate speech across multiple speakers

magenta icon magenta

Magenta: Music and Art Generation with Machine Intelligence

natspeech icon natspeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

nesmdb icon nesmdb

The NES Music Database: use machine learning to compose music for the Nintendo Entertainment System!

openspeech icon openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

praudio icon praudio

Audio preprocessing framework for Deep Learning audio applications

pyttsx3 icon pyttsx3

Offline Text To Speech synthesis for python

soundata icon soundata

Python library for downloading, loading & working with sound datasets

speech-trident icon speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

speechbrain.github.io icon speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

speechpy icon speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition

spokestack-python icon spokestack-python

Spokestack is a library that allows a user to easily incorporate a voice interface into a Python application.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.