speechprojects Goto Github PK

repos: 35.0 gists: 0.0

Type: Organization

speechprojects's Projects

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

amodem

Audio MODEM Communication Library in Python

asr-evaluation

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

autoeq

Automatic headphone equalization from frequency responses

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

deep_speaker-speaker_recognition_system

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

deepmusicclassification

An implementation of a Convolutional Neural Network to Classify Music Genres

diarizers

espnet

End-to-End Speech Processing Toolkit

flashlight

A C++ standalone library for machine learning

leaf-audio

LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.

lmms

Cross-platform music production software

loop

A method to generate speech across multiple speakers

magenta

Magenta: Music and Art Generation with Machine Intelligence

musictransformer-tensorflow2.0

implementation of music transformer with tensorflow-2.0 (ICLR2019)

natspeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

nesmdb

The NES Music Database: use machine learning to compose music for the Nintendo Entertainment System!

openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

praudio

Audio preprocessing framework for Deep Learning audio applications

pytheory

Music Theory for Humans.

pyttsx3

Offline Text To Speech synthesis for python

samplernn_iclr2017

SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

snickery

Hybrid speech synthesiser

soundata

Python library for downloading, loading & working with sound datasets

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

speechpy

:speech_balloon: SpeechPy - A Library for Speech Processing and Recognition

spokestack-python

Spokestack is a library that allows a user to easily incorporate a voice interface into a Python application.

speechprojects Goto Github PK

speechprojects's Projects

Recommend Projects

Recommend Topics

Recommend Org