macroustc Goto Github PK

followers: 0.0 following: 4.0 repos: 262.0 gists: 0.0

Type: User

macroustc's Projects

3d-speaker

A repository for single- and multi-modal speaker verification, speaker recognition, and speaker diarization.

advoc

Vocode spectrograms to audio with generative adversarial networks

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

annotated_deep_learning_paper_implementations

🧑‍🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

asrt_speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

asteroid

The PyTorch-based audio source separation toolkit for researchers

asv-subtools

An Open Source Tools for Speaker Recognition

athena

an open-source implementation of sequence-to-sequence based speech processing engine

attentionbasedprosodyprediction

Encoder and Decoder and Attention Based Prosody Prediction

aud-crawler

A pakage for crawling audio from Youtube

audino

Open source audio annotation tool for humans

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

audio-driven-talkingface-headpose

Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose"

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

audioldm2

Text-to-Audio/Music Generation

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

automatic-prosody-annotation

awesome-chatgpt-prompts-zh

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling