Guochen Yu's Projects
recent audio generation papers (including speech, music and general audios)
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
speech enhancement\speech seperation\sound source localization
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Conditional Diffusion Probabilistic Model for Speech Enhancement
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
Contrastive Language-Audio Pretraining
Conformer-based Metric GAN for speech enhancement
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计、Java、Python、C++
The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"
The audio demos with respect to the paper "DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement" are provided (submitted to TASLP). The code will also be released soon.
Noise supression using deep filtering
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
PyTorch Implementation of FastDiff (IJCAI'22)
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Leetcode solutions
Efficient neural speech synthesis
Examples of machine learning and signal processing algorithms.
Acoustic Echo Cancellation with Nerual Kalman Filtering
Simple implementations of NLP models. Tutorials are written in Chinese on my website https://mofanpy.com
Modern audio compression for the internet.
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
A unofficial Pytorch implementation of Microsoft's PHASEN
Generating room impulse responses
Self-Supervised Speech Pre-training and Representation Learning Toolkit.