wenzheliu-speech

followers: 256 following: 188 repos: 63 gists: 0

Name: Wenzhe Liu (刘文哲)

Type: User

Bio: Hi, I am Wenzhe Liu. I work for Kuaishou and was previously employed by Tencent. I focus on generalized speech enhancement, audio codecs, and speech synthesis.

Location: Beijing, China

Blog: https://wenzheliu-speech.github.io/

Hi, I'm Wenzhe Liu (刘文哲)

🏠 I work for Kuaishou (快手), was previously employed by Tencent (腾讯), and graduated from the Institute of Acoustics, Chinese Academy of Sciences (中科院声学所)
📕 Research interests: speech enhancement, compression, synthesis, and voice conversion
- frontend processing: acoustic echo cancellation, denoising, and dereverberation
- audio codec and speech compression: audio (speech, music, and noise) codec, packet loss concealment, and bandwidth extension
- speech synthesis: TTS, voice conversion, and speech restoration
- far-field sound pickup: beamforming, DOA estimation, and microphone array signal processing
📫 How to reach me: [email protected]
More information about me is available on my homepage: https://wenzheliu-speech.github.io/

Wenzhe Liu (刘文哲)'s Projects

adsp_tutorials

Advanced Signal Processing Notebooks and Tutorials

aero

Audio Super Resolution in the Spectral Domain

AI Audio Datasets 🎵. A list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

audiocodingtutorials

Audio Coding Notebooks and Tutorials

autoencoder-speech-compression

Code for "End-to-End Optimized Speech Coding with Deep Neural Networks" (ICASSP 2018)

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

awesome-singing-voice-synthesis-and-singing-voice-conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).

awesome-speech-enhancement

Speech enhancement, speech separation, and sound source localization

beamforming

Simulations of different beamforming algorithms, more than 30 in total

clarity_cec1

1st Clarity Enhancement Challenge

cospa

Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement

cutword

A simple and fast tool for word segmentation and named entity recognition

deepfilter_implement_see_-networks-speakerfilter.py

Wenzhe Liu's notes: a deep filter reproduction. See 23_3090_speakerfilter_new_deepfilter_final_1024_new/networks/speakerfilter.py, i.e. https://github.com/heshulin/23_3090_speakerfilter_new_deepfilter_final_1024_new/blob/86dd75cb9f7858b11e8adc0097da372f706c23a1/networks/speakerfilter.py#L103

dns-challenge-iacaslab9.github.io

easyrec

A framework for large scale recommendation algorithms.

egemaps_estimator

erb_bands

ERB representation of an audio file implemented in Python

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

ilrma

MATLAB script of Independent Low-Rank Matrix Analysis (ILRMA)

jaecbf

k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

maximilian

C++ Audio and Music DSP Library

mcnet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023

microphone-array-generalization-for-multichannel-narrowband-deep-speech-enhancement

This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.

minbpe

Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

mp-senet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
