cst781 Goto Github PK
Type: User
Type: User
A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.
Simple implementation of MMDenseNet model
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
End-to-end ASR/LM implementation with PyTorch
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021 Accepted
:musical_note: a new stem dataset for Music Demixing research, from the OnAir royalty-free music project
Online Normalization for Training Neural Networks (Companion Repository)
Open-Unmix - Music Source Separation for PyTorch
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
A unofficial Pytorch implementation of Microsoft's PHASEN
An open source car license plate recognition python lib based on HyperLPR; Trained OpenCV’s cascaded target detector; Used Hard Sample Mining to crop out the detected error areas;
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
Python implementation of performance metrics in Loizou's Speech Enhancement book
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Tools for Speech Enhancement integrated with Kaldi
A PyTorch Dataset for Slakh2100
Deep learning based speech source separation using Pytorch
Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
Speech Denoising with Deep Feature Losses
Official repository of our paper: https://arxiv.org/abs/2010.15366
Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was submitted on ICASSP2022
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.