dipjyoti92 Goto Github PK
Name: Dipjyoti Paul
Type: User
Bio: Marie Curie PhD Fellow, University of Crete
Name: Dipjyoti Paul
Type: User
Bio: Marie Curie PhD Fellow, University of Crete
Open source code for AlphaFold.
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.
MelGAN vocoder (compatible with NVIDIA/tacotron2)
Official repository of https://doi.org/10.1109/TASLP.2022.3167258
Pytorch implementation of MixNMatch
NeMo: a toolkit for conversational AI
Implementation code of non-parallel sequence-to-sequence VC
Reference implementation of real-time autoregressive wavenet inference
Official PyTorch implementation of Speaker Conditional WaveRNN
PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification
Unsupervised Speech Decomposition Via Triple Information Bottleneck
A Pytorch implementation of StarGAN-VC (Better Audio Quality)
A Pytorch implementation of StarGAN-VC2
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, Interspeech 2021
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
Tacotron + WaveRNN Vocoder
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Official PyTorch implementation of TTS Style Transfer
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.