dipjyoti92

followers: 35 following: 0 repos: 31 gists: 0

Name: Dipjyoti Paul

Type: User

Bio: Marie Curie PhD Fellow, University of Crete

Blog: https://dipjyoti92.github.io/

Dipjyoti Paul's Projects

club

Code for the ICML 2020 paper "CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information"

expressive-fastspeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS (text to speech, speech synthesis) based on FastSpeech2, supporting English and Korean

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building a prosody encoder.

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

fbk-fairseq

fg-transformer-tts

generative-models

Collection of generative models (e.g. GAN, VAE) in PyTorch and TensorFlow.

melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

meta-tts

Official repository of https://doi.org/10.1109/TASLP.2022.3167258

mixnmatch

PyTorch implementation of MixNMatch

nemo

NeMo: a toolkit for conversational AI

nonparaseq2seqvc_code

Implementation code of non-parallel sequence-to-sequence VC

nv-wavenet

Reference implementation of real-time autoregressive wavenet inference

sc-wavernn

Official PyTorch implementation of Speaker Conditional WaveRNN

smart-single_emotional_tts

speaker_embeddings_ge2e

PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification

speechsplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

stargan-voice-conversion

A PyTorch implementation of StarGAN-VC (Better Audio Quality)

stargan-voice-conversion-2

A PyTorch implementation of StarGAN-VC2

styler

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, Interspeech 2021

svoice

We provide a PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The method employs gated neural networks that are trained to separate the voices at multiple processing steps while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is used to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
