orantake Goto Github PK

followers: 3.0 following: 11.0 repos: 33.0 gists: 0.0

Name: GaonJhwan

Type: User

Company: Korea Telecom

Location: Seoul

Blog: www.linkedin.com/in/orantake

GaonJhwan's Projects

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

bigvgan

Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

ddpm

Denoising Diffusion Probabilistic Models

ddpm_pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

demo-page

dl-for-emo-tts

:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

fastspeech2_pytorch

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

fastspeech2_pytorch_korean

Implementation of Korean FastSpeech2

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

hosting-school

History of hosting and summer/winter school presented at Deepest

lightning_cloud

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

mfa-models

Collection of pretrained models for the Montreal Forced Aligner

naslib

NASLib is a Neural Architecture Search (NAS) library for facilitating NAS research for the community by providing interfaces to several state-of-the-art NAS search spaces and optimizers.

normalize-encodec-demo

Demo page of the paper "Enhancing Neural Audio Codec and Zero-Shot Text-to-Speech Performance Through Latent Vector Shape Optimization"

orantake.github.io

parallel-tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

portaspeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

practice_deep-learning

Deep learning practice using keras and several datasets

pytorch

📍 Deep Learning Zero to All - Pytorch

pytorch-gan

PyTorch implementations of Generative Adversarial Networks.

soundstorm-speechtokenizer

Implementation of SoundStorm built upon SpeechTokenizer.

speech-synthesis-paper

List of speech synthesis papers.

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

vall-e-x

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

orantake Goto Github PK

GaonJhwan's Projects

Recommend Projects

Recommend Topics

Recommend Org