orantake Goto Github PK
Name: GaonJhwan
Type: User
Company: Korea Telecom
Location: Seoul
Name: GaonJhwan
Type: User
Company: Korea Telecom
Location: Seoul
Audio generation using diffusion models, in PyTorch.
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Denoising Diffusion Probabilistic Models
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
:computer: :robot: A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech :speaker:
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Implementation of Korean FastSpeech2
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
History of hosting and summer/winter school presented at Deepest
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Collection of pretrained models for the Montreal Forced Aligner
NASLib is a Neural Architecture Search (NAS) library for facilitating NAS research for the community by providing interfaces to several state-of-the-art NAS search spaces and optimizers.
Demo page of the paper "Enhancing Neural Audio Codec and Zero-Shot Text-to-Speech Performance Through Latent Vector Shape Optimization"
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Deep learning practice using keras and several datasets
📍 Deep Learning Zero to All - Pytorch
PyTorch implementations of Generative Adversarial Networks.
Implementation of SoundStorm built upon SpeechTokenizer.
List of speech synthesis papers.
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
[WIP] VoiceSmith makes training text to speech models easy.
Production First and Production Ready End-to-End Speech Recognition Toolkit
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.