zhyoung24 Goto Github PK
Type: User
Type: User
Huggingface Transformers + Adapters = ❤️
Vocode spectrograms to audio with generative adversarial networks
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
Global Rhythm Style Transfer Without Text Transcriptions
BERT for Multitask Learning
Charsiu: A neural phonetic aligner.
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Implementation related to the Deep Complex Networks
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Efficient neural speech synthesis
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
A Demo of Mandarin/Chinese TTS frontend
Multi-speaker Tacotron in TensorFlow. 오픈소스 딥러닝 다중 화자 음성 합성 엔진.
An easy-to-use named entity recognition (NER) toolkit, implemented the Bi-LSTM+CRF model in tensorflow.
Problem Agnostic Speech Encoder
PPG-Based Voice Conversion
Some basic praat scripts.
Speech Model Pre-training for End-to-End Spoken Language Understanding
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A python package to analyze and compare voices with deep learning
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.