entn-at Goto Github PK
Name: Ewald Enzinger
Type: User
Bio: Ph.D. EE (UNSW Sydney). ML, speaker recognition, speech recognition, speech synthesis, forensic voice comparison
Twitter: entn_at
Location: Portland, Oregon
Blog: https://entn.at/
Name: Ewald Enzinger
Type: User
Bio: Ph.D. EE (UNSW Sydney). ML, speaker recognition, speech recognition, speech synthesis, forensic voice comparison
Twitter: entn_at
Location: Portland, Oregon
Blog: https://entn.at/
🤖💬 Implementation of a Transformer based neural network for text to speech.
new version
Transcribing Speech with Multinomial Diffusion, training code and models.
A zero-shot learning based grapheme-to-phoneme model for 8k languages
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Systems
TristouNet: Triplet Loss for Speaker Turn Embedding
Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
End-2-end speech synthesis with recurrent neural networks
TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
Simple REST-style HTTP TTS (text to speech) server based on MaryTTS, espeak and sequitur
Official Pytorch implementation of TTS Style Transfer
Text-to-Speech tutorial at SLTU 2016
This repository is a collection of TTS Models in TFLite
Text to Speech Synthesis based on controllable latent representation
A simple app for recording speech datasets.
tensorflow speech synthesis c++ inference for voicenet
A recipe to build a Modern Standard Arabic set of ASR acoustic models with Kaldi.
Resources for "Simple Speech Representation Learning from Perceptual Data".
PyTorch implementation of XML on TVR dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Offical repository of TwiBot-22.
twin model GPLDA for short duration speaker verification
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
Development of ASR for ArchiMob, a spoken corpus of Swiss German.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.