drbrule
Name: Jack
Type: User
Bio: Neuroscientist, Data Scientist, Musician
All-In-One Music Structure Analyzer
Toolkit for downloading and processing Google's AudioSet dataset.
Official codes of Towards Real-World Blind Face Restoration with Generative Diffusion Prior
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
DeepFaceLab is the leading software for creating deepfakes.
AI lip sync project
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Discriminative Region Proposal Adversarial Network (DRPAN)
Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".
A light and simple way to stream EEG data from the Muse 2 using LSL
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Demos of Essentia models hosted on Replicate.com
Next generation face swapper and enhancer
A tool to clear the cache of the Spotify desktop app on macOS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Mind Monitor OSC Streaming Python Samples
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Instant voice cloning by MyShell.
Audio datasets, easier.
A fast, local neural text to speech system
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low Dimensional (F0 and Energy) Speech Attributes.
Voice data <= 10 mins can also be used to train a good VC model!