drbrule Goto Github PK

followers: 1.0 following: 11.0 repos: 37.0 gists: 0.0

Name: Jack

Type: User

Bio: Neuroscientist, Data Scientist, Musician

Jack's Projects

audioset-processing

Toolkit for downloading and processing Google's AudioSet dataset.

bfrffusion

Official codes of Towards Real-World Blind Face Restoration with Generative Diffusion Prior

coqui-tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

deepfacelab

DeepFaceLab is the leading software for creating deepfakes.

dinet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

drpan

Discriminative Region Proposal Adversarial Network (DRPAN)

eat_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

eegstreamer

A light and simple way to stream EEG data from the Muse 2 using LSL

emotivoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

essentia-replicate-demos

Demos of Essentia models hosted on Replicate.com

facefusion

Next generation face swapper and enhancer

fixspotify

A tool to clear the cache on the mac Spotify Desktop app

gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

matcha-tts

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

melotts

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

mindmonitorpython

Mind Monitor OSC Streaming Python Samples

ollama

Get up and running with Llama 2, Mistral, Gemma, and other large language models.

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

openvoice

Instant voice cloning by MyShell.

ozen-toolkit

Audio datasets, easier.

pheme

piper

A fast, local neural text to speech system

pl-bert

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.

retrieval-based-voice-conversion-webui

Voice data <= 10 mins can also be used to train a good VC model!

drbrule Goto Github PK

Jack's Projects

Recommend Projects

Recommend Topics

Recommend Org