Giter VIP home page Giter VIP logo

Jack's Projects

bfrffusion icon bfrffusion

Official codes of Towards Real-World Blind Face Restoration with Generative Diffusion Prior

coqui-tts icon coqui-tts

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

deepfacelab icon deepfacelab

DeepFaceLab is the leading software for creating deepfakes.

dinet icon dinet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

drpan icon drpan

Discriminative Region Proposal Adversarial Network (DRPAN)

eat_code icon eat_code

Official code for ICCV 2023 paper: "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation".

eegstreamer icon eegstreamer

A light and simple way to stream EEG data from the Muse 2 using LSL

emotivoice icon emotivoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

fixspotify icon fixspotify

A tool to clear the cache on the mac Spotify Desktop app

gpt-sovits icon gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

img2img-turbo icon img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

localai icon localai

:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities.

matcha-tts icon matcha-tts

[ICASSP 2024] 🍡 Matcha-TTS: A fast TTS architecture with conditional flow matching

melotts icon melotts

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

ollama icon ollama

Get up and running with Llama 2, Mistral, Gemma, and other large language models.

open-speech-corpora icon open-speech-corpora

πŸ’Ž A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

piper icon piper

A fast, local neural text to speech system

pl-bert icon pl-bert

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

radtts icon radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.