Retrieval-based-Voice-Conversion-WebUI

An easy-to-use voice conversion (voice changer) framework based on VITS

更新日志 | Frequently Asked Questions | AutoDL - 50 cents to train AI singers | Record of control experiments | Online Demo

English | 中文简体 | 日本語 | 한국어 (韓國語)

Click here to view our demo video !

Real-time voice conversion using RVC: w-okada/voice-changer

Online demo of a vocal-to-acoustic guitar model trained with the RVC vocoder ：https://huggingface.co/spaces/lj1995/vocal2guitar

RVC Vocal to Guitar Effect Demonstration Video ：https://www.bilibili.com/video/BV19W4y1D7tT/

The bottom model is trained using close to 50 hours of open-source, high-quality VCTK training set, with no copyright concerns.

High-quality licensed vocal training sets will be added to train the base model.

Introduction

This repository has the following features

Eliminate tone leakage by replacing input source features with training set features using top1 searches.
Fast training even on relatively poor graphics cards.
Good results with small amounts of data (at least 10 minutes of low-floor-noise speech data is recommended).
Possibility to change the timbre by model fusion (ckpt-merge in the ckpt-processing tab).
Easy-to-use web interface
UVR5 modeling for fast separation of vocals and accompaniment
Rooting out mute problems with the state-of-the-art [vocal pitch extraction algorithm InterSpeech2023-RMVPE] (#Reference Item). Best results (significantly) but faster and less resource intensive than crepe_full!

Environment Configuration

The following commands need to be executed in an environment with Python version greater than 3.8.

(Windows/Linux)
Start by installing the main dependencies via pip:

# Install Pytorch and its core dependencies, skip if already installed
# Reference from: https://pytorch.org/get-started/locally/
pip install torch torchvision torchaudio

# For Win + Nvidia Ampere architecture (RTX30xx), according to #21 experience, you need to specify the cuda version of pytorch.
#pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

Dependencies can be installed using poetry:

# Install Poetry dependency management tool, skip if already installed.
# Reference from: https://python-poetry.org/docs/#installation
curl -sSL https://install.python-poetry.org | python3 -

# Install the dependencies via poetry
poetry install

You can also install the dependencies via pip:

pip install -r requirements.txt

Mac users can install dependencies via run.sh:

sh ./run.sh

Other pre-model preparation

RVC needs some other pre-models for inference and training.

You can download these models from our Hugging Face space

Here is a list with the names of all the pre-models and other files needed for RVC:

hubert_base.pt

./pretrained 

./uvr5_weights

Additional downloads are required if you want to test the v2 version of the model.

./pretrained_v2 

If you are using Windows, you may need this file, but skip it if ffmpeg and ffprobe are already installed; ubuntu/debian users can install these two libraries via apt install ffmpeg, and Mac users can install ffmpeg via brew install ffmpeg (requires the pre-installed brew)

./ffmpeg

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffmpeg.exe

./ffprobe

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/ffprobe.exe

If you want to use the latest RMVPE vocal pitch extraction algorithm, then you need to download the pitch extraction model parameters and place them in the RVC root directory

https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt

After that, use the following command to start WebUI.

python infer-web.py

If you are using Windows or macOS, you can just download and unzip RVC-beta.7z and run go-web.bat to start WebUI for the former, or run the command sh . /run.sh to start WebUI.

There is also a White Easy Tutorial.doc in the repository for reference.

Reference projects

ContentVec
VITS
HIFIGAN
Gradio
FFmpeg
Ultimate Vocal Remover
audio-slicer
Vocal pitch extraction:RMVPE
- The pretrained model is trained and tested by yxlllc and RVC-Boss.

hinabl / rmvpe-ai-kaggle Goto Github PK

rmvpe-ai-kaggle's Introduction

Retrieval-based-Voice-Conversion-WebUI

Introduction

Environment Configuration

Other pre-model preparation

Reference projects

Thanks to all contributors for their efforts

rmvpe-ai-kaggle's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent