Giter VIP home page Giter VIP logo

ai-song-cover-sovits's Introduction

AI-Song-Cover-SOVITS

All in One Version : Youtube WAV Download, Separating Vocal, Splitting Audio, Training, and Inference Using Google Colab.

Leave A Star if This Repo Was Helpful

ko-fi Trakteer

Tutorial (Indonesian)

https://youtu.be/v5MwAqQTc6Q

Google Colab

Open In Colab

ai-song-cover-sovits's People

Contributors

ardha27 avatar dianemeee avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

ai-song-cover-sovits's Issues

inference

May I ask if you can use the mp3 or wav file you uploaded for inference?

#bukan issue,sekedar tips

Tips:

Pertama
Jika kalian melakukan Training, pasti akan sangat menggunakan storage yang sangat besar,kalian bisa menghapus file D_*.pth dan G_*.pth pada google drive kalian yang sudah lama dan tinggalkan D _*.pth dan G _*.pth yang terbaru.
PS: Kalian gak usah ngelakuin ini,karena setelah gw lihat ternyata setelah 3 kali wav epoch pasti file terlamanya bakal di hapus

Kedua
Jika ditengah-tengah training kalian mendapatkan Runtime Disconnect kalian bisa tetap melanjutkan training dengan cara memindahkan file so-vits-svc-fork ke google drive lain dan memulai melanjutkannya pada akun lain dengan catatan kalian harus membuat data set yang sama dari yang kalian pakai sebelumnya setelah itu kalian dapat melanjutkan proses training

Oh, Iya.Informasi yang agak gak penting sih.

Waktu kita training,wave epoch akan di perbarui sekitar 7-9 menit tergantung data set yang kalian pake.
Kalian akan menghabiskan waktu sekitar satu jam untuk training 240 epoch.
Jadi jika target epoch kalian 1000,maka kalian akan membutuhkan waktu sekitar 3 jam - 4 jam

Itu aja sih tips dari gw, mungkin kalo nemu lagi bakal gw taruh disini.
@ardha27 Mohon koreksinya

5. Inference

Mas, ini kenapa ya error bagian display(AUDIO())?

image

Model: https://huggingface.co/spaces/zomehwh/vits-models/blob/main/pretrained_models/alice/alice.pth (Blue Archive: Tendou Arisu)

Error logs:

[18:58:20] INFO     [18:58:20] Version: 3.14.1                    

---------------------------------------------------------------------------

ValueError                                Traceback (most recent call last)

[<ipython-input-20-26778c592e41>](https://localhost:8080/#) in <cell line: 12>()
     10 get_ipython().system('svc infer {AUDIO}.wav -m {MODEL} -na -t {PITCH}')
     11 # Try comment this line below if you got Runtime Error
---> 12 display(Audio(f"/content/{AUDIO}.out.wav", autoplay=True, rate=22050))

2 frames

[/usr/local/lib/python3.10/dist-packages/IPython/lib/display.py](https://localhost:8080/#) in _validate_and_normalize_with_numpy(data, normalize)
    157         waveobj = wave.open(fp,mode='wb')
    158         waveobj.setnchannels(nchan)
--> 159         waveobj.setframerate(rate)
    160         waveobj.setsampwidth(2)
    161         waveobj.setcomptype('NONE','NONE')

ValueError: could not convert string to float: '/content//content/separated/htdemucs/audio/vocals.out.wav'

Inference /command not found

Halo kak! I've been trying to use your tool to train my own voice and some artists since your latest video popped up on my tiktok's fyp. Jadi masalahnya muncul itu pas aku udah selesai train suaranya aku, terus mau ke tahap inference tapi awalnya kena disconnecting issue, kan. Nah katamu bisa pakai comman ctrl + /, tapi aku gak ngelakuin itu malah aku ngedisconnect dan hapus runtime. Karena aku pikir aku juga udah train suara aku tadinya dan filenya kesimpan di gdrive aku langsung aja, ke langkah 1, 2, dan 5 dan ini yang muncul kak.

Aku masukin yang model dan confignya folder trainnya dari GDrive tadi kak, kamu bisa liat sendiri di gambar bawah ini, ya.... tapi malah muncul yang kayak digambar, gimana ya kak? Aku bingung sendiri juga jadinya T_T

Atau apa kita tidak bisa kak pakai yang dari GDrive untuk tahap nomor 5? atau harus upload/buat clone di github?

image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.