
Comments (3)

CavidanZ avatar CavidanZ commented on June 3, 2024

Hello. Not sure if a reply this late will help, but what you describe is simply transfer learning. Take your first model's checkpoint, pass it as the pre_trained model, and warm start from that point. The model then learns the new speaker's voice while still benefiting from the previous training.

from vits.

ToiYeuTien avatar ToiYeuTien commented on June 3, 2024

Hello, thank you for your response.
So, as I understand it, to fine-tune the model this way I just need to put the new speaker's audio files and metadata where the old dataset was and continue training, right? I would appreciate your feedback!


CavidanZ avatar CavidanZ commented on June 3, 2024

Yes. I have done it with the Tacotron 2 model, and it definitely works. Basically, you train just as you did the first time: get your audio dataset ready, and give the model the new audio files and their transcriptions. There are three things to watch:

  1. The only real difference is that you choose your previously trained model's checkpoint as the pre-trained model.
  2. Turn on warm starting. It should be something like a parameter in hparams that you set to True.
  3. One more thing: do not change the original batch size. At least in Tacotron 2, whenever I changed the batch size it printed errors.
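
To make the warm-start step concrete, here is a minimal sketch of what it amounts to, written with plain Python dicts so it runs without any framework. It is not the code from vits or Tacotron 2: the function name `warm_start`, the `ignore_layers` argument, and the layer names are all made up for illustration. In PyTorch the two dicts would come from the saved checkpoint and from `model.state_dict()`, and the result would be passed to `model.load_state_dict()`.

```python
def warm_start(checkpoint_state, fresh_state, ignore_layers=()):
    """Merge old checkpoint weights into a freshly initialised model.

    checkpoint_state: layer name -> weights from the previously trained model
    fresh_state:      layer name -> weights of the new, untrained model
    ignore_layers:    layer names that should keep their fresh initialisation
    (all three are hypothetical stand-ins for real state dicts)
    """
    ignored = set(ignore_layers)
    merged = dict(fresh_state)  # start from the new model's own weights
    for name, weights in checkpoint_state.items():
        if name in ignored or name not in fresh_state:
            continue  # skip layers we chose to reset or that no longer exist
        merged[name] = weights  # reuse what the first training run learned
    return merged


# Toy example: the embedding changes size for the new data, so it is ignored.
old = {
    "encoder.weight": [0.5, -0.2],
    "decoder.weight": [1.0, 1.5],
    "embedding.weight": [9.0],
}
new = {
    "encoder.weight": [0.0, 0.0],
    "decoder.weight": [0.0, 0.0],
    "embedding.weight": [0.0, 0.0, 0.0],
}

start = warm_start(old, new, ignore_layers=["embedding.weight"])
print(start)
```

The encoder and decoder start from the old checkpoint while the ignored layer keeps its fresh values. Optimizer state and the step counter are deliberately left out here, on the assumption that a warm start reuses only the weights; that is the usual difference from simply resuming an interrupted run.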
