Giter VIP home page Giter VIP logo

Comments (6)

prajwalkr avatar prajwalkr commented on July 24, 2024

Did you try once with the non-GAN model? It performs better on silences

from wav2lip.

bigboss97 avatar bigboss97 commented on July 24, 2024

Did you try once with the non-GAN model? It performs better on silences

I'm not familiar with that. How can I do that in colab?

from wav2lip.

prajwalkr avatar prajwalkr commented on July 24, 2024

Just download the corresponding checkpoint and pass it as an argument in the same inference command.

from wav2lip.

HusainKapadia avatar HusainKapadia commented on July 24, 2024

Just a suggestion...perhaps to improve performance on silence you can add some simulated data to the training data set, where:

  1. The audio inputs would be silence or add some low DB noise to it (to avoid Mel bands being NaN)
  2. The corresponding video could be created with an image of any random person with a neutral expression (of course the length's of the audio and video should match)
  3. Train the model with this extra data along with the LSR2 dataset specified in the README. Or you could fine tune the model with the pre-trained weights.
  4. Run inference with new model.

PS: this is not something I have tried yet, but if this is a limitation of the existing model, this could be one way of improving it, I believe
Would you agree @prajwalkr ?

from wav2lip.

prajwalkr avatar prajwalkr commented on July 24, 2024

Maybe it can help. Maybe using some short range temporal info in the generator might help even better. We are yet to experiment more on this.

from wav2lip.

prajwalkr avatar prajwalkr commented on July 24, 2024

Re-open if needed.

from wav2lip.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.