Giter VIP home page Giter VIP logo

Comments (4)

Yusics avatar Yusics commented on June 17, 2024 1

Thanks for your prompt reply, it's really helpful.
Sorry for bring up another question. Is it possible to share the training hyperparameters and the schedulers (I only found the optimizers in the repo) of all the models? (Batch size, learning rate,.. etc). The hyperparameters and the schedulers would be super helpful if I want to train the model based on the repo. Thanks!

from talking-face_pc-avs.

Hangz-nju-cuhk avatar Hangz-nju-cuhk commented on June 17, 2024

Hi, thank you for looking into the details of the code! Most previous work for facial image generation normally align each face according to the three points (As I have done in my previous paper https://github.com/Hangz-nju-cuhk/Talking-Face-Generation-DAVS). However, this will lead to a zoom-out-and-in artifact due to the affine transformation.

Thus in this work, I do not align the samples provided in VoxCeleb2 and choose to align the faces according to the average of all key points in other videos. The bias is to ensure that the face is almost at the center of the cropped frame but certain misalignment is allowed.

from talking-face_pc-avs.

Yusics avatar Yusics commented on June 17, 2024

Thank you for your reply! Your reply answered my question!

Sorry for another question about the training details. Since no preprocessed code is released. I have a question regarding it.
During the training stage, I'm wondering did you train the entire clip or did you sample a few number of frames in a clip and each time you random sample a pair of frames as the training data? Thanks.

from talking-face_pc-avs.

Hangz-nju-cuhk avatar Hangz-nju-cuhk commented on June 17, 2024

Hi, for each epoch we sample like 12 continuous frames for contrastive learning and among them, 4 are used for reconstruction training.

from talking-face_pc-avs.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.