Comments (4)
Thanks for your prompt reply, it's really helpful.
Sorry for bring up another question. Is it possible to share the training hyperparameters and the schedulers (I only found the optimizers in the repo) of all the models? (Batch size, learning rate,.. etc). The hyperparameters and the schedulers would be super helpful if I want to train the model based on the repo. Thanks!
from talking-face_pc-avs.
Hi, thank you for looking into the details of the code! Most previous work for facial image generation normally align each face according to the three points (As I have done in my previous paper https://github.com/Hangz-nju-cuhk/Talking-Face-Generation-DAVS). However, this will lead to a zoom-out-and-in artifact due to the affine transformation.
Thus in this work, I do not align the samples provided in VoxCeleb2 and choose to align the faces according to the average of all key points in other videos. The bias is to ensure that the face is almost at the center of the cropped frame but certain misalignment is allowed.
from talking-face_pc-avs.
Thank you for your reply! Your reply answered my question!
Sorry for another question about the training details. Since no preprocessed code is released. I have a question regarding it.
During the training stage, I'm wondering did you train the entire clip or did you sample a few number of frames in a clip and each time you random sample a pair of frames as the training data? Thanks.
from talking-face_pc-avs.
Hi, for each epoch we sample like 12 continuous frames for contrastive learning and among them, 4 are used for reconstruction training.
from talking-face_pc-avs.
Related Issues (20)
- stack expects a non-empty TensorList HOT 2
- 关于评价指标
- "Train your own model" Release date HOT 7
- Lip jitter(嘴唇频繁抖动)
- 找不到训练后的文件夹
- why embedding the audio features
- Does training procedure need any other py file or module that I have to make on my own?
- How to run the repo (inference) on CPU?
- Why start from 2?
- Code coming soon ? HOT 1
- ffmpeg: not found
- demo_id.csv missing HOT 1
- TypeError: mel() takes 0 positional arguments but 2 positional arguments (and 3 keyword-only arguments) were given HOT 1
- 推理结果人脸模糊无五官
- 已创建中文的讨论组想加入的请添加微信xaaheng
- can this project run in real-time?
- ./checkpoints/demo_id/latest_net_A.pth not exists yet! HOT 1
- where is mapping function?
- About mouth_source
- 我们创建了一个中文讨论组,有需要的加我微信douzijun1999
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from talking-face_pc-avs.