ImaGINator: Conditional Spatio-Temporal GAN for Video Generation

Yaohui Wang, Piotr Bilinski, Francois Bremond and Antitza Dantcheva

Requirements

Python 3.6
CUDA 9.2
cuDNN 7.1
PyTorch 1.4+
scikit-video
tensorboard
moviepy
PyAV
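
Before running anything, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository: the import names (`torch`, `skvideo`, `av`) are the usual ones for these packages but are assumptions here, and `find_missing` is a hypothetical helper name.

```python
import importlib.util

# Package name -> import name. The import names are assumptions:
# scikit-video is normally imported as `skvideo` and PyAV as `av`.
REQUIRED_MODULES = {
    "PyTorch": "torch",
    "scikit-video": "skvideo",
    "tensorboard": "tensorboard",
    "moviepy": "moviepy",
    "PyAV": "av",
}


def find_missing(modules=REQUIRED_MODULES):
    """Return the package names whose import module cannot be located."""
    missing = []
    for package, module in modules.items():
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module) is None:
            missing.append(package)
    return missing


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```

This only checks that the modules can be found; it does not verify the Python, CUDA, cuDNN, or PyTorch versions.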

Dataset

You can download the original MUG dataset from https://mug.ee.auth.gr/fed/ and use https://github.com/1adrianb/face-alignment to crop the face regions. Alternatively, download our preprocessed version from here and save it under $DATA_PATH.

Pretrained model

Download the pretrained model on MUG from here and put it under ./pretrained.

Inference

Generate videos and save them under ./demos/mug:

python demo.py --dataset mug --model_path ./pretrained/mug.pth

Training

python train.py --data_path $DATA_PATH
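
The training command relies on $DATA_PATH pointing at the dataset directory. As a hedged sketch, the check below fails early if the variable is unset or does not point to a directory, and then builds the same command shown above. `build_train_command` is a hypothetical helper, not something the repository provides, and only the `--data_path` flag from this README is used.

```python
import os
import sys


def build_train_command(environ=os.environ):
    """Build the train.py command from DATA_PATH, failing early if it is unusable."""
    data_path = environ.get("DATA_PATH")
    if not data_path:
        raise ValueError("DATA_PATH is not set")
    if not os.path.isdir(data_path):
        raise FileNotFoundError(f"DATA_PATH is not a directory: {data_path}")
    # Only the --data_path flag documented above is passed; no other
    # train.py options are assumed here.
    return [sys.executable, "train.py", "--data_path", data_path]
```

Passing the returned list to `subprocess.run(..., check=True)` from the repository root would launch training with the current Python interpreter.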

Evaluation

To compute FID, please refer to G3AN.

Citation

If you find this code useful for your research, please consider citing our paper:

@InProceedings{WANG_2020_WACV,
  author = {WANG, Yaohui and Bilinski, Piotr and Bremond, Francois and Dantcheva, Antitza},
  title = {ImaGINator: Conditional Spatio-Temporal GAN for Video Generation},
  booktitle = {The IEEE Winter Conference on Applications of Computer Vision (WACV)},
  month = {March},
  year = {2020}
}

imaginator's Issues

About the source code and model

Hello, I have read your WACV 2020 paper and I'm very interested in your work. When will you upload the source code and models? I would like to know some details about them. Thanks!

Clarification Needed: Locating Transposed (1+2)D Convolution Implementation

Dear Yaohui Wang,

I found your article very innovative. Thank you for sharing the code as well! It was especially helpful.

I'm interested in learning more about the implementation of the Transposed (1+2)D Convolution concept. Could you point me to the specific part of the code where this is implemented?
