Giter VIP home page Giter VIP logo

issa's Introduction

Intra-Source Style Augmentation for Improved Domain Generalization (ISSA)

Official PyTorch implementation of the WACV 2023 paper "Intra-Source Style Augmentation for Improved Domain Generalization". This repository provides the minimal code snippets of the masked noise encoder for GAN inversion.

🔥 Updates: "Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization" has been accepted at International Journal of Computer Vision (IJCV)! We extended our WACV paper and add more applications, e.g., utilzing stylized data for assessing domain generalization performance. Please check it out and reach out in case of any questions!

arXiv arXiv Static Badge

overview
teaser
extra-style

Getting Started

The code is tested for Python 3.9. ISSA conda environment can be created via

conda env create --file environment.yml
source activate issa

Training

Note: please read how-to.pdf for more detailed instruction. After proper path configuration in configs/mne_training.yml, run the command below for training the encoder

python train_encoder.py

Some important paths need to be adjusted in the configuration file:

  • data: path to the real data
  • data_fake: path to the GAN generated data, where the corresponding w latents are stored in the same path
  • pkl_dir: path to the pretrained GAN model

Inference

For inference, please refer to the code snippets here, which shows how the Encoder & Generator are used for image generation.

Citation

If you use this code please cite

@inproceedings{li2023intra,
  title={Intra-Source Style Augmentation for Improved Domain Generalization},
  author={Li, Yumeng and Zhang, Dan and Keuper, Margret and Khoreva, Anna},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  pages={509--519},
  year={2023}
}

@article{li2023extra,
  title={Intra-\& extra-source exemplar-based style synthesis for improved domain generalization},
  author={Li, Yumeng and Zhang, Dan and Keuper, Margret and Khoreva, Anna},
  journal={International Journal of Computer Vision},
  pages={1--20},
  year={2023},
  publisher={Springer}
}

License

This project is open-sourced under the AGPL-3.0 license. See the LICENSE file for details.

For a list of other open source components included in this project, see the file 3rd-party-licenses.txt.

Purpose of the project

This software is a research prototype, solely developed for and published as part of the publication cited above. It will neither be maintained nor monitored in any way.

Contact

Please feel free to open an issue or contact personally if you have questions, need help, or need explanations. Don't hesitate to write an email to the following email address: [email protected]

issa's People

Contributors

yumengli007 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

issa's Issues

Completely don't know how to use

how to train and inference?Nearly no instructions?
And how to prepare the pretrained model of GAN generator(the .pkl file)?

Request to upload modified "networks_stylegan2.py" code

There is a long list of complicated instruction of how to copy and edit code from other repositories before being able to use this one.
For the sake of self containment, is it possible to upload your edits of all necessary files to serve as a single working example?

I believe I'm speaking of behalf of all potential users when I say that although we all appreciate the long list of detailed instructions in how-to.pdf, for 99.9% of potential users it will be much simpler to just upload your own edits of the original stylegan2 code and have all the necessary code in this single repository to use the basic functionality demonstrated in your paper.
This way, the 99.9% of users will use your code, and the remaining 0.1% of users will use the instructions for their own custom needs.

模型推理

请问,可以直接用weights下的权重直接推理吗?推理的命令是什么呢?
config文件里有很多个文件路径,分别应该填写什么地址呢?
十分感谢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.