Giter VIP home page Giter VIP logo

bigroc's Introduction


BIGRoC: Boosting Image Generation via a Robust Classifier

Roy Ganz โ€ข Michael Elad

This repository contains code for the paper "BIGRoC: Boosting Image Generation via a Robust Classifier"

BIGRoC

BIGRoC: Boosting Image Generation via a Robust Classifier
Roy Ganz, Michael Elad

Abstract: The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means for training them. In this work, we propose a general model-agnostic technique for improving the image quality and the distribution fidelity of generated images, obtained by any generative model. Our method, termed BIGRoC (Boosting Image Generation via a Robust Classifier), is based on a post-processing procedure via the guidance of a given robust classifier and without a need for additional training of the generative model. Given a synthesized image, we propose to update it through projected gradient steps over the robust classifier, in an attempt to refine its recognition. We demonstrate this post-processing algorithm on various image synthesis methods and show a significant improvement of the generated images, both quantitatively and qualitatively, on CIFAR-10 and ImageNet. Specifically, BIGRoC improves the image synthesis state of the art on ImageNet 128x128 by 14.81%, attaining an FID score of 2.53 and on 256x256 by 7.87%, achieving an FID of 3.63.

Citation

Ganz, Roy, and Michael Elad. "BIGRoC: Boosting Image Generation via a Robust Classifier." arXiv preprint arXiv:2108.03702 (2021).

@article{DBLP:journals/corr/abs-2108-03702,
author    = {Roy Ganz and Michael Elad},
title     = {BIGRoC: Boosting Image Generation via a Robust Classifier},
journal   = {CoRR},
year      = {2021}
}

Preprint on ArXiv: 2108.03702

Prerequisites

The entire code is contained in Colab jupiter notebook to facilitate the environment installation. Each such notebook is self-contained with the relevant package installation and explanations.

Repository Organization

File name Content
/CIFAR10 Notebooks for Section 5.1 - experimenting BIGRoC on CIFAR-10 image generators, both conditional and unconditional
/ImageNet Notebooks for Section 5.2 - experimenting BIGRoC on ImageNet 128x128 & 256x256 image generators, both conditional and unconditional
/Comparison Notebooks for Section 5.3 - experimenting BIGRoC on CIFAR-10 and ImageNet using SN-ResNetGAN

Credits

  • Robustness Package - Code.
  • FID is calculated natively in PyTorch using Seitzer implementation - Code
  • Mimicry - Code, Paper
  • Guided-Diffusion - Code, Paper

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.