Giter VIP home page Giter VIP logo

nasvit's Introduction

Summary

This repo is the official implementation of our ICLR2022 paper "NASVIT". It currently includes the training/ eval code and a pretrained supernet checkpoint on ImageNet.

Training

python -m torch.distributed.launch --nproc_per_node=1 --master_port=1024 main.py --cfg configs/cfg.yaml --amp-opt-level O0 --accumulation-steps 1 --batch-size 64

Search

Evolutionary search is done on a subsampled data set. Specifically, we randomly select five images for each category from the original ImageNet training set and treat them as our validation set.

checkpoint

Download

ImageNet Accuracy (val)

Model Accuracy top-1 Accuracy top-5
Smallest 78.34 93.46
Largest 82.79 96.00

License

The majority of NASViT is licensed under CC-BY-NC, however portions of the project are available under separate license terms: pytorch-image-models (Timm) is licensed under the Apache 2.0 license; Swin-Transformer is licensed under the MIT license.

Contributing

We actively welcome your pull requests! Please see CONTRIBUTING and CODE_OF_CONDUCT for more info.

nasvit's People

Contributors

dilinwang820 avatar

Stargazers

Kevin Soong avatar AI_amateur avatar  avatar  avatar HAOJIE CHANG avatar  avatar shine avatar Zhenhan Huang avatar  avatar nafeng avatar Mouxiao Huang avatar Ren Tianhe avatar Zizheng Pan avatar Chakkrit Termritthikun avatar Yuhong Li avatar Joel Lee avatar Dayoung Kil avatar cc avatar Dixi Yao avatar hongyuan_yu avatar Sai Gopal Reddy Kovvuri avatar  avatar  avatar Muhammad Junaid Ali avatar  avatar SherwoodZheng avatar  avatar Xingchen Wan avatar wmkai avatar  avatar Xinyu Liu avatar Peyton avatar  avatar frick huang avatar JZhou avatar  avatar Fang Chen (谌放) avatar Jinnian Zhang avatar Abd Shomad avatar Zhixiong Yue avatar Rumen Dangovski avatar ChenFeynman avatar xuke avatar Hongyeob Kim avatar Derrick avatar Mike avatar yongjie.chen avatar Phan Hoang avatar MagicSource avatar Bklat avatar Peng avatar  avatar guoht avatar Roger GOU avatar Rui Huang avatar  avatar 爱可可-爱生活 avatar  avatar Kento Nozawa avatar  avatar Swall0w avatar Aryan Shekarlaban avatar QiQi avatar Christoph Reich avatar  avatar

Watchers

Mike avatar  avatar Cami Williams avatar  avatar Arun Sathiya avatar  avatar

nasvit's Issues

Question about amp

Hi ! Thanks for the excellent work! I am trying to use the Constraint_opt to train my model, I am curious about is Constraint_opt work well with the amp, shall I make any modification?

Is there dynamic network original VIT?

Hi, thanks for the great work! I want to make some changes to the supernet, but I found the dynamic network is not for an original VIT. So I wonder whether there is a code for original VIT.

Implementation of gradient projection

After skimming through the code, I cannot find the relevant part of gradient projection as Eq. 1 in the paper.

If it's my carelessness, could you please help me figure it out?

Provide documentation for code

Hello,
I have been following the work related to using NAS for Vision Transformers. I am reading your approach from the NASViT paper, it would be very helpful if you could provide a documentation for your module and add descriptive comments in the code.

Release of trained checkpoint

Hi,
Thank you for your work!
I am trying to reproduce some of the results mentioned in the paper, could you please share the NASVIT (A0-A5) checkpoints as well?

Search space design

I wonder if the the authors had any experiments conducted using this work on different search spaces apart from the NASViT, such as ResNet or others.

Could you please clarify this?

Thank you

Porblems with calculating flops

Hi, when I want to use the function

compute_active_subnet_flops() 

in the model class

attentive_nas_dynamic_model.py

It will output an error said that NoneType has no active_expand_ratio. I guess the problem is this is desgined for mb and I did not find the logic of logging flops of transformer block.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.