Light

facebookresearch / nasvit Goto Github PK

View Code? Open in Web Editor NEW

65.0 6.0 3.0 51 KB

code for NASViT

License: Other

Python 100.00%

nasvit's Introduction

Summary

This repo is the official implementation of our ICLR2022 paper "NASVIT". It currently includes the training/ eval code and a pretrained supernet checkpoint on ImageNet.

Training

python -m torch.distributed.launch --nproc_per_node=1 --master_port=1024 main.py --cfg configs/cfg.yaml --amp-opt-level O0 --accumulation-steps 1 --batch-size 64

Search

Evolutionary search is done on a subsampled data set. Specifically, we randomly select five images for each category from the original ImageNet training set and treat them as our validation set.

checkpoint

ImageNet Accuracy (val)

Model	Accuracy top-1	Accuracy top-5
Smallest	78.34	93.46
Largest	82.79	96.00

License

The majority of NASViT is licensed under CC-BY-NC, however portions of the project are available under separate license terms: pytorch-image-models (Timm) is licensed under the Apache 2.0 license; Swin-Transformer is licensed under the MIT license.

Contributing

We actively welcome your pull requests! Please see CONTRIBUTING and CODE_OF_CONDUCT for more info.

nasvit's People

Contributors

Stargazers

Watchers

Forkers

pugangqiang zhanzheng8585 alindkhare

nasvit's Issues

Question about amp

Hi ! Thanks for the excellent work! I am trying to use the Constraint_opt to train my model, I am curious about is Constraint_opt work well with the amp, shall I make any modification?

Is there dynamic network original VIT?

Hi, thanks for the great work! I want to make some changes to the supernet, but I found the dynamic network is not for an original VIT. So I wonder whether there is a code for original VIT.

How do we test the model?

Hi author,
Looks only training code is released?

Implementation of gradient projection

After skimming through the code, I cannot find the relevant part of gradient projection as Eq. 1 in the paper.

If it's my carelessness, could you please help me figure it out?

Provide documentation for code

Hello,
I have been following the work related to using NAS for Vision Transformers. I am reading your approach from the NASViT paper, it would be very helpful if you could provide a documentation for your module and add descriptive comments in the code.

Release of trained checkpoint

Hi,
Thank you for your work!
I am trying to reproduce some of the results mentioned in the paper, could you please share the NASVIT (A0-A5) checkpoints as well?

experiment on detection task?

Hi, will NasVit add detection result?

Search space design

I wonder if the the authors had any experiments conducted using this work on different search spaces apart from the NASViT, such as ResNet or others.

Could you please clarify this?

Thank you

Porblems with calculating flops

Hi, when I want to use the function

compute_active_subnet_flops()

in the model class

attentive_nas_dynamic_model.py

It will output an error said that NoneType has no active_expand_ratio. I guess the problem is this is desgined for mb and I did not find the logic of logging flops of transformer block.

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.