imankgoyal / nondeepnetworks Goto Github PK

Official Code for "Non-deep Networks"

License: BSD 3-Clause "New" or "Revised" License

Shell 1.66% Python 98.34%

nondeepnetworks's Introduction

Non-deep Networks
NeurIPS 2022
Ankit Goyal, Alexey Bochkovskiy, Jia Deng, Vladlen Koltun

Overview: Depth is the hallmark of DNNs. But more depth means more sequential computation and higher latency. This begs the question -- is it possible to build high-performing ``non-deep" neural networks? We show that it is. We show, for the first time, that a network with a depth of just 12 can achieve top-1 accuracy over 80% on ImageNet, 96% on CIFAR10, and 81% on CIFAR100. We also show that a network with a low-depth (12) backbone can achieve an AP of 48% on MS-COCO.

If you find our work useful, please consider citing it:

@article{goyal2021nondeep,
  title={Non-deep Networks},
  author={Goyal, Ankit and Bochkovskiy, Alexey and Deng, Jia and Koltun, Vladlen},
  journal={arXiv:2110.07641},
  year={2021}
}

nondeepnetworks's People

Contributors

Stargazers

Watchers

nondeepnetworks's Issues

When will the code be released?

I am very interested in your work and would like to further study. I hope you can release the code as soon as possible in your busy schedule. Thank you！

question

Can you provide some cfg files and annotation information? The annotation information and images always don't match!

Problem about replacing the backbone of YOLOv4 with ParNet.

Which three layers' outputs for neck fusion when the three stream is parallel.
Those three ones?
Thanks a lot！

Can the network be used in change detection tasks

could you provide finetune model?

I want to finetune the model for other classification task,could you provide a finetune model?

fusion module, accuracy about cifar100

what is your shuffle code in your fusion module?
what is your model architecture in cifar-100? I just changed front two downsample modules based on the ParNet for Imagenet in the paper. But the accuracy is lower. And How do you set the LR, MILESTONES and NUM_EPOCH to meet high accuracy?

How model parallelize across GPUs?

Could you introduce more details in parallelizing across GPUs, like how to implement through PyTorch.

When will the code be released? Thanks.

Could you please provide the code for the CIFAR 10/100 section?

关于代码版本的问题（或者说是开发环境）

您好，非常有幸阅读了贵团队的论文，感谢作者们作出的贡献，这篇文章对我有很大的启发，就是不知道会不会发布pytorch版本的？

Will I be able to see the code at the end of 2021？

Will I be able to see the code at the end of 2021？
I am very interested in your work, I have waited from October until now, hoping to open source，^_^

what is the meaning of 'Shuffle' of fusion block in Fig. A1?

Hello. Thank you for your great study. I wonder the meaning of 'Shuffle' of fusion block in Fig. A1.
Is it pixel shuffle layer?
Please let me know the meaning of that.

Thank you.

Really faster than ResNet? I am very confused

Hello, my friend, appreciate for your great work! I have tested the code on https://github.com/Pritam-N/ParNet by Pritam-N and change the ResNet code in my model by using your ParNet , but the actual time is quite slow than the paper said. My block size is [64, 128, 256, 512, 2048], and the time of "forward()" is more than 5s average while the Resnet is 0.02s in my device. I have use the time function for every line in the forward(), find that the encode stuff is the main reason. I continue write time.perf_counter() in the encode stuff, find that the "self.stream2_fusion" and "self.stream3_fusion" is the most time user. Do you know why ?

when will the code of the model be released?

I am very interested in your research, when will the code of the model be released? I saw on October 23rd that you said it would be released in 4 weeks

How's speed comparasion on batch size 64 or input resolution up to 800w input?

when will the code be released? thx~

Question about SSE module

Hi. Figure 2b shows that there's one 1x1conv in a branch of SSE, how to match the channel of output by 1x1conv with the channel of input after shortcut? If I set the output channel of 1x1conv the same as input, the channels of the outputs by RepVGG block and SSE will not match.

imankgoyal / nondeepnetworks Goto Github PK

nondeepnetworks's Introduction

nondeepnetworks's People

Contributors

Stargazers

Watchers

Forkers

nondeepnetworks's Issues

Recommend Projects

Recommend Topics

Recommend Org