
revisiting-at's Issues

loading a pretrained checkpoint

Hello,

First of all - thanks for the great work!
After downloading a checkpoint from the provided link, I am trying to load it.
However, the load command fails:
torch.load('convnext_b_cvst_robust.pt', map_location='cpu')
and outputs the following error:
RuntimeError: Expected hasRecord("version") to be true, but got false.
Some more details: I use the torch version specified in the requirements, and I have tried several different checkpoints.
How can I resolve this?
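
As a sanity check, I also inspected the downloaded file itself with the snippet below (the file name is just the checkpoint from my command above). In my understanding this error often indicates a corrupted or incomplete download rather than a version mismatch, so this is only a guess at a diagnosis:

import os
import zipfile

path = 'convnext_b_cvst_robust.pt'
print('size in bytes:', os.path.getsize(path))         # a suspiciously small file suggests a bad download
print('valid zip archive:', zipfile.is_zipfile(path))  # recent torch checkpoints are zip archives
if zipfile.is_zipfile(path):
    with zipfile.ZipFile(path) as zf:
        # a healthy checkpoint should list data entries and a '.../version' record
        print(zf.namelist()[:10])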

Thanks!

RobustAcc for L∞=8/255 models

Could the authors publish the robust accuracy of the L∞=8/255 ImageNet models? I tried to evaluate the robust accuracy of these models myself, but the numbers I get are very low.
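
For reference, here is a minimal sketch of the kind of evaluation I am running with AutoAttack (https://github.com/fra31/auto-attack). The model builder, checkpoint handling, data path, and preprocessing below are my own assumptions, and I assume the released models take inputs in [0, 1] without extra normalization, so any of these may be where I go wrong:

import torch
import torchvision
import torchvision.transforms as T
from timm import create_model
from autoattack import AutoAttack

# Assumptions: a timm ConvNeXt-B backbone, the downloaded file is a plain state dict,
# and the model expects inputs in [0, 1] with no mean/std normalization outside it.
model = create_model('convnext_base', num_classes=1000)
model.load_state_dict(torch.load('convnext_b_cvst_robust.pt', map_location='cpu'))
model = model.eval().cuda()

tf = T.Compose([T.Resize(256), T.CenterCrop(224), T.ToTensor()])
val = torchvision.datasets.ImageFolder('/path/to/imagenet/val', transform=tf)
loader = torch.utils.data.DataLoader(val, batch_size=64, shuffle=False)
x_test, y_test = next(iter(loader))

adversary = AutoAttack(model, norm='Linf', eps=8/255, version='standard')
x_adv = adversary.run_standard_evaluation(x_test.cuda(), y_test.cuda(), bs=64)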

adversarial finetuning recipe for downstream datasets?

Could you please describe in more detail the training recipe used for adversarial finetuning on the downstream datasets (CIFAR-10, CIFAR-100, Flowers)? Which optimizer and augmentations are used? Is the adversarial training done in the same way as on ImageNet, or is the TRADES framework used?
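
To make the question concrete, this is the plain PGD adversarial-training step I am currently using for finetuning; the epsilon, step size, number of steps, and the train/eval handling are my own choices, which is exactly why I am asking about your recipe:

import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    # L-inf PGD with random start; x is assumed to live in [0, 1]
    delta = torch.empty_like(x).uniform_(-eps, eps).requires_grad_(True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad = torch.autograd.grad(loss, delta)[0]
        delta = (delta + alpha * grad.sign()).clamp(-eps, eps).detach().requires_grad_(True)
    return (x + delta).clamp(0, 1).detach()

def train_step(model, optimizer, x, y):
    # one finetuning step; model, optimizer, x, y come from my own training loop
    model.eval()                      # generate the attack in eval mode (my choice)
    x_adv = pgd_attack(model, x, y)
    model.train()
    loss = F.cross_entropy(model(x_adv), y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()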

imagenet1k and imagenet21k pre-train

Hi,
thank you for your nice work; it is really enlightening, especially the use of clean pre-trained checkpoints on ImageNet-1K and ImageNet-21K.

I found that you use:
elif modelname == 'vit_s':
    model = create_model('vit_small_patch16_224', pretrained=pretrained)

and
elif modelname == 'vit_s_21k':
    model = create_model('deit3_small_patch16_224_in21ft1k', pretrained=pretrained)

to load pre-trained checkpoints.

Do you mean that "vit_small_patch16_224" is the ImageNet-1K pretrained model? When I print model.default_cfg, it shows that the model is timm/vit_small_patch16_224.augreg_in21k_ft_in1k. So does it actually load a checkpoint pre-trained on ImageNet-21K?
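
For reference, this is how I checked (the exact tag shown may depend on the installed timm version):

import timm

m = timm.create_model('vit_small_patch16_224', pretrained=True)
print(m.default_cfg)  # in my environment this points to the augreg_in21k_ft_in1k weights

# list the pretrained variants timm knows for this architecture
print(timm.list_models('vit_small_patch16_224*', pretrained=True))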

Different results between Table 1 and Table 2

Thank you for the great work. I am just wondering whether you could explain the reason for the difference between Table 1 and Table 2. For example, in Table 1 the ViT-S performance is (60.3, 30.4), while in Table 2 it is (61.5, 31.8) with random initialization and basic augmentations. I think the difference is that the models in Table 1 are pretrained with standard training for 100 epochs, while Table 2 uses random initialization. But if that is the case, why is Table 1 worse than Table 2? Thank you very much!
