
backdoorbench's People

Contributors

chr-ray, computervisio6, juliecarlon, mdzhangst, sclbd, shawkui, zihao-ai


backdoorbench's Issues

bpp attack source

Thank you for your great work!
I would like to know why the file "cifar10_preactresnet18_bpp_0_1.zip" on OneDrive, which you are offering, contains far more backdoored training samples than expected (more than num_total_train_data * poisoning_ratio), and what the role of the cross_test_data folder is (the amount of cross_test_data equals num_total_train_data * poisoning_ratio). Which data should I use to train the poisoned model?
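
For reference, a quick sanity check of the expected poisoned-sample count, assuming CIFAR-10's 50,000 training images and a 0.1 ratio inferred from the "0_1" suffix in the file name (both assumptions on my part):

# Hypothetical sanity check of the expected number of poisoned training samples.
num_total_train_data = 50000  # CIFAR-10 training-set size
poisoning_ratio = 0.1         # inferred from the "0_1" in the file name
print(int(num_total_train_data * poisoning_ratio))  # 5000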

concern about backdoor_generate_pindex.py

First, thanks for this benchmark, which helped me reproduce baseline experimental results quickly. However, I have a concern about the code in backdoor_generate_pindex.py. Specifically, at https://github.com/SCLBD/BackdoorBench/blob/main/utils/backdoor_generate_pindex.py#L59, we can see:

non_zero_array = np.random.choice(np.where(targets == tlabel)[0], round(pratio * len(targets)), replace = False)

It seems that non_zero_array is selected from samples that already carry the target label; perhaps it should use targets != tlabel instead.
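
A minimal sketch of the suggested change, with toy labels standing in for the real targets (hypothetical values):

import numpy as np

np.random.seed(0)
targets = np.random.randint(0, 10, size=1000)  # toy 10-class labels
tlabel, pratio = 0, 0.05

# Current behavior: candidates come from samples whose label IS the target label.
same_label = np.random.choice(np.where(targets == tlabel)[0],
                              round(pratio * len(targets)), replace=False)

# Suggested for a dirty-label attack: samples whose label is NOT the target.
diff_label = np.random.choice(np.where(targets != tlabel)[0],
                              round(pratio * len(targets)), replace=False)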

function definition missing in the T-SNE visualization

Hi!
The definition of get_features seems to be missing, i.e.,

features, labels = get_features(args, model_attack, target_layer, data_loader)
NameError: name 'get_features' is not defined

Could you provide some suggestions on this issue?

Thanks a lot.
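
As a workaround until the helper is restored, something along these lines could stand in for get_features; the signature mirrors the call above, and the use of args.device and the loader yielding extra fields per batch are my assumptions:

import torch

def get_features(args, model, target_layer, data_loader):
    # Hypothetical stand-in: capture activations of `target_layer` with a forward hook.
    feats, labels = [], []
    hook = target_layer.register_forward_hook(
        lambda mod, inp, out: feats.append(out.detach().flatten(1).cpu()))
    model.eval()
    with torch.no_grad():
        for x, y, *rest in data_loader:  # BackdoorBench loaders may yield extra fields
            model(x.to(args.device))
            labels.append(y)
    hook.remove()
    return torch.cat(feats).numpy(), torch.cat(labels).numpy()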

missing resource

Hi there, I notice that attacks like LC and SSBA need files in ./resource. Could you please upload the resource folder via Google Drive or Baidu disk? Thank you!

Backdoor OneDrive Access Unavailable

Hi, I'm trying to use the BackdoorBench folder of backdoored models to benchmark my defense mechanism for publication, but I cannot access the OneDrive. I get a message saying the link has expired and I do not have access.

Cannot generate poison data in SSBA algorithm

I really appreciate your work.
Excuse me, I cannot generate poisoned data with ssba.py ("--attack_train_replace_imgs_path" and "--attack_test_replace_imgs_path" should receive the paths for the poisoned training data and poisoned testing data, respectively).
We've already installed the packages in requirement.txt file.
We generated the poisoned data following the command in the README, but the execution reported an error at step 3.
I want to ask how I should solve this problem. Thanks in advance!

About Missing Clean Model for TrojanNN Attack Implementation

Hi BackdoorBench Team,

Firstly, I want to express my admiration for your project. Thank you very much for putting in the effort to design and make this!

However, I encountered an issue while attempting to implement the TrojanNN attack method based on your code. It appears that a 'clean_model.pth' is required within the 'resource' directory for the operation to proceed successfully. However, after a thorough review of both the project documentation and the README file, I couldn't find any mention or inclusion of this essential file. I'm concerned that I might have overlooked something, or perhaps the file is indeed missing from the repository.

Could you please shed some light on this matter? Any guidance you can offer or provision of the missing file would be greatly appreciated and would significantly aid in my understanding and application of your work.

Thank you in advance for your time and assistance!

Results for CLP - Channel Lipschitzness based Pruning

I see this repo has just added a new defense method CLP - Channel Lipschitzness-based Pruning.

Can you update the Leaderboard to include the results of CLP, and the README file to document its API?

Thank you for such comprehensive work.

Poison Index Selection - Clean label case

Hi BackdoorBench Team,

I think I have found a small issue with the backdoor poison index selection method. In the case where a clean-label attack is used, the index selection errors out in some cases. I believe the issue is caused by pratio not being multiplied by the correct length. The issue is on line 59 of "backdoor_generate_poison_index.py." I have provided a code snippet below showing the issue and how I think it should be fixed. Notice that pratio should be multiplied by len(np.where(targets == tlabel)[0]) rather than len(targets).

# Current (line 59): pratio is scaled by the full dataset size
non_zero_array = np.random.choice(np.where(targets == tlabel)[0], round(pratio * len(targets)), replace = False)

# Proposed: pratio is scaled by the number of target-label candidates
non_zero_array = np.random.choice(np.where(targets == tlabel)[0], round(pratio * len(np.where(targets == tlabel)[0])), replace = False)
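
A toy illustration of why the original line can fail in the clean-label case (hypothetical numbers):

import numpy as np

np.random.seed(0)
targets = np.random.randint(0, 10, size=1000)  # roughly 100 samples per class
tlabel, pratio = 0, 0.2
candidates = np.where(targets == tlabel)[0]

# Original: asks for round(0.2 * 1000) = 200 samples from ~100 candidates,
# so np.random.choice(..., replace=False) raises a ValueError.
# Fixed: scale pratio by the candidate pool instead.
picked = np.random.choice(candidates, round(pratio * len(candidates)), replace=False)
print(len(candidates), len(picked))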

A resource of `lc_attack` is missing

Hi,
It seems that a resource of lc_attack is missing:
FileNotFoundError: [Errno 2] No such file or directory: '../resource/label-consistent/data/adv_dataset/cifar10_train.npy'
Did I forget to download it?
Thanks!

RuntimeError: DataLoader worker (pid(s) 35464) exited unexpectedly

Hello, thank you for the thorough summary you provided. I encountered a problem while executing the following command:

python ./attack/badnet.py --yaml_path ../config/attack/prototype/cifar10.yaml --patch_mask_path ../resource/badnet/trigger_image.png


It seems there might be an infinite loop in the __getattr__ method within bd_dataset_v2.py. I searched online for a solution and found that reducing num_workers could help; I have already reduced it to 1, but the problem persists. Could you suggest any other possible solutions?
Thanks!
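
One more debugging step worth trying (a sketch, not a confirmed fix): set num_workers=0 so loading runs in the main process and the underlying exception surfaces directly instead of killing a worker.

import torch
from torch.utils.data import DataLoader, TensorDataset

# Placeholder dataset; substitute the BackdoorBench dataset that fails here.
dataset = TensorDataset(torch.randn(256, 3, 32, 32), torch.zeros(256, dtype=torch.long))

# num_workers=0 keeps loading in the main process, so an exception raised in
# __getitem__ (or a recursive __getattr__) is reported directly.
loader = DataLoader(dataset, batch_size=128, shuffle=True, num_workers=0)
next(iter(loader))  # one batch is usually enough to reproduce the failure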


Question for the implementation of Neural cleanse(NC)

On page 16 of the BackdoorBench NeurIPS 2022 paper, I noticed that NC is introduced as a method that first detects the backdoored model and then mitigates it through pruning based on the reverse-engineered trigger. However, in the code in this repo, I find that you perform unlearning instead of pruning. Is that a common choice in implementations of NC?

Errors when running tiny-imagenet

I really appreciate your work.
I encountered the following error when running all attacks on tiny-imagenet.

INFO:root:Reminder: plz note that if p_num or pratio exceed the number of possible candidate samples
 then only maximum number of samples will be applied
2022-10-24:21:36:35 [INFO    ] [backdoor_generate_pindex.py:35] Reminder: plz note that if p_num or pratio exceed the number of possible candidate samples
 then only maximum number of samples will be applied
INFO:root:Reminder: priority p_num > pratio, and choosing fix number of sample is prefered if possible 
2022-10-24:21:36:35 [INFO    ] [backdoor_generate_pindex.py:36] Reminder: priority p_num > pratio, and choosing fix number of sample is prefered if possible 
INFO:root:poison num:0.0,real pratio:0.0
2022-10-24:21:36:35 [INFO    ] [backdoor_generate_pindex.py:61] poison num:0.0,real pratio:0.0
No poison sample generated !

I want to ask how I should solve this problem. The current pratio is 0.1; do I need to decrease it to 0.01?
Thanks in advance!
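
Before lowering pratio, it may help to check how many candidate samples exist for the configured target label; if the candidate pool is empty (e.g. a label mismatch on tiny-imagenet), any ratio rounds to zero. A hypothetical check with toy labels:

import numpy as np

# Substitute the real training labels and attack target (toy values shown).
targets = np.random.randint(0, 200, size=100000)  # tiny-imagenet-like label space
tlabel, pratio = 0, 0.1

candidates = np.where(targets == tlabel)[0]
print(len(candidates))                  # 0 here would explain "poison num: 0.0"
print(round(pratio * len(candidates)))  # resulting poison count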

Implementation issue with ANP

Hi BackdoorBench Team,

Firstly, this tool is amazing. Thank you very much for putting in the effort to design and make this.

I have found an issue with the implementation of the ANP method. In both the evaluate_by_threshold and evaluate_by_number methods, the test ASR value and the ACC difference are used to determine which threshold value is chosen. However, the test ASR would not be accessible to the defender when performing pruning, as the paper does not assume that backdoor data is available. Could you please clarify whether this is an issue or whether I am missing something?

Thanks :)

Errors when we execute shell script

Hi! Excuse me, when we execute the shell script 'scp_data.sh', we can't download the four datasets because we can't connect to the server 10.20.12.241:22 (screenshot omitted). Thank you!

Results for Tiny Imagenet

Thanks for your great work! I notice that you mentioned a public leaderboard for evaluating backdoor attacks against different defense methods. However, I can't find the results for tiny-imagenet. Could you please update the results for tiny-imagenet? Much appreciated!

Imagenet dataset training problem

Hello, thanks for your excellent work, but I ran into a problem when training on the ImageNet dataset: the process was killed during pre-processing, while the backdoor dataset was being created. I think it may be caused by a RAM limitation. Can you help me solve it?

How to implement 'all-to-all' attack settings?

Many thanks for your excellent work! 😁

Now I have a question: I used the following setting in the Blended attack.

attack_label_trans: all2all
attack_label_shift_amount: 1
num_classes: 10

Then I ran into the following error:

Traceback (most recent call last):
  File "./attack/laser_attack.py", line 347, in <module>
    main()
  File "./attack/laser_attack.py", line 212, in main
    adv_train_ds = prepro_cls_DatasetBD(
  File "./backdoorbench/attack/../utils/bd_dataset.py", line 63, in __init__
    assert len(poison_idx) == len(full_dataset_without_transform)
AssertionError

Did I do something wrong? Looking forward to your reply.
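
For reference, an all-to-all label transform is typically just a modular shift of the original label; a minimal sketch under the config above (attack_label_shift_amount=1, num_classes=10):

import torch

def all2all_shift(targets, shift_amount=1, num_classes=10):
    # Each class c maps to (c + shift_amount) mod num_classes.
    return torch.remainder(targets + shift_amount, num_classes)

print(all2all_shift(torch.tensor([0, 1, 9])))  # tensor([1, 2, 0])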

A data loading problem of LC attack

There are some problems with the data loading of Label Consistent in v2.0.
In craft_adv_dataset.py, the code that generates the poisoned data for the LC attack, the images and labels are saved together in npz format, yet lines 89 and 95 of Label Consistent read npy files by default.

craft_adv_dataset.py saves the images and labels together in npz format:

adv_data_path = os.path.join(
    config["adv_dataset_dir"], f"{args.dataset}_test.npz"
)
np.savez(adv_data_path, data=perturbed_img.numpy(), targets=target.numpy())

Label Consistent.py reads npy files by default:
if ('attack_train_replace_imgs_path' not in args.__dict__) or (args.attack_train_replace_imgs_path is None):
    args.attack_train_replace_imgs_path = f"../resource/label-consistent/data/adv_dataset/{args.dataset}_train.npy"
    logging.info(
        f"args.attack_train_replace_imgs_path does not found, so = {args.attack_train_replace_imgs_path}")

How should we run a label-consistent attack?

My command is python ./attack/lc_attack.py --yaml_path ../config/attack/lc/cifar10.yaml --dataset cifar10 --dataset_path --save_folder_name xxx --model resnet18
and the run fails with an error (screenshot omitted).

And I checked your Git repository; it does not have that file either. So how should we train the LC attack?

Thank you!

Bugs in WaNet All2All Attack

Hi,

I think that in the WaNet All2All attack case, the code that calculates targets_bd is wrong.

if args.attack_label_trans == "all2all":
    targets_bd = torch.remainder(targets, args.num_classes)

Reference code from WaNet's GitHub:

if opt.attack_mode == "all2all":
    targets_bd = torch.remainder(targets[:num_bd] + 1, opt.num_classes)

https://github.com/VinAIResearch/Warping-based_Backdoor_Attack-release/blob/94453080f241053ac7c8cc4717da20806ee17e5c/train.py#L82-L83
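
The difference is easy to verify: without the +1 shift, torch.remainder leaves in-range labels unchanged, so the "backdoor" targets equal the clean targets.

import torch

targets = torch.tensor([0, 3, 9])
print(torch.remainder(targets, 10))      # tensor([0, 3, 9]) -- labels unchanged
print(torch.remainder(targets + 1, 10))  # tensor([1, 4, 0]) -- shifted, as in WaNet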

Leaderboard data (ASR of Blended against NAD)

Thank you for sharing your work with the community and for all that you do. Your project has made a difference in the lives of many people, and for that, I am truly grateful.
However, I have some concerns regarding the accuracy of the results displayed on the leaderboard. Specifically, I noticed that the success rate of the classic Blended attack against defense methods such as NAD/NC appears to be too high. After attempting to replicate the NAD defense against Blended attacks, I found that the results were not as good as those reported in the original paper, but also not as bad as those presented on the leaderboard.
Can you review the accuracy of the results on the leaderboard to ensure they are reflective of the true performance of the defense methods?
Thank you for your attention to this matter.

can not generate trigger for low frequency attack

Thank you for sharing your work! It's really helpful.
However (excuse me), I can't generate a trigger for the low-frequency attack with resources/lowFrequency/generate_pattern.py.
There are no errors during execution, but the fooling rate of the Universal Perturbation algorithm stays at 0 for all 50 epochs (it simply doesn't change), which means the perturbation is not being optimized and the algorithm is not working.
For example, the fooling rate was still 0 at epoch 12 (screenshot omitted).
Here's some information:

  • I've already installed the packages in the requirement.txt file.
  • I tried to generate a trigger on the cifar10 dataset with several of the clean models you provided on Google Drive, but none of them works.
  • I used default.yaml and cifar10.yaml (in resources/lowFrequency) for testing, editing only the necessary parameters (model path or type, for example) and leaving the rest unchanged.
  • I removed "{iter}" from the code below because it raised errors during execution; the code only saves the trigger image, so I guess this doesn't influence the algorithm. I didn't make any other changes (see the sketch after this issue's text for a possible fix).
    np.save(f'{save_path_prefix}_{iter}.npy', v)
    v_lossy_image = np.clip(deepcopy(v) * 255 + 255 / 2, 0, 255).squeeze() # since v is [0,1]
    Image.fromarray(v_lossy_image.astype(np.uint8)).save(f'{save_path_prefix}_{iter}_lossy.jpg')

Your work is the only PyTorch-based implementation of the LF attack that I could find. The triggers you provided are useful and I'm grateful, but I still want to generate my own triggers.
How can I solve this problem? Thanks in advance!
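
For what it's worth, the {iter} placeholder looks like an iteration counter that is out of scope at the save site; a hypothetical fix is to pass the loop index in explicitly instead of deleting it (save_trigger and it are names I made up):

from copy import deepcopy

import numpy as np
from PIL import Image

def save_trigger(v, save_path_prefix, it):
    # `it` replaces the out-of-scope `iter` from the original snippet.
    np.save(f'{save_path_prefix}_{it}.npy', v)
    v_lossy_image = np.clip(deepcopy(v) * 255 + 255 / 2, 0, 255).squeeze()  # since v is [0,1]
    Image.fromarray(v_lossy_image.astype(np.uint8)).save(f'{save_path_prefix}_{it}_lossy.jpg')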

Ft_Trojan paper link

Great work, but I can't find a link to the Ft_Trojan paper. Could you please provide it?

v2.0 run scripts

Thanks so much for the v2.0 update - it's a big quality of life improvement!

Could you update the README to include scripts for running an attack and a defense with v2.0 (or note that the same scripts as in v1 work)?

Edit: nevermind! I just got confused by the formatting!
