spt's People

Contributors

charleshhy


spt's Issues

ModuleNotFoundError: No module named 'model.swin_transformer'

Traceback (most recent call last):
  File "train_spt.py", line 18, in <module>
    from model.vision_transformer_timm import VisionTransformerSepQKV
  File "/home/lyc/TNTprojectz/KE/SPT/model/__init__.py", line 2, in <module>
    from .swin_transformer import *
ModuleNotFoundError: No module named 'model.swin_transformer'

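The traceback shows that `model/__init__.py` imports a `swin_transformer` module that is absent from the released code. One possible workaround, assuming the Swin variant is only needed for Swin-backbone experiments (the `HAS_SWIN` flag is a hypothetical addition, not part of the SPT code), is to guard the import in `model/__init__.py`:

```python
# Hypothetical workaround for model/__init__.py: guard the import of the
# missing swin_transformer module so that ViT-only scripts keep working.
try:
    from model.swin_transformer import *
    HAS_SWIN = True
except ModuleNotFoundError:
    # swin_transformer.py is absent from the released code; Swin-backbone
    # experiments are unavailable, but ViT experiments can still proceed.
    HAS_SWIN = False
```

With this guard in place, `train_spt.py` can import `VisionTransformerSepQKV` without the package import failing first.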

Questions about the sensitivity function

Hello, thanks for providing the code.
I have some questions about calculating the sensitivity, and I would appreciate it if you could clarify them for me.

  1. What values of alpha and beta should generally be used?
  2. In your experience, how many batches should be processed for a reliable estimate of the sensitivity?
  3. In L181, what do the values denote? Are they the numbers of total tunable parameters to select?
  4. Could you explain how the sweep is performed, and why the value of 80 is chosen in L189?
  5. Could you explain the condition in L282? When I run the code, it only returns results for 1.0, 0.8, and 0.6; for smaller values the condition is apparently never satisfied.
  6. In L279, could you explain why the parameter count is calculated this way? Why is the division by 1e6 performed?
  7. In L191 and L196, why is param_num multiplied by 0.02 and 1e6, respectively?
  8. When using LoRA, I assume the additional parameters will be merged into the original parameters after training is done. Is the code for that available?

Thank you in advance.
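Regarding question 8: folding a trained LoRA update back into the frozen weight is a standard post-training step, since the adapter is a low-rank additive term. A minimal sketch of the merge (function and variable names are hypothetical, not taken from the SPT code):

```python
import numpy as np

def merge_lora(W, A, B, scaling=1.0):
    """Fold a trained LoRA update into the frozen weight.

    W: frozen weight, shape (d_out, d_in)
    A: LoRA down-projection, shape (r, d_in)
    B: LoRA up-projection, shape (d_out, r)
    After merging, the layer computes (W + scaling * B @ A) @ x with no
    extra parameters or latency at inference time.
    """
    return W + scaling * (B @ A)

# Toy example: a rank-2 update on a 4x4 weight.
rng = np.random.default_rng(0)
W = rng.standard_normal((4, 4))
A = rng.standard_normal((2, 4))
B = rng.standard_normal((4, 2))
W_merged = merge_lora(W, A, B, scaling=0.5)

# The merged weight reproduces frozen-path + adapter-path outputs exactly.
x = rng.standard_normal(4)
assert np.allclose(W_merged @ x, W @ x + 0.5 * (B @ (A @ x)))
```

Because the merge is exact, the adapter matrices can be discarded after merging and the fine-tuned model served like an ordinary dense checkpoint.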

Question about reproducing the result on VTAB-1k using ViT-B/16 pre-trained on ImageNet-21k.

Hi, great work! We are unable to reproduce the results for the experiment described in the title, although we can reproduce the results for models pre-trained on other datasets. Could you give us some suggestions for the experiments? Below are our results under a parameter budget of 0.4M. Looking forward to hearing from you.

Natural: 70.82, 92.42, 71.54, 99.28, 87.22, 55.20, 91.22
Specialized: 85.57, 95.96, 85.60, 74.31
Structured: 81.83, 66.81, 49.36, 78.76, 79.02, 49.38, 27.73, 38.11
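In case it helps with cross-checking, VTAB-1k results are commonly summarized as the average of the three group means. A quick script to compute those summaries for the numbers above (the grouping follows the labels in the list):

```python
# Per-task accuracies grouped as reported in the issue above.
natural = [70.82, 92.42, 71.54, 99.28, 87.22, 55.20, 91.22]
specialized = [85.57, 95.96, 85.60, 74.31]
structured = [81.83, 66.81, 49.36, 78.76, 79.02, 49.38, 27.73, 38.11]

def mean(xs):
    return sum(xs) / len(xs)

group_means = {
    "Natural": mean(natural),
    "Specialized": mean(specialized),
    "Structured": mean(structured),
}
# VTAB convention: overall score is the mean of the three group means,
# not the mean over all 19 tasks.
overall = mean(list(group_means.values()))
print(group_means, round(overall, 2))
```

Comparing these group means and the overall average against the paper's table should make it easier to see which task group accounts for the gap.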
