Comments (4)
Hi,
Thanks for your interest in our work!
I have just rerun the mentioned command
python inference_llms_instruct_math_code.py --dataset_name gsm8k --finetuned_model_name WizardMath-7B-V1.0 --tensor_parallel_size 1 --weight_mask_rate 0.9 --use_weight_rescale
and it works well for me. I got an accuracy of 50.42.
To identify the issues, could you please run
python inference_llms_instruct_math_code.py --dataset_name gsm8k --finetuned_model_name WizardMath-7B-V1.0 --tensor_parallel_size 1 --weight_mask_rate 0.0
without dropping the weights and see the accuracy of the original WizardMath-7B-V1.0 model? I got 55.34 accuracy and you can compare with this result to ensure your inference process is right.
from mergelm.
Hi,
Thanks for your interest in our work!
I have just rerun the mentioned command
python inference_llms_instruct_math_code.py --dataset_name gsm8k --finetuned_model_name WizardMath-7B-V1.0 --tensor_parallel_size 1 --weight_mask_rate 0.9 --use_weight_rescale
and it works well for me. I got an accuracy of 50.42.To identify the issues, could you please run
python inference_llms_instruct_math_code.py --dataset_name gsm8k --finetuned_model_name WizardMath-7B-V1.0 --tensor_parallel_size 1 --weight_mask_rate 0.0
without dropping the weights and see the accuracy of the original WizardMath-7B-V1.0 model? I got 55.34 accuracy and you can compare with this result to ensure your inference process is right.
Thanks for you help! I haved ran the --weight_mask_rate 0.0
and get acc=0.5534495830174374
. However, I just cannot make --weight_mask_rate 0.9
right, whether with rescale or not.
from mergelm.
Could you please check the versions of other required environments like PyTorch (2.0.1) and transformers (4.33.1)? The mentioned problem is a bit strange as --weight_mask_rate 0.9
works for me.
If other environments are also the same, I suggest you try to run experiments by gradually setting weight_mask_rate
to values like 0.1, 0.4, 0.7, and 0.9. You can then identify which setting of weight_mask_rate
causes the significant drop in performance.
Please feel free to ask when you finish running the above experiments.
from mergelm.
Close this issue now.
Please feel free to reopen it when there are any further questions.
from mergelm.
Related Issues (20)
- 如何对齐论文中 LM&Math&Code融合的指标 HOT 4
- 模型支持 HOT 2
- Couldn't find a dataset script at /home/dell7960/PycharmProjects/DARE/MergeLM/glue/glue.py or any data file in the same directory. HOT 1
- ValueError: BuilderConfig 'rte' not found. Available: ['default'] HOT 2
- AssertionError: cannot find file trainer_state.json! HOT 10
- Questions about randomly set delta parameters==zero HOT 2
- Seeking mirrors of WizardLM models HOT 3
- Script to reproduce all experiment in paper HOT 1
- 如何合并多个模型? HOT 19
- 是否可以Drop 100%??? HOT 6
- Are the classification heads merged? HOT 1
- Any solution to merge 3B models? HOT 1
- Alpaca_eval evaluation error HOT 2
- alpaca eval 评测问题 HOT 6
- python enviornment HOT 2
- Why same values of evaluation metrics when run>=1 HOT 2
- about Table1 HOT 3
- When getting Figure3, and 4, you apply DARE on all layer seperately? HOT 3
- Question about experimental settings
- Environment issues HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mergelm.