rui-ye / openfedllm

License: Apache License 2.0

Python 98.53% Shell 1.47%

openfedllm's Issues

Results of training on Alpaca-GPT4 and testing on MT-Bench

Hello, I ran your code directly with the configuration unchanged and followed the evaluation/open_ended procedure for testing, but every result from gen_judge_mtbench.py is ERROR. What could be the problem? Looking forward to your reply.

I ran the following commands:

sh training_scripts/run_sft.sh
python utils/merge_lora.py --base_model_path "meta-llama/Llama-2-7b-hf" --lora_path "output/alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822/checkpoint-200"
CUDA_VISIBLE_DEVICES=2 python gen_model_answer_mt.py --base_model_path "../../output/alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822/full-200" --template "alpaca"
python gen_judge_mtbench.py --judge_model gpt-4-1106-preview --model_list alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822_200
python show_results_mt.py --model_list alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822_200 --judge_model gpt-4-1106-preview

Here are the results from a run on an A40:
model_answer.zip
model_judgment.zip

Does the framework support multi-gpu training?

Thanks for your brilliant work. I would like to run SFT on multiple GPUs. Does the framework support this by design, or do I need to make some modifications?

Running on my own data

Thanks for your good work. Could you please tell me how to run the code on a local dataset? The prompts in my dataset have the same format as vicgalle/alpaca-gpt4, but the data is stored as a JSON file rather than as a Hugging Face dataset. How can I run the code on it?
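A minimal sketch of reading such a local file, not the project's own loader. It assumes the records use the `instruction`, `input`, and `output` fields found in vicgalle/alpaca-gpt4; the function name `load_local_records` and the constant `REQUIRED_FIELDS` are hypothetical, and the field list should be adjusted if your data differs.

```python
import json
from pathlib import Path

# Assumption: same field names as vicgalle/alpaca-gpt4; change if yours differ.
REQUIRED_FIELDS = ("instruction", "input", "output")


def load_local_records(path):
    """Read a .json file (a list of objects) or a .jsonl file (one object per line)."""
    text = Path(path).read_text(encoding="utf-8")
    if str(path).endswith(".jsonl"):
        # JSON Lines: parse every non-empty line separately.
        records = [json.loads(line) for line in text.splitlines() if line.strip()]
    else:
        records = json.loads(text)
    if not isinstance(records, list):
        raise ValueError("expected a list of records")
    # Fail early if any record lacks a field the prompt template would need.
    for i, rec in enumerate(records):
        missing = [f for f in REQUIRED_FIELDS if f not in rec]
        if missing:
            raise ValueError(f"record {i} is missing fields: {missing}")
    return records
```

The resulting list can be turned into a Hugging Face dataset with `datasets.Dataset.from_list(records)`, or the file can be loaded directly with `datasets.load_dataset("json", data_files=path)`. Where to plug that into OpenFedLLM's dataset loading is not stated in this thread.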

Evaluation on other datasets?

Hello and thank you for your great work. In addition to MT-Bench, Vicuna benchmark, and Advbench, do you have test details or codes for the other datasets?

How can I perform local training using OpenFedLLM?

I want to perform local training, without any federated learning operations, under the same settings. How should I modify the parameters in the existing code framework?

Continue training from a checkpoint

After 200 rounds of training I have several checkpoint folders. Can I resume training from one of these checkpoints next time? If so, how do I load the trained model parameters?

Request for closed-ended evaluation scripts

Congratulations on your excellent work!
Currently, the evaluation scripts only cover open-ended evaluation. Could you also release the evaluation scripts for the closed-ended benchmarks and the other test datasets? Because different datasets use different settings (such as the number of few-shot examples and the prompt templates), results can vary significantly. Open-sourcing this part of the code would make comparisons easier and support the community's ongoing work.

Need help with evaluation

Thanks to the authors for contributing OpenFedLLM to the open source community. I have finished training the model with the default run_sft.sh script and saved it to the output directory as checkpoint-XX. When I run the evaluation script gen_model_answer_mt.py with the checkpoint path, it fails with a configuration error:
(Can't load the configuration of <file_path>, If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure <file_path> is the correct path to a directory containing a config.json file).

Any help or advice?
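The error message says the loader expects a directory containing a config.json file. One possible cause, which is an assumption and not confirmed in this thread, is that checkpoint-XX holds only a LoRA adapter (PEFT saves adapters with an adapter_config.json), whereas the first issue in this list runs utils/merge_lora.py and then passes the resulting full-200 directory to gen_model_answer_mt.py. A small diagnostic sketch, with the hypothetical helper name `describe_checkpoint`:

```python
from pathlib import Path


def describe_checkpoint(path):
    """Guess what kind of checkpoint a directory holds from its config files."""
    p = Path(path)
    if not p.is_dir():
        return "missing"
    if (p / "config.json").is_file():
        # A full model directory, loadable as a base model path.
        return "full_model"
    if (p / "adapter_config.json").is_file():
        # Adapter-only directory; it likely needs merging into the base model first.
        return "lora_adapter"
    return "unknown"
```

If the function reports `lora_adapter`, merging the adapter first, as in the merge_lora.py command shown in the first issue, may resolve the error.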

train_loss

Hello, is it normal for the train_loss to be this high?

Error in FedProx loss computation

The summation of L2 losses over named_parameters() does not equal the L2 loss of the entire model. Please check fed_local_dpo.py and fed_local_sft.py.
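To make the concern concrete, here is a small numeric sketch in plain Python. It does not reproduce the code in fed_local_sft.py or fed_local_dpo.py. It assumes the standard FedProx proximal term (mu/2)·||w − w_global||², and all function names are hypothetical. Summing squared per-tensor norms matches the whole-model term, while summing unsquared norms does not.

```python
import math


def squared_l2(values):
    """Squared L2 norm of a flat list of numbers."""
    return sum(v * v for v in values)


def prox_whole_model(local, global_, mu):
    """(mu / 2) * ||w - w_global||^2 over all parameters flattened together."""
    flat = [l - g for name in local for l, g in zip(local[name], global_[name])]
    return 0.5 * mu * squared_l2(flat)


def prox_per_tensor_squared(local, global_, mu):
    """Sum over tensors of (mu / 2) * ||w_i - w_global_i||^2."""
    total = 0.0
    for name in local:
        delta = [l - g for l, g in zip(local[name], global_[name])]
        total += 0.5 * mu * squared_l2(delta)
    return total


def prox_per_tensor_unsquared(local, global_, mu):
    """Sum over tensors of (mu / 2) * ||w_i - w_global_i|| (norm NOT squared)."""
    total = 0.0
    for name in local:
        delta = [l - g for l, g in zip(local[name], global_[name])]
        total += 0.5 * mu * math.sqrt(squared_l2(delta))
    return total


# Two one-element "tensors" standing in for named_parameters().
local = {"a": [3.0], "b": [4.0]}
global_ = {"a": [0.0], "b": [0.0]}

whole = prox_whole_model(local, global_, mu=1.0)               # 12.5
squared = prox_per_tensor_squared(local, global_, mu=1.0)      # 12.5
unsquared = prox_per_tensor_unsquared(local, global_, mu=1.0)  # 3.5
```

Whether the per-parameter loop in the repository squares each norm determines which of the two per-tensor variants it corresponds to.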
