Giter VIP home page Giter VIP logo

openfedllm's Issues

Results of training on Alpaca-GPT4 and testing on MT-Bench

Hello, directly run your code, the configuration has not changed, the test steps are also according to your evaluation/open_ended process, but gen_judge_mtbench.py results are all ERROR. What's the problem? Looking forward to your reply.

Run the following command:

  1. sh training_scripts/run_sft.sh
  2. python utils/merge_lora.py --base_model_path "meta-llama/Llama-2-7b-hf" --lora_path "output/alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822/checkpoint-200"
  3. CUDA_VISIBLE_DEVICES=2 python gen_model_answer_mt.py --base_model_path "../../output/alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822/full-200" --template "alpaca"
  4. python gen_judge_mtbench.py --judge_model gpt-4-1106-preview --model_list alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822_200
  5. python show_results_mt.py --model_list alpaca-gpt4_20000_fedavg_c20s2_i10_b16a1_l512_r32a64_20240313104822_200 --judge_model gpt-4-1106-preview

Here are the results of running on the A40:
model_answer.zip
model_judgment.zip

Run myself Data

Thx for ur good work. Could u pls tell me how can we run the local dataset? If the prompt in my dataset is same with the vicgalle/alpaca-gpt4, but it's not the hugging face format, it's the jason format. How can we run the code?

Evaluation on other datasets?

Hello and thank you for your great work. In addition to MT-Bench, Vicuna benchmark, and Advbench, do you have test details or codes for the other datasets?

Continue training according to checkpoint

I have generated several checkpoint model folders after 200 rounds of training, can I continue training with this model next time? If so, how to use the trained model parameters.

Request for Close-ended evaluation scripts

Congratulations on your excellent work!
Currently, the evaluation scripts only include open-ended scripts. Could you also release the closed-ended and other test dataset evaluation scripts? Due to the different settings (such as few-shot numbers and templates) used for different datasets, the results can vary significantly. To facilitate better comparisons and support the community's ongoing work, it would be beneficial to make this part of the code open-source.

Need help with evaluation

Thanks to the authors for contributing OpenFedLLM to the open source community. Currently, I finished training the model using the default run_sft.sh script, and save to model to the output file, checkpoint-XX. When I try to use the evaluation script <gen_model_answer_mt.py> and loading the checkpoint path, I'm not able to run due to some config error
(Can't load the configuration of <file_path>, If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure <file_path> is the correct path to a directory containing a config.json file).

Any help or advice?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.