Comments (5)
WORLD_SIZE=4 CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node=4 --master_port=3192 uniform_finetune.py
--data belle1m
--model_type bloom
--model_name_or_path bigscience/bloomz-7b1-mt
--lora_target_modules query_key_value
--per_gpu_train_batch_size 4
--learning_rate 3e-4
--epochs 1
from alpaca-cot.
what should I do
from alpaca-cot.
This may be caused by environmental issues, and we will quickly identify the cause and provide solutions.
from alpaca-cot.
We were unable to reproduce this bug. The code works correctly in A100 using the following command:
CUDA_VISIBLE_DEVICES=1,2,3,4 python3 -m torch.distributed.launch --nproc_per_node 4 --nnodes=1 \
uniform_finetune.py --model_type bloom --model_name_or_path bigscience/bloomz-7b1-mt \
--data alpaca --lora_target_modules query_key_value \
--per_gpu_train_batch_size 4 --learning_rate 3e-4 --epochs 1
or
CUDA_VISIBLE_DEVICES=0 python3 uniform_finetune.py \
--data alpaca \
--model_type bloom \
--model_name_or_path bigscience/bloomz-7b1-mt \
--lora_target_modules query_key_value \
--per_gpu_train_batch_size 4 \
--learning_rate 3e-4 \
--epochs 1
from alpaca-cot.
It seems that this may be caused by a low-level version of Python. Try using version 3.9 and above.
from alpaca-cot.
Related Issues (20)
- ChatGLM的Finetune推荐命令,使用3090 24G会OOM,代码默认使用8Bit量化同样会导致OOM HOT 1
- 请问如何修改模型的自我认知
- GPTeacher Code-Instruct HOT 1
- 8卡V100跑moss OOV HOT 2
- Prompt设置 HOT 1
- About the tokenizer
- The text meaning in zh_helpfulness_context.json in Alpaca-CoT / MOSS / moss-002-sft
- DataCollatorForLanguageModeling uses the unmasked labels
- web.py中缺少--size参数 HOT 1
- inference结果差异比较大,请问是什么原因 HOT 2
- 是否可以提供一个Gdrive和百度云的下载方式 HOT 2
- 是否可以支持qlora
- 你好,群二维码过期了 HOT 1
- About the source of the dataset
- What is the relationship between the data and the link you provided?
- 你好,能更新下群信息么 HOT 2
- Adding Contributors Section In readme.md
- 你好,群二维码过期了,能更新一下么~ HOT 6
- main分支下的readme顺序,以及base模型能否提供huggingface的链接 HOT 4
- 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from alpaca-cot.