Comments (4)
I had previously also tried the CANN toolkit and kernels listed in the README, and got the same error.
from llama-factory.
Is the inference speed normal? If it is, the model is running on the NPU; on the CPU it would be extremely slow.
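> Is the inference speed normal? If it is, the model is running on the NPU; on the CPU it would be extremely slow.
No, it is not normal: only a few tokens per second. htop shows two CPU cores in use. The model was loaded onto the NPU and device memory is occupied, but the NPU's power draw does not change.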
A few tokens per second should be the normal speed.
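To put a number on "a few tokens per second" rather than estimating it by eye, throughput can be timed directly. The following is a minimal sketch, not part of llama-factory: `tokens_per_second` and `fake_generate` are hypothetical names, and `fake_generate` is only a stand-in for whatever call actually produces tokens on your setup.

```python
import time
from typing import Callable, List, Sequence


def tokens_per_second(generate: Callable[[], Sequence[int]], runs: int = 3) -> float:
    """Return generated tokens per wall-clock second, averaged over `runs` calls."""
    total_tokens = 0
    total_seconds = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        output = generate()  # one full generation call
        total_seconds += time.perf_counter() - start
        total_tokens += len(output)  # count only the tokens this call returned
    if total_seconds <= 0:
        return float("inf")
    return total_tokens / total_seconds


def fake_generate() -> List[int]:
    """Hypothetical stand-in for a real generation call: 10 tokens in about 50 ms."""
    time.sleep(0.05)
    return list(range(10))


if __name__ == "__main__":
    print(f"{tokens_per_second(fake_generate):.1f} tokens/s")
```

In real use, `generate` would wrap the model's own generation call and return only the newly generated token ids, so the prompt length does not inflate the figure. Comparing the result for the same prompt on CPU and on the NPU is one way to tell whether the accelerator is actually doing the work.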