Comments (15)
您好,欢迎您的关注,请根据下面的步骤来解决:
- 请确保您 已经合并权重 得到
zhixi
且已经 下载LoRA权重 ; - 根据您的报错信息,请尝试使用
zhixi
和lora
的 绝对路径 来传入到命令行
如果上述都不能解决,请告知我 :)
from knowlm.
您好,您也可以检查下md5码
from knowlm.
好像是./zhixi下面权重的问题,我再尝试解决一下: )
from knowlm.
您好,欢迎您的关注,请根据下面的步骤来解决:
- 请确保您 已经合并权重 得到
zhixi
且已经 下载LoRA权重 ;- 根据您的报错信息,请尝试使用
zhixi
和lora
的 绝对路径 来传入到命令行如果上述都不能解决,请告知我 :)
好的,谢谢!已经定位到错误,合并权重的问题
from knowlm.
python tools/weight_diff.py recover --path_raw ./llama-13b-hf --path_diff ./zhixi-13b-diff --path_tuned ./zhixi
File "tools/weight_diff.py", line 26
path_raw: str, path_tuned: str, path_diff: str, device="cpu", # "cuda" or "cpu"
^
SyntaxError: invalid syntax
您好,我在合并权重的时候遇到了这样的问题,重新按照readme配置了环境,但是还是会出现如上情况
from knowlm.
您好,您是否修改过weight_diff.py
文件呢,我这边测试正常的,方便提供一下您的weight_diff.py
文件吗?
from knowlm.
您好,您是否修改过
weight_diff.py
文件呢,我这边测试正常的,方便提供一下您的weight_diff.py
文件吗?
您好,我没有修改过'weight_diff.py',报错后我从仓库中复制了一份代码,但是还是不行
weight_diff.txt
from knowlm.
是一开始执行的时候马上就报这个错误吗?可以把完整的错误分享一下吗,这个错误是说python语法问题,我这边一切正常。
from knowlm.
是一开始执行的时候马上就报这个错误吗?可以把完整的错误分享一下吗,这个错误是说python语法问题,我这边一切正常。
嗯嗯,是的,上面那个就是完整的错误 ,一开始执行的时候就报这个错误了,会不会是我python版本的问题,我的python版本是3.9.6
from knowlm.
你好,我重新配置了一下环境,使用python=3.9.6
进行,一切也都是正常的。
from knowlm.
您可以尝试一下把make_diff
函数删除,观察是否会报错
from knowlm.
您可以尝试一下把
make_diff
函数删除,观察是否会报错
好的,收到,我尝试一下
from knowlm.
请问您的问题解决了吗
from knowlm.
请问您的问题解决了吗
还没,可能是我的环境有问题,今天刚刚重新排查问题
from knowlm.
您的问题解决了吗
from knowlm.
Related Issues (20)
- How to resolve out of memory HOT 6
- does it support models from vllm? HOT 2
- 在复现信息抽取结果时报错:TypeError: not a string HOT 2
- Input information on the web page, but no response is displayed HOT 7
- Domestic model download issues HOT 3
- don't know how to delete duplicate issue
- 关于显存不足,进行llama.cpp进行量化的问题 HOT 7
- New Example file HOT 1
- OceanBench HOT 1
- 下载超时 HOT 1
- 您好,经过测试,device="cpu",python examples/generate_finetune_web.py --base_model ./model/knowlm-13b-base-v1.0中产生的RuntimeError: "addmm_impl_cpu_" not implemented for 'Half'问题解决了。 HOT 1
- RuntimeError: "addmm_impl_cpu" not implemented for 'Half' HOT 1
- ValueError: Please specify the ZeRO optimization config in the DeepSpeed config. HOT 5
- Feature: Adding contributors section to the README.md file. HOT 2
- 请问下是否支持非中英语种的任务? HOT 3
- 当输入文本比较长时,输出结果仅仅是复制原文 HOT 2
- 请问如何指定运行在多个GPU上? HOT 5
- ValueError: Can't read finetune/lora/templates/alpaca.json HOT 12
- 批量预测问题 HOT 13
- 运行generate_lora_web.py遇到的问题。 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from knowlm.