Comments (2)
训练过程中用到了flash attention和rotatry embedding,可以用下面的命令安装:
# install flash attention
git clone [email protected]:Dao-AILab/flash-attention.git
cd flash-attention
python setup.py install
# install rotaty operator
cd csrc/rotary
pip install -e .
from internlm-xcomposer.
感谢!
from internlm-xcomposer.
Related Issues (20)
- 多机推理float16 HOT 5
- mmbench 效果评估 error HOT 2
- server resources required for finetune a lora model HOT 1
- 4bit版本使用modelscope拉取,inference时一直显示获取模型版本失败 HOT 2
- Add English commercial questionnaire HOT 3
- 用到的sam图片数据 HOT 7
- 预训练放开vision encoder,效果很差 HOT 4
- demo获得的结果 比使用代码获得的结果要好,如何解决? HOT 2
- ShareCaptioner is based on the improved InternLM-Xcomposer-7B base model.
- evaluation internlm-xcomposer2-vl-7b get 10% acc on mmbench-dev-cn, not 78.3% HOT 1
- InternLM-XComposer2-VL和InternLM-XComposer2 这两个模型的区别 HOT 4
- Cuda error When I try muti gpu inference(+lora). HOT 5
- interleav_wrap has no padding bug HOT 4
- Share training code and data preparation code of DualFocus HOT 1
- InternLM-XComposer2-VL-7B使用lora微调,似乎保存了整个模型? HOT 3
- v100 32g 无法推理, torch.cuda.OutOfMemoryError HOT 2
- couldn't find it in the cached files and it looks like openai/clip-vit-large-patch14-336 is not the path to a directory containing a file named config.json. HOT 2
- 请问InternLM-XComposer2 是否使用了vision_projector HOT 2
- InternLM-XComposer2-VL-7B使用lora微调后,如何量化得到int4版的模型用于推理? HOT 3
- 多机推理bug HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from internlm-xcomposer.