Comments (1)
你好,感谢你对我们工作的关注
- 有关训练资源的细节暂时还不会公布,后续如果有更新会放到arxiv论文上;
- 我们在MMLU和CMMLU上评测了Qwen-VL的效果,虽然相较于纯文本Qwen-7B有些下降,但它依然在这两个数据集上取得了比较领先的结果,希望这份结果对你有帮助。
Model | MMLU | CMMLU |
---|---|---|
LLaMA-7B | 35.1 | - |
Baichuan-7B | 42.3 | 44.4 |
ChatGLM2-6B | 47.9 | 48.8 |
Qwen-7B | 56.7 | 58.8 |
Qwen-VL | 50.7 | 49.5 |
from qwen-vl.
Related Issues (20)
- 💡 [REQUEST] - <title>学习率不改变,有人知道吗?
- Stream request is not supported currently.
- 关于图片描述,如果有多个描述,能否在标注文件中都加入?加入的话格式如何? HOT 5
- Qwen-VL多模态的的图片识别幻觉太严重了,识别不了就无中生有,这是有参数可以设置的吗,例如只输出能识别的部分。 HOT 1
- 请教关于微调训练finetune HOT 44
- chartQA的test集使用的是chartqa_test_human 还是 chartqa_test_augmented?
- PermissionError: [Errno 13] Permission denied: 'SimSun.ttf'[BUG] <title> HOT 1
- 💡 [REQUEST] - <title> Could you add the evaluation of ConBench. HOT 1
- [BUG] TypeError: isin() received an invalid combination of arguments.
- typeError: isin() received an invalid combination of arguments - got (test_elements=int, elements=Tensor,), but expected one of HOT 4
- RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cpu!
- 关于推理预测 HOT 17
- [BUG] 在利用vl-plus的api进行图片总结时,如果多张图片名称相同,只会按照第一张图片内容进行总结
- Discussion closed HOT 1
- [BUG] <title> deepspeed expected the next 1 parameters in the parameter fetch queue to be
- [BUG] <'Only Support Self-Attention Currently' Assert Error> HOT 1
- Pretrain数据格式
- 关于模型融合 HOT 18
- [BUG] <qwen-vl api 在阿里云ecs 上调用出现 网络连接错误>
- 关于 chat模型 和 base模型的微调
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from qwen-vl.