Comments (6)

hutianyu2006 commented on May 17, 2024

from qwen.

JianxinMa commented on May 17, 2024

int4 usually causes a fairly large loss in quality. If you only want to try the chat experience, you can use the free demo on ModelScope: https://modelscope.cn/studios/qwen/Qwen-7B-Chat-Demo/summary

JianxinMa commented on May 17, 2024

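Got it. I think the README has something about int4; I'll check with the relevant colleagues whether it is usable. By the way, I just tried with a new phone number, and registering on ModelScope to try the demo does not seem to require an Alibaba Cloud account: during registration, don't click "领取算力礼包" (claim the compute gift pack), and skip the step that binds an Alibaba Cloud account.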

hutianyu2006 commented on May 17, 2024

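1. What I mean is publishing a pre-quantized int4 model on HuggingFace, the way ChatGLM does. Otherwise, an environment with little RAM cannot even load the model.
2. I tried the demo, but I found that it can be used through the gradio API without an auth token. If that is intentional, then the login seems completely redundant; if it is unintentional, I suggest adding an auth token to prevent abuse.
3. I'd also like to report a ModelScope bug: the phone number field appears to support many countries, but in practice only +86 works, because numbers from other countries all fail with a format error (the verification code arrives normally; the error appears at the step where you fill in the account name).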

JustinLin610 commented on May 17, 2024

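Previously the model was probably loaded in fp32 by default, which easily runs out of GPU memory; loading it now should use less. NF4 and Int8 can both be used as well. As for the small-RAM problem, we will provide a quantized version of the ckpt directly later on.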
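
To make the memory point concrete, here is a rough back-of-the-envelope sketch of weight-only memory at each precision mentioned above. It assumes a round 7e9 parameters (an illustrative figure, not an exact count for any Qwen checkpoint) and ignores activations, the KV cache, and the per-block scaling overhead that NF4 and Int8 quantization add in practice. The names `BYTES_PER_PARAM`, `ASSUMED_PARAMS`, and `weight_memory_gib` are hypothetical.

```python
# Nominal storage per parameter for each format (assumption: no quantization metadata).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "nf4": 0.5}

# Assumption: a round 7 billion parameters, used only for illustration.
ASSUMED_PARAMS = 7e9


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Return the approximate size of the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gib(ASSUMED_PARAMS, dtype):.1f} GiB")
```

Under these assumptions the 4-bit weights take one eighth of the fp32 footprint, which is why a pre-quantized checkpoint matters for machines that cannot hold the full-precision weights even transiently during loading.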

hutianyu2006 commented on May 17, 2024

Then I hope the quantized ckpt can be put on HF as soon as possible. I'll close this issue for now.
