Giter VIP home page Giter VIP logo

Comments (7)

supportend avatar supportend commented on June 10, 2024 2

I tested 13B and 30B without problems. Just download the models and start the program with the correct file, e.g.:
-m ggml-alpaca-30b-q4.bin

from alpaca.cpp.

supportend avatar supportend commented on June 10, 2024 2

Download of the 30B Model:
https://huggingface.co/Pi3141/alpaca-lora-30B-ggml/tree/main
4_1 does not work for me, i use the ggml-model-q4_0.bin file and renamed it to ggml-alpaca-30b-q4.bin.

It's the same file, that was downloaded inside the Dalai docker container:

sudo docker compose run dalai md5sum /root/dalai/alpaca/models/30B/ggml-model-q4_0.bin && md5sum /daten/alpaca/ggml-alpaca-30b-q4.bin
7bc55515cb128cd2ded9403d35c25244  /root/dalai/alpaca/models/30B/ggml-model-q4_0.bin
7bc55515cb128cd2ded9403d35c25244  /daten/alpaca/ggml-alpaca-30b-q4.bin

My system is a notebook and i reduced the cpu frequency a little, one token tooks over 600 ms.

from alpaca.cpp.

LuigiTrapanese avatar LuigiTrapanese commented on June 10, 2024

There was in the README the option to choose between 3 models and it disappeared a couple of weeks ago.
I found the 13b model, I wasn't able to find the 30b. I am still looking.

from alpaca.cpp.

AB0x avatar AB0x commented on June 10, 2024

@supportend could you please link the bin files for 13B and 30B?
how many tokens/s are you getting with the larger models?

from alpaca.cpp.

wangqiusheng1983 avatar wangqiusheng1983 commented on June 10, 2024

could you please link the bin files for 13B

Download of the 30B Model: https://huggingface.co/Pi3141/alpaca-lora-30B-ggml/tree/main 4_1 does not work for me, i use the ggml-model-q4_0.bin file and renamed it to ggml-alpaca-30b-q4.bin.

It's the same file, that was downloaded inside the Dalai docker container:

sudo docker compose run dalai md5sum /root/dalai/alpaca/models/30B/ggml-model-q4_0.bin && md5sum /daten/alpaca/ggml-alpaca-30b-q4.bin
7bc55515cb128cd2ded9403d35c25244  /root/dalai/alpaca/models/30B/ggml-model-q4_0.bin
7bc55515cb128cd2ded9403d35c25244  /daten/alpaca/ggml-alpaca-30b-q4.bin

My system is a notebook and i reduced the cpu frequency a little, one token tooks over 600 ms.

could you please link the bin files for 13B???

from alpaca.cpp.

supportend avatar supportend commented on June 10, 2024

I think, this is it:

https://huggingface.co/Pi3141/alpaca-lora-13B-ggml/tree/main
again 4_0 file.

There is a native version too on the site. But i dit not download a 4_1 file or the.

from alpaca.cpp.

wangqiusheng1983 avatar wangqiusheng1983 commented on June 10, 2024

I think, this is it:

https://huggingface.co/Pi3141/alpaca-lora-13B-ggml/tree/main again 4_0 file.

There is a native version too on the site. But i dit not download a 4_1 file or the.

thanks a lot i download it

from alpaca.cpp.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.