llama-cpu's Issues

BFloat16 error in last update

I am using bfloat16 with the 7B model. The model loads quickly and uses little memory, but an error is raised before any answer is generated.
Replacing the previous files (generation and model) in the "llama" folder did not solve it.

prompt> the meaning of the life is
0%| | 0/256 [00:00<?, ?it/s]
Traceback (most recent call last):
File "/home/user/IA/ll-cpu3/llama-cpu/example-chat.py", line 109, in
fire.Fire(main)
...
...
File "/home/user/IA/ll-cpu3/llama-cpu/llama/model.py", line 109, in forward
xq, xk, xv = self.wq(x), self.wk(x), self.wv(x)
File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "/home/user/.local/lib/python3.10/site-packages/torch/nn/modules/linear.py", line 114, in forward
return F.linear(input, self.weight, self.bias)
RuntimeError: expected scalar type Float but found BFloat16
0%| | 0/256 [00:02<?, ?it/s]
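This kind of error usually means the input tensor and the layer weights have different dtypes when they reach `F.linear`. The sketch below reproduces that situation with a small stand-in layer and shows one possible workaround: casting the input to the weight's dtype. The layer size, the variable names, and the `forward_matching_dtype` helper are hypothetical and are not taken from the llama-cpu code; where the cast actually belongs in `model.py` is not something the traceback confirms.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for one attention projection (such as wq),
# with its weights converted to bfloat16.
layer = nn.Linear(8, 8, bias=False).to(torch.bfloat16)

# torch.randn produces float32 by default, so this input does not
# match the dtype of the layer's weights.
x = torch.randn(2, 8)

# Calling the layer with mismatched dtypes is expected to fail;
# the message is recorded rather than assumed.
try:
    layer(x)
    mismatch_error = None
except RuntimeError as err:
    mismatch_error = str(err)


def forward_matching_dtype(module, inputs):
    """Cast inputs to the dtype of the module's weight, then call it."""
    return module(inputs.to(module.weight.dtype))


y = forward_matching_dtype(layer, x)
print(mismatch_error)
print(y.dtype)
```

The opposite direction, converting the weights back to float32 with `layer.float()`, also removes the mismatch but gives up the memory savings of bfloat16.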

merge-weights.py: unsupported operand type(s) for |=: 'dict' and 'dict'

When I run python3 merge-weights.py --input_dir /home//llama-cpu/ --model_size 7B I receive the following error:

Traceback (most recent call last):
  File "merge-weights.py", line 168, in <module>
    main()
  File "merge-weights.py", line 161, in main
    write_model(
  File "merge-weights.py", line 63, in write_model
    state_dict |= {
TypeError: unsupported operand type(s) for |=: 'dict' and 'dict'
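The in-place dict union operator `|=` was added in Python 3.9 (PEP 584), and older interpreters raise exactly this `TypeError`. The report does not state which Python version was used, so treat that as the assumed cause. A minimal sketch of a version-independent alternative uses `dict.update`; the `merge_into` helper and the key names below are hypothetical and not taken from `merge-weights.py`.

```python
import sys


def merge_into(target, extra):
    """Merge `extra` into `target` in place; works on any Python 3 version."""
    target.update(extra)
    return target


# Hypothetical keys, used only to illustrate the merge.
state_dict = {"layers.0.weight": 1}
merge_into(state_dict, {"layers.0.bias": 2})

# True when the running interpreter supports `dict |= dict`.
supports_dict_union = sys.version_info >= (3, 9)
print(state_dict, supports_dict_union)
```

Running the script with Python 3.9 or newer would avoid the error without changing the code.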

From the README, I understood that it is necessary to run example-chat-bfloat16.py.
