System Info: I am on PyTorch 2.2.2, CUDA 12.1, GCC 10.3.1. Trying t

Cannot launch with error "exllamav2_kernels not installed" (text-generation-inference issue, closed)

coderaBruce commented on July 18, 2024

Cannot launch with error exllamav2_kernels not installed.

from text-generation-inference.

Comments (5)

anhou commented on July 18, 2024

The same issue

Semihal commented on July 18, 2024

Build and install rotary and layer_norm from flash-attn repository.

Kev1ntan commented on July 18, 2024

Build and install rotary and layer_norm from flash-attn repository.

Hi @Semihal, can you give the commands to build that?

Semihal commented on July 18, 2024

Build and install rotary and layer_norm from flash-attn repository.

Hi @Semihal, can you give the commands to build that?

Clone the flash-attention repository the same way as in this Makefile:
https://github.com/huggingface/text-generation-inference/blob/main/server/Makefile-flash-att-v2#L7-L12

Then:

Change the current directory to layer_norm (from the root of the flash-attention repo), then build and install it:
cd csrc/layer_norm
python setup.py build
python setup.py install

Do the same for rotary-emb:
cd ../rotary
python setup.py build
python setup.py install
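
After both installs finish, a quick check like the one below can confirm the extensions are visible to the Python environment that runs the server. This is only a sketch: the module names `rotary_emb` and `dropout_layer_norm` are assumptions about what the two flash-attention builds install, and `exllamav2_kernels` is taken from the error message in this issue. The helper name `missing_modules` is made up for illustration.

```python
import importlib.util

# Assumed module names (not confirmed in this thread): the csrc/rotary and
# csrc/layer_norm builds are expected to install `rotary_emb` and
# `dropout_layer_norm`; `exllamav2_kernels` comes from the reported error.
EXPECTED_MODULES = ["rotary_emb", "dropout_layer_norm", "exllamav2_kernels"]


def missing_modules(names):
    """Return the top-level module names that cannot be located."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Treat lookup failures as "not installed" rather than crashing.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    for name in missing_modules(EXPECTED_MODULES):
        print(f"not installed: {name}")
```

Run it with the same interpreter that launches text-generation-inference. Any name it prints still needs to be built and installed in that environment.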

github-actions commented on July 18, 2024

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
