Machine: MAX1100 ipex-llm: 2.1.0b20240421 <s

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

RuntimeError: "fused_dropout" not implemented for 'Byte' when running trl ppo finetuning about bigdl HOT 3 OPEN

Jasonzzt commented on May 25, 2024

RuntimeError: "fused_dropout" not implemented for 'Byte' when running trl ppo finetuning

from bigdl.

Comments (3)

Uxito-Ada commented on May 25, 2024 1

@Jasonzzt From the log, it is found that PPO also applies PEFT LoRA.
Therefore, like QLoRA, rather than from_pretrained a peft model with lora config, we should first load the base model, and then use get_peft_model, prepare_model_for_kbit_training etc. methods in qlora.py to create a peft model. Such a model is built on top of layers with supported operators like here.

from bigdl.

Uxito-Ada commented on May 25, 2024

@leonardozcm pls take a look, whether it is not supported by our kernel? tks.

from bigdl.

leonardozcm commented on May 25, 2024

hi, I think the VF.drop is not implemented by our kernels, instead I suppose this error indicates that input is in 8-bit data format which is not a supported dtype for torch.nn.functional.dropout

from bigdl.

RuntimeError: "fused_dropout" not implemented for 'Byte' when running trl ppo finetuning about bigdl HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent