Thanks for the great work and convenient benchmarking tool! I would

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

How to evaluate the model memory efficiently? about bigcode-evaluation-harness HOT 6 CLOSED

bigcode-project commented on June 20, 2024

How to evaluate the model memory efficiently?

from bigcode-evaluation-harness.

Comments (6)

loubnabnl commented on June 20, 2024 1

Closing this issue, as I tried loading CodeGen-16B in mixed precision and it fits under 40GB of RAM

from bigcode-evaluation-harness.

arjunguha commented on June 20, 2024

This is not going to be full solution. I have gotten Codegen-16B-multi to work on an A6000/48GB. The script we used to pull it off is here:

https://github.com/nuprl/MultiPL-E/blob/main/inference/codegen.py

Note the crazy code for the stopping criteria. IIRC it was necessary to get things to work.

from bigcode-evaluation-harness.

loubnabnl commented on June 20, 2024

Can you make sure that FP16 is set and follow memory consumption up until accelerator.prepare ?

from bigcode-evaluation-harness.

Godofnothing commented on June 20, 2024

@loubnabnl I set fp16 in the accelerate launch --mixed_precision fp16 but it doesn't help. There is no GPU memory consumption up to accelerator.prepare.

from bigcode-evaluation-harness.

loubnabnl commented on June 20, 2024

@Godofnothing we found a bug which made the memory consumption more than necessary, can you try running evaluation with code from this PR #61? you now need to specify --precision fp16

from bigcode-evaluation-harness.

Godofnothing commented on June 20, 2024

Sorry for long delay. I've pulled the latest version of the code and model successfully fits onto 40GB. Thanks for your help and response.

from bigcode-evaluation-harness.

How to evaluate the model memory efficiently? about bigcode-evaluation-harness HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent