Comments (2)
Hi, @RonanKMcGovern. Older versions of bitsandbytes did not allow saving weights in nf4 format. We worked around this by saving the weights to disk in 16 bits and converting them to 4 bits when loading them onto the GPU. Recent bitsandbytes releases do support saving in nf4 format, so we will update the code soon.
from loftq.
OK, but are you actually running the nf4 quantization then?
Or are you just saving the bf16 weights directly? If so, there will be an error when the model is reloaded, because the saved bf16 weights should be the dequantized weights, not the original ones...
Something seems off to me: even one iteration of LoftQ should improve results, but I see results worsen with one or more iterations (see this vid), as does kaitchup.substack.com.
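To make the concern about dequantized versus original weights concrete, here is a minimal sketch in plain Python. It is not the LoftQ or bitsandbytes code: `quantize` and `dequantize` are hypothetical helpers implementing a simple uniform absmax 4-bit grid (not real nf4), `AB` is random noise standing in for the low-rank term that a LoftQ iteration subtracts before quantizing, and the rounding introduced by actually storing values in bf16 is ignored.

```python
import random

LEVELS = 7  # symmetric 4-bit grid: integer codes in [-7, 7] (assumption, not nf4)


def quantize(block):
    """Absmax-quantize a list of floats to integer codes plus one scale."""
    absmax = max(abs(x) for x in block)
    scale = absmax / LEVELS if absmax > 0 else 1.0
    codes = [max(-LEVELS, min(LEVELS, round(x / scale))) for x in block]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to floats on the quantization grid."""
    return [c * scale for c in codes]


random.seed(0)
W = [random.gauss(0.0, 1.0) for _ in range(64)]
# Stand-in for the low-rank correction subtracted before quantizing in a
# LoftQ-style iteration; here it is just small random noise (assumption).
AB = [random.gauss(0.0, 0.2) for _ in range(64)]

# Backbone produced by one LoftQ-style step: quantize the residual W - AB.
codes_loftq, scale_loftq = quantize([w - ab for w, ab in zip(W, AB)])

# Option 1: save the dequantized backbone, then re-quantize on load.
codes_a, _ = quantize(dequantize(codes_loftq, scale_loftq))

# Option 2: save the original W, then quantize on load.
codes_b, _ = quantize(W)

print("dequantized save reproduces backbone:", codes_a == codes_loftq)
print("original save reproduces backbone:   ", codes_b == codes_loftq)
```

In this toy setting, re-quantizing the dequantized weights lands on the same codes, while quantizing the original weights on load gives a different backbone from the one the adapters were initialized against. Whether the repository's script behaves this way is exactly the open question in this thread.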
Related Issues (20)
- Can we use LoftQ to optimize vision foundation models like OWL-ViT v2 and Grounding Dino? HOT 1
- quantize_save.py script fails saving lora adapter with peft>=0.7.2 HOT 3
- Does it support Mixtral 8x7B? HOT 1
- loftQ can not use multi gpu to train HOT 9
- Is there any way for using LoftQ to GPTQ or AWQ model? HOT 2
- bugs for running python test_gsm8k.py when uses LoftQ for llama HOT 2
- A question from a novice. HOT 2
- The issue of not being able to download the LoftQ model from huggingface even when using an VPN HOT 1
- issues for running python test_gsm8k.py when uses LoftQ for llama
- Why are the full models, and not just adapters, pushed to hub? HOT 2
- Failing to converge when using some random seeds HOT 2
- Performance worsens versus QLoRA with TinyLlama
- Error with shape HOT 2
- quick question about the Llama-3 results HOT 1
- [BUG]size mismatch for base_model.model.model.embed_tokens.weight
- Method fails on Gemma-7B model HOT 1
- Embedding layer HOT 1
- Cannot reproduce the result of LoftQ on gsm8k with llama2-7b
- About the test result on gsm8k