Comments (4)
Hi, thank you for the interest of our work. LoftQ supports any existing quantization function in theory, but GPTQ implementation AutoGPTQ doesn't support dequantization, which is required in LoftQ (see Section 2.2 in LoftQ paper).
If you can find GPTQ implementation that has the dequantization method, please let me know. I'm glad to add it to LoftQ :)
from loftq.
Plus, we do have the experimental uniform quantization method at https://github.com/yxli2123/LoftQ/blob/main/glue/utils.py#L103. However, it's not the same uniform quantization used in GPTQ.
from loftq.
Hi, thank you for the interest of our work. LoftQ supports any existing quantization function in theory, but GPTQ implementation AutoGPTQ doesn't support dequantization, which is required in LoftQ (see Section 2.2 in LoftQ paper).
If you can find GPTQ implementation that has the dequantization method, please let me know. I'm glad to add it to LoftQ :)
Do you mean that vecquant4matmul is not a seperate dequantization function (dequantization + matmul) ?
from loftq.
@yxli2123 Thank you for providing experimental details. And congratulations to LoftQ for being accepted as a oral at ICLR 2024! It's sure that AutoGPTQ uses group-wise quantization and bit compression. Maybe LoftQ requires a custom dequantization function if it have to integrate into PEFT.
I found some related discussions about Pytorch-like Dequatization function:
Faster Pytorch dequantize() + matmul for quantized models
A dequantization function seems to be implemented by offical pytorch:
FUNCTION AT::_WEIGHT_INT4PACK_MM
I hope the above information will help you.
from loftq.
Related Issues (20)
- Can we use LoftQ to optimize vision foundation models like OWL-ViT v2 and Grounding Dino? HOT 1
- quantize_save.py script fails saving lora adapter with peft>=0.7.2 HOT 3
- Does it support Mixtral 8x7Bīŧ HOT 1
- loftQ can not use multi gpu to train HOT 9
- Is there any way for using LoftQ to GPTQ or AWQ model? HOT 2
- bugs for running python test_gsm8k.py when uses LoftQ for llama HOT 2
- A question from a novice. HOT 2
- The issue of not being able to download the LoftQ model from huggingface even when using an VPN HOT 1
- issues for running python test_gsm8k.py when uses LoftQ for llama
- Why are the full models, and not just adapters, pushed to hub? HOT 2
- Failing to converge when using some random seeds HOT 2
- Performance worsens versus QLoRA with TinyLlama
- Why are base weights on HF LoftQ models in 16-bit? HOT 2
- Error with shape HOT 2
- quick question about the Llama-3 results HOT 1
- [BUG]size mismatch for base_model.model.model.embed_tokens.weight
- Method fails on Gemma-7B model HOT 1
- Embedding layer HOT 1
- Cannot reproduce the result of LoftQ on gsm8k with llama2-7b
- About the test result on gsm8k
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
đ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. đđđ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google â¤ī¸ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from loftq.