Hello, thanks for your contribution to the network quantization field and your opensou

Can you provide a pre-trained ResNet-18 model? (apot_quantization issue, closed)

yhhhli commented on June 22, 2024

Can you provide a pre-trained ResNet-18 model?

from apot_quantization.

Comments (4)

yhhhli commented on June 22, 2024

Hello sijeh,
Sorry for the late reply, and thanks for your advice. I am planning to revise the code and provide the ResNet-18 checkpoints. I will comment here once the code is updated and the checkpoints are uploaded.


sijeh commented on June 22, 2024

Thx.


yhhhli commented on June 22, 2024

Hi sijeh,
I just uploaded the checkpoints for the 4-bit ResNet-18 and the new code for APoT Quantization! Here are the changes:

The quantization function no longer manually overwrites the gradients.
You can specify the bitwidth when initializing the model.
Checkpoints for 4-bit and 3-bit ResNet-18 are uploaded. More checkpoints will come in a few days.
A TensorBoard log file is also provided in the events dir.
New hyperparameters: Please note that we have changed the hyperparameter configuration. For ResNet-18-4bit, LR is set to 0.01 and scaled by 0.1 for all parameters, including clipping thresholds. Weight decay is set to 1e-4 for all parameters. Please also note that the 4-bit model is initialized from a 5-bit quantized model. (If you have not pre-trained a 5-bit model, it's fine to initialize directly from the full-precision model. However, we recommend initializing low-bit models, e.g. 2-bit, progressively.)
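
To make the progressive initialization concrete, here is a minimal sketch of how such a chain of training stages could be laid out. It is only an illustration, not code from the repository: the `Stage` class and `progressive_schedule` function are hypothetical names, stepping down one bit at a time is an assumption, and reusing the ResNet-18-4bit LR and weight decay for every stage is also an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Stage:
    """One training stage in a hypothetical progressive schedule."""
    bits: int                   # bitwidth trained in this stage
    init_from: str              # where the initial weights come from
    lr: float = 0.01            # value quoted for ResNet-18-4bit (assumed for other bitwidths)
    weight_decay: float = 1e-4  # value quoted as applying to all parameters


def progressive_schedule(start_bits: int, target_bits: int) -> List[Stage]:
    """Build stages from start_bits down to target_bits, one bit at a time.

    The first stage starts from the full-precision model; every later
    stage starts from the checkpoint of the stage before it.
    """
    if target_bits > start_bits:
        raise ValueError("target_bits must not exceed start_bits")
    stages: List[Stage] = []
    source = "full-precision"
    for bits in range(start_bits, target_bits - 1, -1):
        stages.append(Stage(bits=bits, init_from=source))
        source = f"{bits}-bit"
    return stages


schedule = progressive_schedule(start_bits=5, target_bits=2)
for stage in schedule:
    print(f"train {stage.bits}-bit, init from {stage.init_from}, "
          f"lr={stage.lr}, weight_decay={stage.weight_decay}")
```

With `start_bits=5` and `target_bits=4` this reproduces the setup described above, where the 4-bit model starts from the 5-bit checkpoint.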

Regarding your question about batch size:
In theory, the LR should scale in proportion to the batch size, because a smaller batch size means more training iterations. You may therefore use 0.01*192/1024 as your base LR.
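
The arithmetic behind that suggestion can be sketched as below. The function name `scaled_lr` is hypothetical, and reading 1024 as the reference batch size and 192 as your batch size is an inference from the expression 0.01*192/1024, not something stated explicitly.

```python
def scaled_lr(reference_lr: float, reference_batch: int, batch: int) -> float:
    """Linear scaling rule: keep LR proportional to the batch size."""
    if reference_batch <= 0 or batch <= 0:
        raise ValueError("batch sizes must be positive")
    return reference_lr * batch / reference_batch


# Assumed meaning of the numbers: LR 0.01 at batch size 1024, rescaled for batch size 192.
base_lr = scaled_lr(reference_lr=0.01, reference_batch=1024, batch=192)
print(f"{base_lr:.6f}")  # 0.001875
```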

If you have further questions, please do not hesitate to comment here.


sijeh commented on June 22, 2024

Hi yhhhli,
Thanks for your detailed reply and for updating the open-source code and pretrained models. Everything works correctly now that I have re-downloaded and unzipped the ImageNet dataset.

