Hi, Currently, is it possible to finetune one of the (pretrained) autocompressor m

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Finetuning an autocompressor model about autocompressors HOT 4 CLOSED

imbalu007 commented on August 12, 2024

Finetuning an autocompressor model

from autocompressors.

Comments (4)

CodeCreator commented on August 12, 2024 1

You only need to finetune the model -- the tokenizer remains unchanged, e.g., the tokenizer princeton-nlp/AutoCompressor-Llama-2-7b-6k is the same as the standard Llama-2 tokenizer.

from autocompressors.

CodeCreator commented on August 12, 2024

This should be straightforward!

First, have a look at the train.sh and train_llama.sh scripts in the run/ folder. To fine-tune from an existing AutoCommpressor instead of an LM base model, simply change the model_url variable to a AutoCompressor on huggingface-hub or from a local checkpoint, e.g., changing this line to model_url="princeton-nlp/AutoCompressor-Llama-2-7b-6k" will fine-tune from this model.

from autocompressors.

imbalu007 commented on August 12, 2024

Thanks!
Also, I couldn't find references to training/finetuning a tokenizer in the paper. But in the example usage, you refer to a custom tokenizer (I think).
tokenizer = AutoTokenizer.from_pretrained("princeton-nlp/AutoCompressor-Llama-2-7b-6k").
Do we need to finetune the tokenizer also and not just the model?

from autocompressors.

hxs91 commented on August 12, 2024

@imbalu007 Hi, have you finetuned the AutoCompressor model on downstream task? If so, what about the performance? Thanks.

from autocompressors.

Recommend Projects

Finetuning an autocompressor model about autocompressors HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent