nilq / cutoff-len-is-context-len
This project is forked from kaiokendev/cutoff-len-is-context-len
Demonstration that fine-tuning a RoPE model on sequences longer than those used in pre-training adapts the model's context limit
License: MIT License