
Comments (5)

AndrewTal commented on August 17, 2024

Also, are the backbone (feature extractor) parameters frozen when training the above classifier?
Thanks!

from cassle.

DonkeyShot21 commented on August 17, 2024

Hello!

  1. yes, but IIRC for forgetting we train a single 10-class classifier after each task (the same procedure as for linear evaluation accuracy) and then calculate the accuracy only on the test samples of each task (without masking the classifier, i.e. class-incremental, not task-incremental). You can use the per-task accuracy logged by our code on wandb.
  2. yes, this is done with the sole purpose of evaluating the quality of the representations.
  3. no, it is the normal fine-tuning used throughout the continual learning literature, and it goes more or less as follows: (i) take some data (task1), (ii) train a model on task1 using an SSL method, (iii) discard the data for the current task and take the data for a new task (task2), (iv) keep training (fine-tuning) the model on task2 with the same SSL method, (v) repeat.
  4. yes, the backbone is frozen
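The sequential fine-tuning recipe in point 3 can be sketched in a few lines. This is a toy illustration, not the actual CaSSLe code: the linear backbone, the noisy-view "SSL" objective, and the random tensors all stand in for a real encoder (e.g. a ResNet), a real SSL loss (SimCLR, Barlow Twins, ...), and real task datasets.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Stand-in backbone; in practice this would be a ResNet or similar.
backbone = nn.Linear(8, 4)
optimizer = torch.optim.SGD(backbone.parameters(), lr=0.1)

def ssl_loss(model, batch):
    # Stand-in self-supervised objective: pull the features of two
    # noisy "views" of the same batch together.
    view1 = batch + 0.01 * torch.randn_like(batch)
    view2 = batch + 0.01 * torch.randn_like(batch)
    return nn.functional.mse_loss(model(view1), model(view2))

# (i)/(iii): unlabeled data arrives one task at a time.
tasks = [torch.randn(16, 8), torch.randn(16, 8)]
for task_data in tasks:
    for _ in range(5):  # (ii)/(iv): keep fine-tuning the SAME model
        loss = ssl_loss(backbone, task_data)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    # (v): this task's data is discarded before moving to the next task
```

The key point is that a single model is carried across tasks and only ever sees the current task's data, which is what makes forgetting possible in the first place.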


AndrewTal commented on August 17, 2024

Thanks for your reply!

Regarding question 1, with respect to forgetting, you mentioned that for each task the classifier has 10 classes. For each task, do you train the classifier using labeled data from all 10 classes, or do you train a 10-class classifier using only the labeled data for the 5 classes from the current task?

Regarding question 3, self-supervised learning has been added to the continual learning paradigm, which differs somewhat from the conventional steps. If I understand correctly, is the fine-tuning process like this:
[(use task1 unlabeled data for SSL) -> (use task2 unlabeled data for SSL) -> (use labeled data from all 10 classes to fine-tune)]?
For the forgetting calculation under fine-tuning, is it also the same as in 1? (For each task, train a 10-class classifier individually after all SSL steps finish.)
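For reference, the standard forgetting metric discussed in this exchange can be computed from the per-task accuracies in a few lines. The accuracy values below are invented for illustration only:

```python
# Hypothetical accuracy matrix: acc[t][i] = accuracy on task i's test
# split after training through task t (values made up for illustration).
acc = [
    [0.90, 0.10],  # after task 1: good on task 1, task 2 not yet seen
    [0.70, 0.85],  # after task 2: task 1 degraded, task 2 learned
]
T = len(acc) - 1  # index of the final task

def forgetting(i):
    # Best accuracy ever reached on task i before the end of training,
    # minus the accuracy on task i after the final task.
    return max(acc[t][i] for t in range(T)) - acc[T][i]

print(round(forgetting(0), 2))  # → 0.2
```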


DonkeyShot21 commented on August 17, 2024
  1. the linear evaluation is exactly the same as the one performed at the end; therefore, yes, we use all the data for all classes to learn the linear classifier, and then test the classifier on each task.
  2. mmh, if you want to see it like that, but generally the self-supervised part does not include the final linear evaluation. So it is more like: CSSL = [(use task1 unlabeled data for SSL) -> (use task2 unlabeled data for SSL)]; then, in order to evaluate the representations, we train a linear classifier with all the data (the backbone is frozen, not fine-tuned). And yes, the forgetting calculation is the same for all methods.
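The frozen-backbone linear evaluation described here might look like the following minimal sketch. The toy linear backbone and random tensors stand in for the SSL-pretrained encoder and the real labeled datasets:

```python
import torch
from torch import nn

torch.manual_seed(0)

# Toy stand-in for the SSL-pretrained encoder.
backbone = nn.Linear(8, 4)
for p in backbone.parameters():
    p.requires_grad = False  # frozen: only the representations are evaluated
backbone.eval()

# A single 10-way linear classifier, trained on labeled data from ALL classes.
classifier = nn.Linear(4, 10)
opt = torch.optim.SGD(classifier.parameters(), lr=0.1)

x = torch.randn(100, 8)           # stand-in labeled training data
y = torch.randint(0, 10, (100,))
for _ in range(20):
    loss = nn.functional.cross_entropy(classifier(backbone(x)), y)
    opt.zero_grad()
    loss.backward()
    opt.step()

def task_accuracy(x_test, y_test):
    # Accuracy on one task's test split. The 10-way head is NOT masked
    # to that task's own classes (class-incremental, not task-incremental).
    with torch.no_grad():
        preds = classifier(backbone(x_test)).argmax(dim=1)
    return (preds == y_test).float().mean().item()
```

Calling `task_accuracy` on each task's test split then gives the per-task accuracies from which forgetting is computed.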


AndrewTal commented on August 17, 2024

all clear, thanks! ^-^

