<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

please share sflckr vqgan for segmentation maps about taming-transformers HOT 3 CLOSED

compvis commented on August 23, 2024

please share sflckr vqgan for segmentation maps

from taming-transformers.

Comments (3)

pesser commented on August 23, 2024 1

Ah, I see. The cond_stage_model.colorize is actually not that important - it is only used for visualization purposes (to make the colorization of logged segmentation maps persistent). The main issue is then that cond_stage_model.colorize is present in the provided weights but not in the model you are instantiating. If you look at

taming-transformers/taming/models/vqgan.py

Line 34 in 8549d3a

if colorize_nlabels is not None:

you will see that it is registered as a buffer if the config parameter colorize_nlabels is set. So if you simply add colorize_nlabels: 182 to https://github.com/ink1/taming-transformers/blob/master/configs/sflckr.yaml#L44 it should work. (in the sflckr_cond_stage.yaml this parameter is named n_labels as you have seen --- but in that case the model class is the VQSegmentationModel instead of the VQModel because it uses a different loss. Awkward that we use two different names for the same config parameter 🙄 ).

Finally, this shouldn't matter if you use model.params.ckpt_path=logs/2020-11-09T13-31-51_sflckr/checkpoints/last.ckpt (or, equivalently, add ckpt_path: logs/2020-11-09T13-31-51_sflckr/checkpoints/last.ckpt to the model.params section of the config you are using, e.g. in line 5 of sflckr.yaml) instead of -r logs/2020-11-09T13-31-51_sflckr/checkpoints/last.ckpt on the command line, because it will then load the checkpoint non-strictly and simply ignore unexpected keys. See

taming-transformers/taming/models/cond_transformer.py

Line 32 in 8549d3a

if ckpt_path is not None:

from taming-transformers.

pesser commented on August 23, 2024

Hi @ink1,
the vqgan weights of both the first stage (encoding images) and the cond stage (encoding segmentations/depth/...) are actually part of the transformer checkpoints. We added a small script scripts/extract_submodel.py such that you can now extract the sflckr segmentation map vqgan weights with

python scripts/extract_submodel.py logs/2020-11-09T13-31-51_sflckr/checkpoints/last.ckpt sflckr_cond_stage_model.ckpt cond_stage_model

Also added an example config for training a segmentation map cond stage in configs/sflckr_cond_stage.yaml. If you want to finetune the extracted weights you can adjust the data section of that config and then run

python main.py --base configs/sflckr_cond_stage.yaml -t True --gpus 0, model.params.ckpt_path=sflckr_cond_stage_model.ckpt

Best
Patrick

from taming-transformers.

ink1 commented on August 23, 2024

Much appreciated, Patrick! This is all great!
Due to a lack of compute budget I'm trying to leverage your models as much as I can. I do understand that both vqgan models (image and segmentation) are inside of the sflckr checkpoint.
My main problem is when I'm trying to resume from it, I hit
RuntimeError: Error(s) in loading state_dict for Net2NetTransformer:
Unexpected key(s) in state_dict: "cond_stage_model.colorize".
My understanding is that this is the top layer conditionally added to vqgan models.
The condition is the presence of a particular parameter, probably this one
https://github.com/CompVis/taming-transformers/blob/master/configs/sflckr_cond_stage.yaml#L8
But when I tried adding it to sflkr yaml after this line
https://github.com/ink1/taming-transformers/blob/master/configs/sflckr.yaml#L44
it gets rejected. This is why I thought if I load both vqgan models explicitly, I'll be in control.
The funny thing is, on restart after the error above, the sflkr model gets shrunk to under 2 GB (but I provide vqgan image model from its checkpoint) and a subsequent restart works fine. What's even more interesting is that the checkpoints are 4 GB again.
I strongly suspect, I just can't get my configuration straight.
I'm also curious if colorization of segmentation is persistent across runs.
Thanks again @pesser

from taming-transformers.

please share sflckr vqgan for segmentation maps about taming-transformers HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent