Would you be willing to share the code used for your training process? (marigold issue, 6 comments, closed)

prs-eth commented on July 17, 2024

Would you be willing to share the code used for your training process?

from marigold.

Comments (6)

markkua commented on July 17, 2024 4

Thanks for your interest in our work. We do have a plan to release training and evaluation code in the future. However, the schedule is not clear yet. Please stay tuned for future updates.

Magicboomliu commented on July 17, 2024 3

I found that it can be trained by following the structure of the pipeline design and adapting the HuggingFace text_to_image.py example, changing only the UNet part.

I have implemented an unofficial training version using HuggingFace's diffusers. With a batch size of 1 or 2, training takes about 21 GB of VRAM. I am not sure whether it reproduces the paper's results. Basically, the authors' claim that it can be trained on a single RTX 4090 is true :)
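For readers wondering what "changing only the UNet part" might involve, here is a minimal sketch. It is not the unofficial implementation mentioned above. It assumes, based on my reading of the Marigold paper, that the UNet's first convolution is widened from 4 to 8 input channels (image latent plus depth latent) by duplicating the pretrained kernels and halving them. The function name `expand_conv_in_weight` is hypothetical, the `(320, 4, 3, 3)` weight shape is an assumption about Stable Diffusion's first conv layer, and NumPy stands in for the real framework so the arithmetic is easy to check.

```python
import numpy as np

def expand_conv_in_weight(weight: np.ndarray) -> np.ndarray:
    """Double the input channels of a conv weight shaped (out, in, kH, kW).

    The kernels are duplicated along the input-channel axis and halved, so
    feeding two identical latents reproduces the original layer's output.
    """
    return np.concatenate([weight, weight], axis=1) / 2.0

rng = np.random.default_rng(0)

# Stand-in for the pretrained first-layer weights (shape is an assumption).
w_old = rng.standard_normal((320, 4, 3, 3))
w_new = expand_conv_in_weight(w_old)

# Apply both kernels to one 3x3 input patch (a single convolution position).
patch = rng.standard_normal((4, 3, 3))
doubled_patch = np.concatenate([patch, patch], axis=0)
out_old = np.einsum("oikl,ikl->o", w_old, patch)
out_new = np.einsum("oikl,ikl->o", w_new, doubled_patch)

print(w_new.shape)
print(np.allclose(out_old, out_new))
```

The point of the halving is that the widened layer starts out behaving like the pretrained one when both halves of the input agree, which keeps the initial activations in a familiar range. In an actual diffusers-based script the same operation would be applied to the UNet's first convolution before fine-tuning.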

EasonChen99 commented on July 17, 2024

Thank you for sharing this! I will try it at the earliest opportunity and will share any outcomes here.

dddb11 commented on July 17, 2024

Have you tried the "Annealed multi-resolution noise" mentioned in the paper? I'm not sure if I set it right in my training.
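For reference, one plausible reading of "annealed multi-resolution noise" is sketched below. It is an illustration under stated assumptions, not the authors' code: Gaussian noise is drawn at several coarser resolutions, upsampled, and summed with a geometrically decaying weight, and the weight is scaled linearly by `t / num_train_timesteps` so the result falls back to plain Gaussian noise at `t = 0`. The helper names, the factor-of-2 pyramid, the nearest-neighbour upsampling, the linear annealing rule, and the base strength of 0.9 are all assumptions.

```python
import numpy as np

def upsample_nearest(x, height, width):
    """Nearest-neighbour resize of a (C, h, w) array to (C, height, width)."""
    rows = np.arange(height) * x.shape[1] // height
    cols = np.arange(width) * x.shape[2] // width
    return x[:, rows[:, None], cols[None, :]]

def multi_res_noise(shape, strength, rng, max_levels=10):
    """Sum Gaussian noise over a resolution pyramid, then rescale to unit std."""
    c, h, w = shape
    noise = rng.standard_normal(shape)
    for level in range(1, max_levels):
        lh, lw = h // 2**level, w // 2**level
        if lh < 1 or lw < 1:
            break
        coarse = rng.standard_normal((c, lh, lw))
        # Coarser levels contribute with weight strength**level.
        noise += strength**level * upsample_nearest(coarse, h, w)
    return noise / noise.std()

def annealed_strength(base_strength, t, num_train_timesteps):
    """Assumed linear annealing: full strength near t = T, zero at t = 0."""
    return base_strength * t / num_train_timesteps

rng = np.random.default_rng(0)
latent_shape = (4, 32, 32)
strength_t = annealed_strength(0.9, t=500, num_train_timesteps=1000)
noise = multi_res_noise(latent_shape, strength_t, rng)
print(noise.shape, strength_t)
```

In a training loop, `noise` would replace the usual `randn_like` sample that is added to the depth latent at timestep `t`. Whether this matches the paper's exact schedule is something only the official code can confirm.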

Magicboomliu commented on July 17, 2024

Oh, on that point: I did not implement the annealed multi-resolution noise.

snowflakewang commented on July 17, 2024

Hello, sorry to bother you. First, thanks a lot for writing the training pipeline code. Have you trained an image-to-depth diffusion model with your scripts, and if so, did it produce good results?
Thank you! :)
