rockeycoss / spo

107.0 4.0 2.0 31.2 MB

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Home Page: https://arxiv.org/abs/2406.04314

Python 100.00%

diffusion-models dpo sdxl text-to-image text-to-image-generation


spo's Issues

About training loss

I trained the model with your training code, and the recorded loss curve looks odd: train_loss fluctuates violently around 0.693 throughout. The trained model also performs very poorly. Could you share your training logs? Is there anything specific I need to watch for during training?
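A note on the 0.693 figure: it equals ln 2, which is the value a DPO-style logistic loss takes when the preferred and rejected samples are scored identically. The sketch below assumes the objective has the usual form -log(sigmoid(beta * margin)); the repository is tagged dpo, but the exact loss in the training code is not shown in this thread. The names dpo_style_loss, margin, and beta are illustrative, not taken from the repo.

```python
import math

def dpo_style_loss(margin: float, beta: float = 1.0) -> float:
    """Return -log(sigmoid(beta * margin)) for one preference pair.

    `margin` stands for the (hypothetical) difference between the model's
    implicit reward for the preferred and the rejected sample.
    """
    x = beta * margin
    # -log(sigmoid(x)) == softplus(-x) == log(1 + exp(-x)).
    # Written in the numerically stable form max(-x, 0) + log1p(exp(-|x|)).
    return max(-x, 0.0) + math.log1p(math.exp(-abs(x)))

# With a zero margin the loss is exactly ln 2.
print(round(dpo_style_loss(0.0), 3), round(math.log(2), 3))

# A positive margin pushes the loss below ln 2, a negative one above it.
print(dpo_style_loss(2.0) < math.log(2) < dpo_style_loss(-2.0))
```

Under that assumption, a loss that hovers around 0.693 suggests the margin is staying near zero on average, meaning the model is not yet separating preferred from rejected samples. It does not by itself say why.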

Launching inference_spo_sdxl.py does not finish. "1Torch was not compiled with flash attention."

(venv) C:\AI\SPO>py inference_spo_sdxl.py
C:\AI\SPO\venv\Lib\site-packages\diffusers\utils\outputs.py:63: UserWarning: torch.utils._pytree._register_pytree_node is deprecated. Please use torch.utils._pytree.register_pytree_node instead.
torch.utils._pytree._register_pytree_node(
C:\AI\SPO\venv\Lib\site-packages\huggingface_hub\file_download.py:1132: FutureWarning: resume_download is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use force_download=True.
warnings.warn(
Loading pipeline components...: 100%|████████████████████████████████████████████████| 7/7 [00:00<00:00, 11.60it/s]
0%| | 0/50 [00:00<?, ?it/s]C:\AI\SPO\venv\Lib\site-packages\diffusers\models\attention_processor.py:1244: UserWarning: 1Torch was not compiled with flash attention. (Triggered internally at ..\aten\src\ATen\native\transformers\cuda\sdp_utils.cpp:455.)
hidden_states = F.scaled_dot_product_attention(

I launched it once before, and it downloaded a large number of models.
Now it is stuck here, seemingly forever, with GPU usage spiking.

I'm on a 3070 with 8 GB of VRAM. Could that be the cause?
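The flash attention message in the log above is a UserWarning rather than an error, so the VRAM question is worth a rough check. The sketch below is a back-of-the-envelope estimate only. It assumes the SDXL UNet has about 2.6 billion parameters (the figure reported for SDXL) and counts weights alone, ignoring the text encoders, the VAE, and activations. Whether inference_spo_sdxl.py loads the model in fp16 or fp32 is not stated in this thread, and weights_gib is a hypothetical helper.

```python
def weights_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30

# Assumption: approximate parameter count of the SDXL UNet.
SDXL_UNET_PARAMS = 2.6e9

fp32 = weights_gib(SDXL_UNET_PARAMS, 4)  # 4 bytes per float32 weight
fp16 = weights_gib(SDXL_UNET_PARAMS, 2)  # 2 bytes per float16 weight

print(f"fp32: {fp32:.1f} GiB, fp16: {fp16:.1f} GiB")
```

Under these assumptions, the UNet weights alone exceed 8 GB in fp32 and leave limited headroom in fp16, so memory pressure on an 8 GB card is a plausible cause of the slowdown, though not a confirmed one.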

Out of memory using default config

Hi, thanks for your great work. When I try to train SPO_SDXL with the default config file spo_sdxl_4k-prompts_num-sam-2_3-is_10ep_bs2_gradacc2.py on four 80 GB A800 GPUs, I get a CUDA out of memory error. Could you confirm the training configuration you used?

About 1.5

Interesting, thanks for the XL model. Will you release a 1.5 model? The abstract says:
"Our experiments with Stable Diffusion v1.5 and SDXL demonstrate that SPO significantly outperforms the latest Diffusion-DPO in aligning generated images with complex, detailed prompts and enhancing aesthetics, while also achieving more than 20× times faster in training efficiency."

About reward model dataset or reward model

Hi, great work!
I see you plan to release the training code. Will you also release the reward model, or the dataset used to train it?

[Question] Why does the SDXL LoRA have no effect in Stable Diffusion WebUI?

I found the WebUI version in the end, but I'm still interested in the difference between the two versions. How can one be converted into the other?
