Hello, first of all, thank you for your great work. I just inserted the self-atten

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Insert the attention mechanism in the codes to the stylegan2-ada but fail about gigagan-pytorch HOT 7 CLOSED

lucidrains commented on July 27, 2024

Insert the attention mechanism in the codes to the stylegan2-ada but fail

from gigagan-pytorch.

Comments (7)

landian60 commented on July 27, 2024 2

@lucidrains Thanks for your kindness!
The results changed after a few modifications, and I will keep training to see what happens.
By the way, I am curious why you dont't let the style codes participate in the attention process? for example, let the style codes modulate the weights of q&k on self-attention.
Tks.

from gigagan-pytorch.

landian60 commented on July 27, 2024 1

tks! You are so nice. Waiting for your continuing work.

from gigagan-pytorch.

lucidrains commented on July 27, 2024

@landian60 i can do a quick review of your integration code if you have a public repo or a gist

from gigagan-pytorch.

lucidrains commented on July 27, 2024

i'm actually not too sure if they also used the l2 distance attention for cross attention as well as the extra transformer blocks they appended to the clip text encoder

maybe i should make it an option to fallback to regular attention

from gigagan-pytorch.

lucidrains commented on July 27, 2024

glad to hear it is working!

the style codes already participate in modulating the convolutional kernels. as for the text conditioning, that is cross attended to by each image feature map token

from gigagan-pytorch.

landian60 commented on July 27, 2024

Hello Phil,
I have a question about training the model with cross attention. I only used clip contrastive loss and did not add any other losses. And the generative results always became the same color and the G loss was very big even to NaN. I think the reason is the model collapse. Have you encountered similar issues when training the model? And have you noticed that some losses or well-designed discriminator containing some vital structures could help to stabilize the training process with the attention mechanism?
Tks.

from gigagan-pytorch.

landian60 commented on July 27, 2024

@lucidrains

from gigagan-pytorch.

Recommend Projects

Insert the attention mechanism in the codes to the stylegan2-ada but fail about gigagan-pytorch HOT 7 CLOSED

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent