shouldn't it be D_real.backward(one)? (wgan-gp issue, closed)

caogang commented on June 15, 2024

shouldn't it be D_real.backward(one)?

from wgan-gp.

Comments (6)

caogang commented on June 15, 2024

I pass retain_graph=True to autograd.grad so that the graph can be backpropagated through twice; this has the same effect as retain_variable=True. I also want only the gradient w.r.t. the input, without accumulating anything into .grad, so I use autograd.grad instead of backward(). https://github.com/caogang/wgan-gp/blob/master/gan_toy.py#L204-L207
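
For readers following along, here is a minimal sketch of that pattern, not the repository's actual code. The toy critic, tensor shapes, and variable names are assumptions made for illustration; the only point is that autograd.grad hands back the gradient w.r.t. the input as a tensor and leaves every .grad attribute untouched until backward() is called on the penalty.

```python
import torch
from torch import autograd, nn

torch.manual_seed(0)

# Hypothetical toy critic; the layer sizes are illustrative only.
critic = nn.Sequential(nn.Linear(2, 8), nn.Tanh(), nn.Linear(8, 1))

real = torch.randn(4, 2)
fake = torch.randn(4, 2)

# Random interpolation between real and fake samples.
alpha = torch.rand(4, 1)
interpolates = (alpha * real + (1 - alpha) * fake).requires_grad_(True)
d_interpolates = critic(interpolates)

# autograd.grad returns the gradient w.r.t. the input directly.
# create_graph=True makes the penalty itself differentiable, and
# retain_graph=True keeps the graph available for a later backward pass.
(gradients,) = autograd.grad(
    outputs=d_interpolates,
    inputs=interpolates,
    grad_outputs=torch.ones_like(d_interpolates),
    create_graph=True,
    retain_graph=True,
)

# Nothing has been accumulated into .grad at this point.
grads_untouched = interpolates.grad is None and all(
    p.grad is None for p in critic.parameters()
)

# Penalize deviation of the per-sample gradient norm from 1.
gradient_penalty = ((gradients.norm(2, dim=1) - 1) ** 2).mean()

# Only this backward() call accumulates into the parameters' .grad.
gradient_penalty.backward()
params_have_grad = any(p.grad is not None for p in critic.parameters())
```

Using backward() on d_interpolates instead would have written into .grad of the critic's parameters as a side effect, which is what the comment above is trying to avoid.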

caogang commented on June 15, 2024

No. D_real is what we want to maximize, so we minimize the loss (-D_real).
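
A small sketch of why this works, assuming a hypothetical one-layer critic and a tensor named mone holding -1 (both are illustrative, not taken from the repository): calling backward with -1 as the incoming gradient yields the same parameter gradients as calling backward on the negated output, so a descent step on them pushes D_real up.

```python
import torch
from torch import nn


def critic_weight_grad(use_mone):
    # Re-seed so both calls build the same critic and the same batch.
    torch.manual_seed(0)
    critic = nn.Linear(3, 1)  # hypothetical stand-in for the critic
    real = torch.randn(5, 3)
    d_real = critic(real).mean()
    if use_mone:
        # Backpropagate with -1 as the upstream gradient.
        mone = torch.tensor(-1.0)
        d_real.backward(mone)
    else:
        # Backpropagate the explicit loss -D_real.
        (-d_real).backward()
    return critic.weight.grad.clone()


grad_with_mone = critic_weight_grad(use_mone=True)
grad_with_negated_loss = critic_weight_grad(use_mone=False)
```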

ypxie commented on June 15, 2024

Thanks for your reply. That makes sense, but why does the author of WGAN do the opposite in
https://github.com/martinarjovsky/WassersteinGAN/blob/master/main.py?

caogang commented on June 15, 2024

Maybe the output of net_d is treated as the loss or error in the WGAN implementation. It depends on how net_d is defined.

ypxie commented on June 15, 2024

Hello, thanks for the explanation. I have a question: since you backpropagate through the network twice, why is retain_variable=True not used in the code? And why not call D_cost.backward() directly, just once?

eifuentes commented on June 15, 2024

@ypxie @caogang See the WGAN author's comment in martinarjovsky/WassersteinGAN#9: the two approaches are equivalent as long as you are consistent.
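
One way to see the equivalence, as a rough sketch rather than either repository's code: take a critic trained on D(fake) - D(real), and a second critic whose parameters are the exact negation of the first, trained on D(real) - D(fake). The linear critics and the names critic_a and critic_b are assumptions for illustration. The two losses coincide and the gradients are mirror images, so the two critics stay negations of each other after every update.

```python
import torch
from torch import nn

torch.manual_seed(0)
real = torch.randn(6, 2)
fake = torch.randn(6, 2)

# Convention A: the critic output is a score to maximize on real data,
# so the loss to minimize is D(fake) - D(real).
critic_a = nn.Linear(2, 1)

# Convention B (opposite sign): the same critic with every parameter
# negated, trained on D(real) - D(fake).
critic_b = nn.Linear(2, 1)
with torch.no_grad():
    critic_b.weight.copy_(-critic_a.weight)
    critic_b.bias.copy_(-critic_a.bias)

loss_a = critic_a(fake).mean() - critic_a(real).mean()
loss_b = critic_b(real).mean() - critic_b(fake).mean()
loss_a.backward()
loss_b.backward()

# Same loss value, and gradients that are exact mirror images.
same_loss = torch.allclose(loss_a, loss_b, atol=1e-6)
mirrored_grads = torch.allclose(
    critic_a.weight.grad, -critic_b.weight.grad, atol=1e-6
)
```

The generator update has to follow the same convention as the critic it is paired with; mixing the two is where things go wrong.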