Hi, thanks for your work and codes. I'm confused about the d

Questions about debiased_loss (otdd issue, closed)

microsoft commented on May 27, 2024

Questions about debiased_loss

from otdd.

Comments (5)

toooooodo commented on May 27, 2024

Oh! The class indices in D2 run from 11 to 19 when debiased_loss is True, so the class distance indexing is correct and my previous understanding was wrong. But I am still confused about why we need to compute class distances within the same dataset when debiased_loss is True.

dmelis commented on May 27, 2024

Hi @toooooodo. When debiased_loss=True, we also need to compute label-to-label distances within each of the two datasets. To avoid carrying around 3 different tensors, we stack all of them together in a block-wise matrix of size (k + k') x (k + k'), assuming the datasets have k and k' classes respectively. The diagonal blocks of this matrix are the within-domain label distances, and the off-diagonal blocks (the matrix is symmetric, so the two off-diagonal blocks are transposes of each other) are the usual across-domain label distances that you would get if you ran OTDD with debiased_loss=False. I hope that clarifies it!
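
For readers who want to see the block layout concretely, here is a minimal NumPy sketch. It is not the otdd code: the function name `stack_label_distances`, the variable names, and the toy distance values are all assumptions made up for illustration. It only shows how three label-distance tensors could be stacked into one symmetric (k + k') x (k + k') matrix, and how a class j of the second dataset ends up at row/column k + j.

```python
import numpy as np

def stack_label_distances(W11, W22, W12):
    """Stack within- and across-dataset label distances into one matrix.

    W11: (k, k)   label distances within dataset 1 (hypothetical input)
    W22: (k', k') label distances within dataset 2 (hypothetical input)
    W12: (k, k')  label distances across the two datasets
    """
    W11, W22, W12 = (np.asarray(M, dtype=float) for M in (W11, W22, W12))
    k, k_prime = W12.shape
    assert W11.shape == (k, k) and W22.shape == (k_prime, k_prime)
    # Diagonal blocks: within-dataset distances.
    # Off-diagonal blocks: across-dataset distances and their transpose.
    return np.block([[W11, W12], [W12.T, W22]])

# Toy example with k = 3 classes in D1 and k' = 2 classes in D2.
k, k_prime = 3, 2
W11 = np.array([[0.0, 1.0, 2.0],
                [1.0, 0.0, 1.5],
                [2.0, 1.5, 0.0]])
W22 = np.array([[0.0, 0.7],
                [0.7, 0.0]])
W12 = np.array([[0.3, 0.9],
                [1.1, 0.4],
                [2.2, 1.6]])

W = stack_label_distances(W11, W22, W12)

# Class j of D2 lives at global index k + j in the stacked matrix,
# so the D1-class-0 to D2-class-1 distance is W[0, k + 1].
d_cross = W[0, k + 1]
```

With this layout, slicing `W[:k, k:]` recovers the across-dataset block on its own, which corresponds to what the comment above describes for debiased_loss=False.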

toooooodo commented on May 27, 2024

Thanks for your immediate reply!
I understand that we have 3 tensors (label-to-label distances in D1, label-to-label distances in D2, and label-to-label distances across D1 and D2) and that we stack all of them together into a symmetric matrix of size (k + k') x (k + k'). But why do we need to compute label-to-label distances within each dataset when debiased_loss=True? I don't quite understand this parameter.
Could you please clarify the effect of this parameter and the reason to compute distance within datasets?

dmelis commented on May 27, 2024

Ah, got it. So your question is about how the debiased parameter works in general. When debiased_loss=True we compute an unbiased version of the Sinkhorn divergence: d_debiased(a,b) = d(a,b) - 0.5 * (d(a,a) + d(b,b)). You can check out this paper for details: http://proceedings.mlr.press/v89/feydy19a/feydy19a.pdf, but basically this is done to guarantee that d_debiased(a,a) = 0, which in turn leads to unbiased gradients (note that d(a,a) = 0 does not hold in general for the vanilla Sinkhorn loss).
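
To make the formula concrete, here is a small self-contained NumPy sketch. It is an illustration under stated assumptions, not the code otdd runs: `entropic_ot` is a hypothetical helper implementing a basic log-domain Sinkhorn loop with uniform weights and a squared Euclidean cost, and `eps` and `n_iter` are arbitrary choices. It shows that the vanilla entropic cost of a point cloud against itself is not zero, while the debiased combination is. The self terms d(a,a) and d(b,b) compare a dataset with itself, which is presumably why the within-dataset label distances are needed once labels enter the cost.

```python
import numpy as np

def _logsumexp(M, axis):
    # Numerically stable log-sum-exp along one axis.
    m = M.max(axis=axis, keepdims=True)
    return (m + np.log(np.exp(M - m).sum(axis=axis, keepdims=True))).squeeze(axis)

def entropic_ot(x, y, eps=0.5, n_iter=500):
    """Entropy-regularized OT cost between two uniform point clouds.

    Hypothetical helper for illustration: squared Euclidean cost,
    log-domain Sinkhorn updates, dual objective as the returned value.
    """
    x = np.asarray(x, dtype=float).reshape(len(x), -1)
    y = np.asarray(y, dtype=float).reshape(len(y), -1)
    a = np.full(len(x), 1.0 / len(x))
    b = np.full(len(y), 1.0 / len(y))
    C = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)
    f = np.zeros(len(x))
    g = np.zeros(len(y))
    for _ in range(n_iter):
        # Alternate updates of the two dual potentials.
        f = -eps * _logsumexp(np.log(b)[None, :] + (g[None, :] - C) / eps, axis=1)
        g = -eps * _logsumexp(np.log(a)[:, None] + (f[:, None] - C) / eps, axis=0)
    return float(a @ f + b @ g)

def debiased(x, y, **kwargs):
    # d_debiased(a,b) = d(a,b) - 0.5 * (d(a,a) + d(b,b))
    return entropic_ot(x, y, **kwargs) - 0.5 * (
        entropic_ot(x, x, **kwargs) + entropic_ot(y, y, **kwargs)
    )

x = [0.0, 1.0, 2.0]
y = [0.5, 1.5, 3.0]

vanilla_self = entropic_ot(x, x)   # strictly positive: the entropic bias
debiased_self = debiased(x, x)     # zero by construction
debiased_xy = debiased(x, y)       # positive for two different clouds
```

Comparing `vanilla_self` with `debiased_self` shows the effect described above: the correction terms remove the offset that entropic regularization introduces when a distribution is compared with itself.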

toooooodo commented on May 27, 2024

Thanks! I'll check out this paper.
