
Comments (7)

WorldEditors commented on June 1, 2024

Good question, since Meena (Google) also claims that top-K sampling is enough to obtain diversity. Our view is that the diversity from top-K sampling differs from the diversity from a latent space. A neural network (NN) is, after all, a mapping function, and a function maps each input to a single output; by itself it cannot represent a one-to-many mapping.

As an intuitive explanation, suppose A, B, and C are all correct responses. The NN may end up producing a response D that averages out A, B, and C. However, D may carry none of the information in A, B, or C and may have a completely different meaning. With top-K sampling we might then sample three responses E, F, and G around D, which does not guarantee recovering A, B, and C.
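The averaging argument above can be illustrated with a toy sketch. It assumes responses can be treated as points in a 2-D plane and that sampling near D behaves like a small perturbation of D; the coordinates, the function names, and the radius are all hypothetical and are not taken from the model discussed here.

```python
import math
import random

# Hypothetical "correct responses" A, B, C, placed on a unit circle.
TARGETS = {
    "A": (1.0, 0.0),
    "B": (-0.5, math.sqrt(3) / 2),
    "C": (-0.5, -math.sqrt(3) / 2),
}


def mean_point(points):
    """Average of 2-D points: what a squared-error fit to one input converges to."""
    pts = list(points)
    return (
        sum(p[0] for p in pts) / len(pts),
        sum(p[1] for p in pts) / len(pts),
    )


def distance(p, q):
    return math.hypot(p[0] - q[0], p[1] - q[1])


def sample_around(center, radius, n, seed=0):
    """Stand-in for sampling near D: small random perturbations of the center."""
    rng = random.Random(seed)
    return [
        (center[0] + rng.uniform(-radius, radius),
         center[1] + rng.uniform(-radius, radius))
        for _ in range(n)
    ]


def decode_with_latent(z):
    """Stand-in for a latent-conditioned decoder: z selects one response mode."""
    return TARGETS["ABC"[z]]


D = mean_point(TARGETS.values())           # the averaged response
EFG = sample_around(D, radius=0.1, n=3)    # three samples near D
closest_gap = min(distance(s, t) for s in EFG for t in TARGETS.values())
recovered = [decode_with_latent(z) for z in range(3)]

print("D =", D)
print("closest distance from E, F, G to any of A, B, C:", closest_gap)
print("responses selected by z = 0, 1, 2:", recovered)
```

In this sketch D sits at the centre of the triangle, so every sample near D stays far from A, B, and C, while conditioning on z selects each response directly.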

from research.

WorldEditors commented on June 1, 2024

For the second question, the BOW loss and the generative loss actually "push" the latent variable to "leak" as much information about the response as possible. Collapsing the latent distribution to a single pattern would therefore work against the training objective, and indeed we have never observed that phenomenon.


songyouwei commented on June 1, 2024

It seems that the only loss directly connected to the latent variable is the non-autoregressive BoW loss?


WorldEditors commented on June 1, 2024

The main contribution of the latent variable z is to improve the generative model p(r|c,z); the BOW loss is regarded as an auxiliary loss. Theoretically, we do not need the BOW loss for learning. In practice, however, the BOW loss is important for accelerating the convergence of the recognition network p(z|c,r), so that the generative model p(r|c,z) receives the correct input z.
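For readers unfamiliar with the term, a generic bag-of-words loss can be sketched as follows. This is a minimal illustration, not the implementation used in this repository: it assumes a single vector of vocabulary logits predicted from the latent state, and the names `log_softmax`, `bow_loss`, `vocab_logits`, and `response_token_ids` are hypothetical.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def bow_loss(vocab_logits, response_token_ids):
    """Negative log-likelihood of the response tokens under ONE shared
    vocabulary distribution, so the order of the tokens is ignored."""
    log_probs = log_softmax(vocab_logits)
    return -sum(log_probs[t] for t in response_token_ids)


# Hypothetical 4-word vocabulary and a 3-token response.
logits = [2.0, 0.5, -1.0, 0.0]
print(bow_loss(logits, [0, 2, 1]))
print(bow_loss(logits, [1, 0, 2]))  # same tokens in another order
```

Because every token is scored against the same distribution, the loss depends only on which words appear in the response, which is why it gives the latent variable a direct training signal.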


eyuansu62 commented on June 1, 2024

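@WorldEditors Hello, how are the latent variables determined? Are different latent variables chosen for different datasets? Do the values of the latent variables also need to be specified in advance, or are they just vectors?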


WorldEditors commented on June 1, 2024

We only need to specify the number of classes (K) of the latent variable manually. The values of the latent vectors are optimized during training. How to choose K remains an open question, and we would like to see more future work on this problem.
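One way to picture this is a table of K vectors, where only K and the vector size are fixed in advance and the entries are updated by training. The sketch below is an assumption for illustration only: the class `LatentTable`, its methods, the function `sample_latent`, and the sizes K=4 and dim=3 are hypothetical and do not come from this repository.

```python
import random


class LatentTable:
    """K learnable vectors, one per value of the discrete latent variable z."""

    def __init__(self, num_classes, dim, seed=0):
        rng = random.Random(seed)
        # Only the shape (K x dim) is chosen by hand; values start random.
        self.vectors = [
            [rng.gauss(0.0, 0.02) for _ in range(dim)]
            for _ in range(num_classes)
        ]

    def lookup(self, z):
        return self.vectors[z]

    def sgd_step(self, z, grad, lr=0.1):
        """Plain gradient step on the vector selected by z."""
        self.vectors[z] = [v - lr * g for v, g in zip(self.vectors[z], grad)]


def sample_latent(probs, rng):
    """Draw z from a categorical distribution over the K classes."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


table = LatentTable(num_classes=4, dim=3)
z = sample_latent([0.1, 0.2, 0.3, 0.4], random.Random(0))
before = list(table.lookup(z))
table.sgd_step(z, grad=[1.0, 1.0, 1.0])  # hypothetical gradient
print("z =", z, "before:", before, "after:", table.lookup(z))
```

Changing K only changes the number of rows in the table; nothing about the meaning of each row is specified by hand.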


eyuansu62 commented on June 1, 2024

Thanks a lot!

