wyu-du / gp-vae Goto Github PK

This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Process Priors" (SPNLP@ACL2022)

Home Page: http://arxiv.org/abs/2204.01227

License: Apache License 2.0

Shell 0.17% Python 99.83%

variational-autoencoder gaussian-processes text-generation t5-model pointer-generator lstm transformer style-transfer paraphrase-generation

gp-vae's People

Contributors

Stargazers

Watchers

Forkers

readmlll changzhijiang whackerlane

gp-vae's Issues

from copynet.utils import kl_anneal_weight

Is this a custom function？ Can provide it?

How to solve the problem of "zero kld!!!"?

When I am trying to train the t5-gpave, there is a problem of "zero kld!!!".

Also, the "zero kld!!!" is also in the training of the LSTM-based variational encoder-decoder with GP priors.
Thank you for your help and I am looking forward to hearing from you.

inference speed and diversity

Hi!
Thanks for your great work! I'm working on getting results on another paraphrase dataset under T5 + GP prior setting. I have the following two questions:

I found that the generation speed is relatively slow due to the inference batch size 1, and something get wrong if I change it. Is there any way to speed up the generation?
if I want to get a trade-off between quality and diversity, is it suitable to set the scalar to 7 just like it used in the paper for the paraphrasing task?

prior_mean = torch.zeros([hidden_states.size(0), posterior_mean.size(-1)]) \
    .to(posterior_mean.dtype).to(posterior_mean.device)
prior_logvar = torch.zeros([hidden_states.size(0), posterior_logvar.size(-1)]) \
      .to(posterior_logvar.dtype).to(posterior_logvar.device)

wyu-du / gp-vae Goto Github PK

gp-vae's People

Contributors

Stargazers

Watchers

Forkers

gp-vae's Issues

from copynet.utils import kl_anneal_weight

How to solve the problem of "zero kld!!!"?

inference speed and diversity

output data of your experiment

Pre-trained model weights

请问可以提供其他数据集吗

prior_logvar should be 1 when calculating KL

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent