Comments (7)
This is a known problem with latent SDEs -- the diffusion is always a bit too small. I think this could be mitigated by tweaking the training objective (probably the reconstruction loss encourages small diffusions), or by additionally training wrt a discriminator (see Neural SDEs as Infinite-Dimensional GANs). To my knowledge no-one has really tried this, or tried working out the details though.
from torchsde.
Have you tried increasing the penalty on the path-space KL?
The prior diffusion is unlikely to collapse, since it's forced to match multiple posterior diffusions approximately, simultaneously.
from torchsde.
This is a known problem with latent SDEs -- the diffusion is always a bit too small. I think this could be mitigated by tweaking the training objective (probably the reconstruction loss encourages small diffusions), or by additionally training wrt a discriminator (see Neural SDEs as Infinite-Dimensional GANs). To my knowledge no-one has really tried this, or tried working out the details though.
Thanks for the reply. I am trying to adjust some loss functions.
Have you tried increasing the penalty on the path-space KL?
The prior diffusion is unlikely to collapse, since it's forced to match multiple posterior diffusions approximately, simultaneously.
Thank you, so which loss should I increase log_pxs or logqp0 + logqp_path?
from torchsde.
I'm referring to the logqp_path
part.
from torchsde.
I'm referring to the
logqp_path
part.
Yes, it helps! Thank you guys for the timely reply. I am closing this issue.
from torchsde.
I'm referring to the
logqp_path
part.Yes, it helps! Thank you guys for the timely reply. I am closing this issue.
Awesome! Feel free to reopen if/when you have follow-up questions.
from torchsde.
This is really cool to see -- thank you for sharing!
Honestly, "better neural SDEs" is still an open research topic. I would love to see more work like yours in this space. :)
from torchsde.
Related Issues (20)
- vector-valued SDE cumbersome workflow HOT 1
- Zero drift and zero diffusion matrices lead to non-zero changes of variable
- Irregular data and sampling posterior in latent_sde_lorenz.py
- Low CPU and GPU usage in training sde_gan, Seeking Help to Improve Performance. HOT 2
- Different `t` for data in a minibatch HOT 5
- torchsde pypi package is misformated HOT 27
- Deprecation torchsde version HOT 5
- Something went wrong Expecting value: line 1 column 1 (char 0) HOT 1
- Incorporating real stock time series data
- SDE-LSTM structure for time series forecasting
- Performance Improvement Inquiry: Experiencing Slow Execution with TorchSDE HOT 2
- Latent SDE failed to generate longer samples
- issue with my text to image ai Device type privateuseone is not supported for torch.Generator() api. HOT 1
- 我的 Mac上 只有torchsde-0.2.6.dist-info,怎样才能安装insightface
- 如果 torchsde 当前没有解决这个问题的新版本,你可以联系该项目的维护者或作者
- sdeint_adjoint for f_and_g_prod type SDE, how to set parameter?
- failure to install pip install qiskit-aer to run qsvm on ibm simulator
- learning the generative model of periodical process HOT 1
- extra_solver_state Documentation HOT 1
- WARNING: Error parsing dependencies of torchsde HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from torchsde.