Giter VIP home page Giter VIP logo

Comments (7)

maxime-langevin avatar maxime-langevin commented on August 16, 2024 1

Currently the framework for dealing with multiple batches would be to instanciate the dataset using the "get_attributes_from_list" method (and giving it a gene-expression matrix for each batch), that yields a batch-specific prior. I agree that it is particularly important when dealing with different technologies that might have very different library size distributions.

from scvi-tools.

jeff-regier avatar jeff-regier commented on August 16, 2024

Probably we also want a "prior" on library size that depends on batch id, aka a conditional prior. I'm not sure whether we already have that. In the paper, library size doesn't depend on batch id but it probably should -- especially if one batch is scRNA-seq data and the other is smFISH data.

from scvi-tools.

maxime-langevin avatar maxime-langevin commented on August 16, 2024

@jeff-regier, I implemented what we discussed to try to improve the variational distribution (namely dropping the log-nomal for a standard normal), but it gives poorer and more unstable scores for log-likelihood (probably because there's not the KL term anymore to roughly guide the encoder).
To be sure that it was worth it to get a better variational distribution, I ran scVI without trying to learn the library size (takin the exact value everytime rather than encoding/decoding it).
It converges faster but overall does not give a better likelihood than the one in Romain's paper.

from scvi-tools.

jeff-regier avatar jeff-regier commented on August 16, 2024

Isn't there still a KL term for library size? It's just a KL divergence btw a normal (the variational dist) and a standard normal (the prior), rather a KL divergence between a log normal and a log normal, right?

from scvi-tools.

maxime-langevin avatar maxime-langevin commented on August 16, 2024

Yes there is still a KL term, what I wanted to say was that in the case of the log-normal prior the KL term actually gives the model information on the mean and variance of the log library size, while the KL between the normal and standard normal doesn't.

from scvi-tools.

jeff-regier avatar jeff-regier commented on August 16, 2024

OK, let's make sure we're calculating log p(x) correctly, as discussed. If it still isn't better, we can probably conclude that a log normal isn't that bad of a variational distribution for library size.

from scvi-tools.

jeff-regier avatar jeff-regier commented on August 16, 2024

@maxime1310 should we close this and stick with the variational distribution we have? It seems like log normal may be pretty good after all.

from scvi-tools.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.