erzakiev / frictionless Goto Github PK
View Code? Open in Web Editor NEWA blackbox statistical optimization approach to find genetic signatures in scRNA-seq data
A blackbox statistical optimization approach to find genetic signatures in scRNA-seq data
The tie-adjustment factor
overflows the int
format... Observed when, for example, the number of dropouts (i.e. 0 expression) was tied many times in a big sample of INT_MAX
value which is +2147483647
It was meant to be a troubleshooting feature that would allow for a precise control of the algorithm, but it is clearly unnecessary (and actually annoying) for the end user
Ditto
When the signature size is not fixed at a certain number, we faced a problem of explosive growth of the number of combinations in which a signature can be constructed (the issue is discussed in the study of a similar problem in the case subnetworks).
We tried penalization of the Friedman's S for each sample by dividing it with the number of genes in the solution n
exponentiated to some real power alpha
[0.1 ... 3]: S'=S/(n^alpha)
, yet it either produced very big signatures (read half or more of the input gene universe) with the lower values of alpha
, or reduced the signatures to the minimum allowed number (3 in our case) in case of higher alphas
.
We invite anyone to contribute their ideas on how to properly penalize the statistic so that the signatures don't expand to the totality of the inputted gene space yet don't collapse to the minimum number of genes allowed (i.e. 3).
With the help of Carola Doerr we can avoid redundant calculations, leading to the same basins of attraction over and over again (which is the case for the current vanilla implementation). She suggested using a hybrid taboo/double verification approach, which is in active development and will be posted shortly along with the details of the approach.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.