
mistnet2's Introduction

mistnet2

Neural Networks with Latent Random Variables in R

Fit models with both complex nonlinearities (using neural networks) and structured outputs (using graphical models).


mistnet2's People

Contributors

davharris

Stargazers

Eyad Sibai, TJ Mahr

mistnet2's Issues

Add a predict function

Basically just sample from z's prior and then feedforward, I think.

Then, once MCMC sampling for z exists (#21), we could condition on partial observations too.
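A minimal sketch of what that could look like, assuming a single hidden layer, a standard normal prior on z, and z appended to the inputs (all names here are illustrative, not mistnet2's actual API):

```r
# Hypothetical predict sketch: sample z from its N(0, 1) prior, append it
# to the inputs, and run a plain feedforward pass. Not mistnet2's real API.
predict_sketch = function(x, w1, b1, w2, b2, n_z, n_samples = 10) {
  sigmoid = function(a) 1 / (1 + exp(-a))
  replicate(n_samples, {
    z = matrix(rnorm(nrow(x) * n_z), nrow = nrow(x))    # draw z from its prior
    h = sigmoid(sweep(cbind(x, z) %*% w1, 2, b1, "+"))  # hidden layer
    sigmoid(sweep(h %*% w2, 2, b2, "+"))                # output layer
  }, simplify = FALSE)
}
```

Each element of the returned list is one Monte Carlo draw of the predictions; averaging across them approximates the marginal prediction.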

Monte Carlo EM

So the package can fit models like the original mistnet package, or fine-tune model estimates that were based on point estimates from BFGS.

Add speed tests

To keep this from being machine-dependent, I could say that the total computation for feedforward or for backprop shouldn't take more than x% longer than the underlying matrix multiplication.

This would help ensure that I'm aware of any situations where I'm adding a lot of overhead to the computation.
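Such a check might look like the following, using a stand-in feedforward (an activation wrapped around the matrix product) rather than the package's real one:

```r
# Sketch of a machine-independent speed test: compare a feedforward pass
# (a stand-in here, not mistnet2's) against the bare matrix multiplication
# it wraps, and report the ratio of elapsed times.
overhead_ratio = function(n = 500, p = 200, reps = 20) {
  x = matrix(rnorm(n * p), n, p)
  w = matrix(rnorm(p * p), p, p)
  feedforward = function(x, w) 1 / (1 + exp(-(x %*% w)))  # stand-in layer
  t_full = system.time(for (i in seq_len(reps)) feedforward(x, w))[["elapsed"]]
  t_mult = system.time(for (i in seq_len(reps)) x %*% w)[["elapsed"]]
  t_full / t_mult
}
```

A test could then assert, for example, `stopifnot(overhead_ratio() < 1.5)` for a 50% tolerance, independent of the machine's absolute speed.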

Improve object/class names

  • priors should probably be weight_priors or something, since priors can go elsewhere
  • error_distributions should probably just be distributions, since they can also refer to priors
  • networks should probably be mistnets, in case I want other kinds of networks (e.g. with other kinds of latent variables)
  • par_skeleton might be better as adjustable_parameters or just parameters, given the way it's used.
  • make_gamlss_distribution should probably just be make_distribution

...

tied Z values

e.g. these repeated observations should have the same Z values in this column

Semantics for sum of log distributions

It would be great to have semantics for adding log distributions to one another (i.e. setting two or more independent objective functions for the same parameters).
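One possible semantics, sketched with plain closures (the names here are illustrative): the "sum" of two log distributions is a new function whose log-density at theta is the sum of the components' log-densities.

```r
# Hypothetical sketch: adding two log distributions yields one objective
# whose value at theta is the sum of two independent log-densities.
add_log_densities = function(f, g) {
  function(theta) f(theta) + g(theta)
}

log_lik   = function(theta) sum(dnorm(c(1.2, 0.8), mean = theta, log = TRUE))
log_prior = function(theta) dnorm(theta, mean = 0, sd = 10, log = TRUE)
log_post  = add_log_densities(log_lik, log_prior)  # one objective, same theta
```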

How to manage distributions with different structure?

Some families have a sigma or tau term for scale or shape. Others have special terms like bd for binomial denominators. Sometimes, these will be adjustable parameters and will need to be handed to optimx. Other times, they won't be. How to structure this so it always makes sense?

clean up optimx `control` list

Right now, passing a value to REPORT or kkt overwrites maximize and starttests.

Also, REPORT seems to get ignored (or maybe isn't even passed where it needs to go):

net = mistnet_fit(net, itnmax = 7000, 
                  control = list(maximize = TRUE, starttests = FALSE, 
                                 kkt = FALSE, REPORT = 1))

Implement an alternative to dropout that doesn't include sampling

Sampling interferes with second-order batch methods like L-BFGS-B, but dropout is probably important.

Sida Wang and Christopher Manning's "Fast Dropout Training" might be optimal. In the meantime, the variant of an L2 penalty they suggest between Eqn 9 and Eqn 10 is easy: instead of just having the penalty for weight ij be proportional to its square, multiply that square by c, which is basically the variance of the input variable multiplied by that weight.

Also, batch normalization might solve some of the same problems?
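My reading of that penalty, sketched below: scale each squared weight by the empirical variance of the input feature it multiplies. This is an interpretation of the description above, not a verified implementation of Wang and Manning's formula.

```r
# Sketch of the suggested L2 variant: the penalty for weight ij is its
# square times c, taken here as the variance of input feature i.
# An interpretation of the issue text, not Wang & Manning's exact formula.
scaled_l2_penalty = function(w, x, lambda = 1) {
  # w: p-by-k weight matrix; x: n-by-p input matrix
  c_i = apply(x, 2, var)    # per-feature input variance
  lambda * sum(w^2 * c_i)   # c_i recycles down each column of w^2
}
```

Unlike dropout, this penalty is deterministic, so it shouldn't interfere with second-order batch methods like L-BFGS-B.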

How to structure penalties?

Penalties could arise in a number of different places:

  • Penalties on weights (and maybe even biases)
  • Penalties on the latent variables
  • Penalties on the outputs

How should I structure and manage them all?
