This is a few lightweight classes to apply a trained neural net. The main design principles are:

- Minimal dependencies: The core class should only depend on C++11 and Eigen. The JSON parser to read in NNs also requires Boost `PropertyTree`.
- Flat structure: Each layer in the NN inherits from the `ILayer` or `IRecurrentLayer` abstract base class; the NN itself is just a stack of derived classes.
- Easy to extend: Should cover 95% of the deep network architectures we would realistically consider.
- Hard to break: The NN constructor checks the serialized NN for consistency. To prevent bugs with incorrectly ordered variables, all inputs and outputs are stored in `std::map`s.
We also include converters from several popular formats to the lwtnn JSON format. Currently the following formats are supported:

- AGILEPack
- Keras (currently only dense layers, more will be added as needed)
- Julian's format, based on NumPy and JSON

The converter scripts can be found in `converters/`. Run them with `-h` for more information.
After running `make`, just run `./test-nn.sh`. If nothing goes wrong you should see something like:

    out1 4
    out2 3
    out3 2
    out4 1
There may be some problems if you don't have Python 3 or don't have `pyyaml` installed, but these should be limited to the YAML -> JSON converter. At the very least, calling `./bin/lwtag-test-hlwrapper` with no arguments (which doesn't depend on the converter) should work.
Take a look inside `test-nn.sh`; it does two things:

- Runs `./converters/agile2json.py`. This takes an AGILEPack output and writes a JSON file to standard out.
- Pipes the output to `./bin/lwtag-test-full`. This will construct a NN from the resulting JSON and run a single test pattern.

Of course this isn't very useful; to do more you have to understand...
Open `include/LightweightNeuralNetwork.hh` and find the class declaration for `LightweightNeuralNetwork`. The constructor takes three arguments:

- A vector of `Input`s: these structures give the variable `name`, `offset`, and `scale`. Note that these are applied as `v = (input + offset) * scale`, so if you're normalizing inputs with some `mean` and `standard_deviation`, these are given by `offset = -mean` and `scale = 1 / standard_deviation`.
- A vector of `LayerConfig` structures. See the section below for an explanation of this class.
- A vector of output names.
The constructor should check to make sure everything makes sense internally. If anything goes wrong it will throw an `NNConfigurationException`.
After the class is constructed, it has one method, `compute`, which takes a `map<string, double>` as an input and returns a `map` of named outputs (of the same type). It's fine to give `compute` a map with more arguments than the NN requires, but if some argument is missing it will throw an `NNEvaluationException`. All the exceptions inherit from `LightweightNNException`.
The `Stack` class is initialized with two parameters: the number of input parameters, and a `std::vector<LayerConfig>` to specify the layers. Each `LayerConfig` structure contains:

- A vector of `weights`. This can be zero-length, in which case no matrix is inserted (but the bias and activation layers are).
- A `bias` vector. Again, it can be zero-length for no bias in this layer.
- An `activation` function. Defaults to `LINEAR` (i.e. no activation function).
Note that the dimensions of the matrices aren't specified after the `n_inputs` in the `Stack` constructor, because this should be constrained by the dimensions of the `weights` vectors. If something doesn't make sense, the constructor should throw an `NNConfigurationException`.
The `Stack::compute(VectorXd)` method will return a `VectorXd` of outputs.
The `lwtnn-test-arbitrary-net` executable takes in a JSON file along with two text files, one to specify the variable names and another to give the input values. Run with no arguments to get help.
Currently we support LSTMs in sequential models. The low-level interface is implemented as `RecurrentStack`. See `lwtnn-test-rnn` for a working example.
If you find a bug in this code, or have any ideas, criticisms, etc., please email me at [email protected].
- The copy and assignment constructors for `LightweightNeuralNetwork` and `Stack` are currently deleted, because the defaults would cause all kinds of problems and I'm too lazy to write custom versions. It's not clear that we'll need them anyway, but if someone ends up wanting something like a `std::map<XXX, LightweightNeuralNetwork>` I could add them.