
GoMLX -- A full-featured ML Framework ("Jax for Go")

GoMLX is a fast and easy-to-use set of ML libraries and tools. It can be seen as a TensorFlow/Jax/PyTorch for Go.

It is built on top of OpenXLA, a just-in-time compiler of numeric computations to CPU, GPU and TPU. It's the same engine that powers Google's Jax and TensorFlow, and it has the same speed in many cases.

GoMLX Gopher

It was developed primarily as a platform to easily experiment with ML ideas, and to allow one to use Go for ML. Hopefully it can grow beyond that -- see Long-Term Goals below.

It strives to be simple to read and reason about, leading the user to a correct and transparent mental model of what is going on (no surprises) -- aligned with the Go philosophy -- at the cost of being more verbose (more typing) at times.

Documentation is kept up-to-date (if it is not well documented, it is as if the code is not there), and error messages aim to be useful and to make issues easy to solve.

Still under development, and should be considered experimental for now.

Overview

GoMLX has many important components of an ML framework in place, from the bottom to the top of the stack. But it is still only a slice of what a major ML library/framework should provide (like TensorFlow, Jax or PyTorch).

It includes:

  • XLA integration for model training and evaluation -- including GPU (and presumably TPU, but never tested, so likely not working). See the sketch after this list for what the programming model looks like.
  • Autograd: automatic differentiation -- only gradients for now, no Jacobian.
  • Context: automatic variable management for ML models.
  • ML layers library with some of the most popular machine learning "layers": dense (simple FFN layer),
    activation functions, Layer Normalization, Batch Normalization, Convolutions, Pooling, Dropout, Multi-Head-Attention (for transformer layers), PiecewiseLinear (for calibration and normalization).
  • Training library, with some pretty-printing, including plots for Jupyter notebooks using Bash Kernel.
    • Also, various debugging tools: collecting the values of particular nodes for plotting, simply logging node values during training, and stack traces of the code where nodes are created (TODO: automatically print a stack trace when the first NaN appears during training).
  • SGD and Adam optimizers.
  • Various losses and metrics.
  • Examples: Synthetic linear model; Adult/Census model; Cifar-10 demo; Dogs & Cats classifier demo; IMDB Movie Review demo.
  • Pre-Trained models to use: InceptionV3 (image model) -- more to come.
  • Docker with integrated JupyterLab and GoNB (a Go kernel)
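
To give a feel for the programming model behind the components above, here is a minimal sketch of defining and running a computation with the graph package. The constructor and op names used here (BuildManager, NewExec, MulScalar, ReduceAllSum) are assumptions based on the tutorial and examples, and may differ between releases, so treat this as illustrative rather than canonical:

package main

import (
	"fmt"

	. "github.com/gomlx/gomlx/graph" // dot-import, as used in the tutorial
)

// DoubleAndSum is a plain Go function over graph Nodes; GoMLX builds a
// computation graph from it and JIT-compiles it with XLA when executed.
func DoubleAndSum(x *Node) *Node {
	return ReduceAllSum(MulScalar(x, 2))
}

func main() {
	// The Manager owns the XLA client/device. The exact builder chain is an
	// assumption based on the examples; check the tutorial for the current form.
	manager := BuildManager().MustDone()

	// NewExec wraps the graph function so it can be called like a regular Go function.
	exec := NewExec(manager, DoubleAndSum)

	// Call JIT-compiles the graph (once per input shape) and executes it.
	result := exec.Call([]float32{1, 2, 3})[0]
	fmt.Println(result) // Expected: 12
}

Gradients (the autograd item above) and the Context-managed layers are built with the same kind of Node functions; the tutorial walks through them in detail.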

Installation

For now, Linux only. It also works well in WSL (Windows Subsystem for Linux) on Windows, or using Docker.

It would likely work on Macs with some work -- contributions are very welcome, as I don't have a Mac. It should work in Docker for Mac, but without native support for M1/M2.

The easiest way to start playing with it is to pull the Docker image, which comes with GoMLX + JupyterLab + GoNB (a Go kernel for Jupyter) and Nvidia's CUDA runtime (for optional GPU support) pre-installed -- it is ~5GB to download.

From a directory you want to make visible in Jupyter, do:

For GPU support, add the flag --gpus all to the docker run command below.

docker pull janpfeifer/gomlx_jupyterlab:latest
docker run -it --rm -p 8888:8888 -v "${PWD}":/home/jovyan/work janpfeifer/gomlx_jupyterlab:latest

It will display a URL starting with 127.0.0.1:8888 in the terminal (it will include the secret token needed to log in) that you can open in your browser.

You can open and interact with the tutorial from there; it is included in the Docker image under the directory Projects/gomlx/examples/tutorial.

More details on the docker here.

Linux

The library depends on the following libraries to compile and run:

  • libunwind8: usually available in most Linux systems.
  • liblzma5: compression library, also usually available.
  • TCMalloc, usually packaged as libgoogle-perftools-dev: a fast malloc implementation, plus memory debugging tools.
  • hdf5-tools: access to the .h5 file format, used to hold pre-trained weights for some models.

Typically, this can be installed with:

sudo apt-get install libunwind8 libgoogle-perftools-dev liblzma5 hdf5-tools

Second, you need the pre-compiled GoMLX+XLA C library, included in each release. The library is pretty large, ~500MB (with GPU and TPU support, it statically links most of what it needs) -- for Just-In-Time (JIT) compilation it includes the whole LLVM compiler.

The latest version is at github.com/gomlx/gomlx/releases/latest/download/gomlx_xla-linux-amd64.tar.gz.

The contents are a libgomlx_xla.so file and a few .h files needed to compile GoMLX. They are organized into two top-level directories, /lib and /include, and for now the recommended way is to simply untar them into /usr/local, which is usually in the default paths for header inclusion and dynamic library loading. So you can do:

cd /usr/local
tar xzvf .../path/to/gomlx_xla-linux-amd64.tar.gz

This should be enough for most installations. If CGO is not finding the library, you may need to set some environment variables (LD_LIBRARY_PATH, CGO_CPPFLAGS, CGO_LDFLAGS) to include the corresponding directories under /usr/local (most Linux distributions won't need this).

After that, just import it as with any Go library.
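
For instance, a minimal smoke test along the following lines can confirm that CGO finds the headers and libgomlx_xla.so. As in the earlier sketch, the constructor names are assumptions based on the examples and may differ across versions:

// smoketest.go -- builds only if CGO finds the GoMLX/XLA headers and library.
package main

import (
	"fmt"

	"github.com/gomlx/gomlx/graph"
)

func main() {
	// Creating the Manager initializes the XLA backend from libgomlx_xla.so;
	// if this runs without errors, the installation is working.
	manager := graph.BuildManager().MustDone()

	// Run a trivial computation to exercise the JIT compiler.
	exec := graph.NewExec(manager, func(x *graph.Node) *graph.Node {
		return graph.MulScalar(x, 2)
	})
	fmt.Println(exec.Call(float32(21))[0]) // Expected: 42
}

If go run smoketest.go prints 42, the C library is correctly installed; if the linker complains about libgomlx_xla.so, revisit the environment variables mentioned above.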

For more on building the C library, see docs/building.md.

GPU Support (NVidia)

Typically, one needs the same NVidia libraries as TensorFlow/Jax. On a fresh Ubuntu 23.04 install, all it took was having the proprietary NVidia drivers installed (not the Nouveau drivers), and additionally installing:

apt install nvidia-cudnn

After that, one more step is needed -- due to some mismatch between NVidia's CuDNN library, Ubuntu's packaging, and the XLA code, I'm not sure which -- creating the following directory and symbolic link:

sudo mkdir /usr/lib/nvidia-cuda-toolkit/nvvm
sudo ln -s /usr/lib/nvidia-cuda-toolkit/libdevice /usr/lib/nvidia-cuda-toolkit/nvvm/

Without this you'll see errors complaining about not finding libdevice.10.bc.

Tutorial

See the tutorial here. It covers a bit of everything.

After that look at the demos in the examples/ directory.

The library itself is well documented (please open issues if something is missing), and the code is not too hard to read (except for the bindings to C/XLA, which were done in a very ad hoc way). Godoc is available at pkg.go.dev.

Finally, feel free to ask questions: time allowing (when not at work), I'm always happy to help -- I created groups.google.com/g/gomlx-discuss for that.

Long-term Goals

  1. Building and training models in Go -- as opposed to Python (or some other language) -- with a focus on:
    • Being simple to read and reason about, leading the user to a correct and transparent mental model of what is going on. Even if that means being more verbose when writing.
    • Clean, separable APIs: individual APIs should be self-contained and decoupled where possible.
    • Composability: any component should be replaceable, so it can be customized and experimented with. That sometimes means more coding (there is no single magic train object that does everything), but it makes it clear what is happening and easy to replace parts with third-party versions or something custom.
    • Up-to-date documentation: if the documentation is not there or if it's badly written, it's as if the code was not there either.
    • Clear and actionable error reporting.
  2. To be a productive research and educational platform to experiment with new ML ideas and learn.
    • Support mirrored training on multiple devices and various forms of distributed training (model and/or data parallelism), in particular to support large language models and similarly large model training.
  3. To be a robust and reliable platform for production. Some sub-goals:
    • Support modern accelerator hardware like TPUs and GPUs.
    • Save models to industry tools like TensorFlow Serving.
    • Import pre-trained models from Hugging Face Hub and TensorFlow Hub where possible.
    • Compile models to binaries, such as C libraries and/or WebAssembly, to be linked and consumed (for inference) anywhere, from any language.

Collaborating

The project is looking forward to contributions from anyone interested. Many parts are not yet set in stone, so there is plenty of room for improvements and re-designs for those interested, with good experience in Go, machine learning, and APIs in general. See the TODO file for inspiration.

No governance guidelines have been established yet; this also needs work.

Advanced Topics
