
model-zoo's Introduction

Flux Model Zoo

This repository contains various demonstrations of the Flux machine learning library. Any of these may freely be used as a starting point for your own models.

The models are broadly categorised into the folders vision (e.g. large convolutional neural networks (CNNs)), text (e.g. various recurrent neural networks (RNNs) and natural language processing (NLP) models), and games (reinforcement learning / RL). See the READMEs of the respective models for more information.

Usage

Each model comes with its own Julia project. To use this, open Julia in the project folder, and enter

using Pkg; Pkg.activate("."); Pkg.instantiate()

This will install all needed packages, at the exact versions recorded when the model was last updated. Then you can run the model code with include("<model-to-run>.jl"), or by running the model script line by line.

Models may also be run with NVIDIA GPU support, if you have CUDA installed. Most models have this capability by default, indicated by calls to gpu in the model code.
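For reference, a minimal sketch of the usual pattern; the model and data here are illustrative placeholders, not taken from any particular zoo example:

using Flux, CUDA   # in recent Flux versions, `using CUDA` enables GPU support
model = Chain(Dense(784 => 32, relu), Dense(32 => 10)) |> gpu
x = rand(Float32, 784, 64) |> gpu   # gpu is a no-op if no working GPU is found
y = model(x)                        # runs on the GPU when one is available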

Gitpod Online IDE

Each model can also be used in Gitpod; just open the repository in Gitpod.

  • Based on Gitpod's policies, free access is limited.
  • All of your work will be stored in Gitpod's cloud.
  • It isn't an officially maintained feature.

Contributing

We welcome contributions of new models and documentation.

Share a new model

If you want to share a new model, we suggest you follow these guidelines:

  • Models should be in a folder with a project and manifest file to pin all relevant packages.
  • Models should include a README(.md) to explain what the model is about, how to run it, and what results it achieves (if applicable).
  • Models should ideally be CPU/GPU agnostic and not depend directly on GPU functionality.
  • Please keep the code short, clean, and self-explanatory, with as little boilerplate as possible.

Create or improve documentation

You can contribute in one of the following ways:

  • Add or improve documentation for existing models, including the following information:
    • Give a brief introduction to the model’s architecture and the goal it achieves.
    • Describe the Flux API that the model demonstrates (high-level API, AD, custom operations, custom layers, etc.).
    • Add literature background for the model. More specifically, add articles, blog posts, videos, and any other resource that is helpful to better understand the model.
    • Briefly describe the learning technique being demonstrated (computer vision, regression, NLP, time series, etc.).
  • Write in-depth tutorials for a model: You can further extend the documentation of a model and create a tutorial explaining in more detail the architecture, the training routine, how to use your own data, and so forth. After you write a tutorial, open a PR adding it to the Tutorials section of the FluxML website.

Update a model

Each example lists the version of Flux for which it was most recently updated. Bringing them up to the latest version is a great way to learn! Flux has a NEWS page listing important changes. (For other packages, see their releases pages: MLUtils, MLDatasets, etc.)

To run the old examples, Flux v0.11 can be installed and run on Julia 1.6, the LTS version. Flux v0.12 works on Julia 1.8. Flux v0.14 is the latest right now; this and v0.13 are marked with ☀️. Models upgraded to use explicit gradients (v0.13.9+ or v0.14) are marked with a +.
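For orientation, a minimal sketch of the explicit-gradient training style that upgraded models use (assuming Flux ≥ 0.13.9; model, loss, x, and y are placeholders):

using Flux
opt_state = Flux.setup(Adam(), model)             # optimiser state for this model
grads = Flux.gradient(m -> loss(m(x), y), model)  # explicit gradient w.r.t. the model
Flux.update!(opt_state, model, grads[1])          # one optimisation step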

Examples in the Model Zoo

Vision

Text

Other & contributed models

Tutorials

Examples Elsewhere

MLJFlux is a bridge to MLJ.jl, a package for mostly non-neural-network machine learning. They have some examples of interest, which, like the model zoo's examples, each include a local Project and Manifest file.

model-zoo's People

Contributors

adarshkumar712, adinhobl, aditkumar72, avik-pal, carlolucibello, chrisrackauckas, darsnack, dhairyalgandhi, fishares, iblislin, jakee417, jldc, joostdup, kraftpunk97, lilianabs, logankilpatrick, luboshanus, maetshju, matsueushi, mcabbott, mcognetta, mikeinnes, roboneet, saswatpp, shreyas-kowshik, staticfloat, sudhanshuagrawal27, tejank10, touchesir, willtebbutt


model-zoo's Issues

Tutorial: throws an error when calling `m(x)` (output of Dense)

Here:

Throws:
ERROR: MethodError: no method matching make_makeargs(::Base.Broadcast.Broadcasted{Flux.Tracker.TrackedStyle,Nothing,typeof(identity),Tuple{Base.Broadcast.Broadcasted{Flux.Tracker.TrackedStyle,Nothing,typeof(+),Tuple{TrackedArray{…,Array{Float64,1}},TrackedArray{…,Array{Float64,1}}}}}})

Even when activating the project with ]activate .

Also, the Flux version in Project.toml is starting to be outdated; for instance, I don't think there is a derivative function anymore. So I guess one question is whether it's worth fixing this bug on Flux 0.6.8, or rewriting the tutorial for a more recent version of Flux.

julia 1.0.0 support?

It prints errors when using Metalhead:
WARNING: eval from module Broadcast to Metalhead:
Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):388 =#, Expr(:function, Expr(:where, Expr(:call, :flatten, Expr(:::, :bc, Expr(:curly, :Broadcasted, :Style))), :Style), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):389 =#, Expr(:&&, Expr(:call, :isflat, :bc), Expr(:return, :bc)), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):390 =#, :args = Expr(:call, :cat_nested, :bc), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):391 =#, Expr(:let, Expr(:block, :makeargs = Expr(:call, :make_makeargs, :bc), :f = Expr(:., :bc, :(:f))), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):392 =#, :newf = Expr(:macrocall, Symbol("@inline"), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):392 =#, Expr(:function, Expr(:where, Expr(:tuple, Expr(:::, :args, Expr(:curly, :Vararg, :Any, :N))), :N), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):393 =#, Expr(:call, :f, Expr(:..., Expr(:call, :makeargs, Expr(:..., :args))))))), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):395 =#, Expr(:return, Expr(:call, Expr(:curly, :Broadcasted, :Style), :newf, :args, Expr(:., :bc, :(:axes)))))))), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):398 =#, Expr(:macrocall, Symbol("@inline"), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):398 =#, Expr(:function, Expr(:call, :make_makeargs, :makeargs, Expr(:::, :t, Expr(:curly, :Tuple, Expr(:<:, :Broadcasted), Expr(:curly, :Vararg, :Any)))), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):399 =#, :bc = Expr(:ref, :t, 1), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):400 =#, Expr(:let, Expr(:block, :makeargs = Expr(:call, :make_makeargs, :makeargs, Expr(:call, :tail, :t)), :f = Expr(:., :bc, :(:f))), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):401 =#, Expr(:let, :makeargs = Expr(:call, :make_makeargs, :makeargs, Expr(:., :bc, :(:args))), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):402 =#, Expr(:tuple, :headargs, :tailargs) = Expr(:tuple, Expr(:call, :make_headargs, Expr(:., :bc, :(:args))), Expr(:call, :make_tailargs, Expr(:., :bc, :(:args)))), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):403 =#, Expr(:return, Expr(:macrocall, Symbol("@inline"), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):403 =#, Expr(:function, Expr(:where, Expr(:tuple, Expr(:::, :args, Expr(:curly, :Vararg, :Any, :N))), :N), Expr(:block, #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):404 =#, :args1 = Expr(:call, :makeargs, Expr(:..., :args)), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):405 =#, Expr(:tuple, :a, :b) = Expr(:tuple, Expr(:call, :headargs, Expr(:..., :args1)), Expr(:call, :tailargs, Expr(:..., :args1))), #= Symbol("/home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl"):406 =#, Expr(:tuple, Expr(:call, :f, Expr(:..., :a)), Expr(:..., :b))))))))))))))
** incremental compilation may be broken for this module **

WARNING: Method definition flatten(Base.Broadcast.Broadcasted{Style, Axes, F, Args} where Args<:Tuple where F where Axes) where {Style} in module Broadcast at broadcast.jl:296 overwritten at /home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl:389.
WARNING: Method definition make_makeargs(Any, Tuple{#s12, Vararg{Any, N} where N} where #s12<:(Base.Broadcast.Broadcasted{Style, Axes, F, Args} where Args<:Tuple where F where Axes where Style<:Union{Nothing, Base.Broadcast.BroadcastStyle})) in module Broadcast at broadcast.jl:334 overwritten at /home/qingzhi/.julia/packages/Flux/kJVo6/src/tracker/array.jl:399.
ERROR: LoadError: UndefVarError: Void not defined
Stacktrace:
[1] top-level scope at none:0
[2] include at ./boot.jl:317 [inlined]
[3] include_relative(::Module, ::String) at ./loading.jl:1038
[4] include(::Module, ::String) at ./sysimg.jl:29
[5] top-level scope at none:2
[6] eval at ./boot.jl:319 [inlined]
[7] eval(::Expr) at ./client.jl:389
[8] top-level scope at ./none:3
in expression starting at /home/qingzhi/.julia/packages/BSON/VAvDJ/src/BSON.jl:7
ERROR: LoadError: Failed to precompile BSON [fbb218c0-5317-5bc6-957e-2ee96dd4b1f0] to /home/qingzhi/.julia/compiled/v1.0/BSON/3tVCZ.ji.
Stacktrace:
[1] error(::String) at ./error.jl:33
[2] macro expansion at ./logging.jl:313 [inlined]
[3] compilecache(::Base.PkgId, ::String) at ./loading.jl:1184
[4] _require(::Base.PkgId) at ./logging.jl:311
[5] require(::Base.PkgId) at ./loading.jl:852
[6] macro expansion at ./logging.jl:311 [inlined]
[7] require(::Module, ::Symbol) at ./loading.jl:834
[8] include at ./boot.jl:317 [inlined]
[9] include_relative(::Module, ::String) at ./loading.jl:1038
[10] include(::Module, ::String) at ./sysimg.jl:29
[11] top-level scope at none:2
[12] eval at ./boot.jl:319 [inlined]
[13] eval(::Expr) at ./client.jl:389
[14] top-level scope at ./none:3
in expression starting at /home/qingzhi/.julia/packages/Metalhead/8ZFyX/src/Metalhead.jl:4
ERROR: Failed to precompile Metalhead [dbeba491-748d-5e0e-a39e-b530a07fa0cc] to /home/qingzhi/.julia/compiled/v1.0/Metalhead/OYscp.ji.
Stacktrace:
[1] error(::String) at ./error.jl:33
[2] macro expansion at ./logging.jl:313 [inlined]
[3] compilecache(::Base.PkgId, ::String) at ./loading.jl:1184
[4] macro expansion at ./logging.jl:311 [inlined]
[5] _require(::Base.PkgId) at ./loading.jl:941
[6] require(::Base.PkgId) at ./loading.jl:852
[7] macro expansion at ./logging.jl:311 [inlined]
[8] require(::Module, ::Symbol) at ./loading.jl:834

Error running the text generation example

Looks like the LSTM signature has been changed: LSTM(x, y) gives:

MethodError: no method matching Flux.LSTMCell(::TrackedArray{…,Array{Float64,2}}, ::TrackedArray{…,Array{Float64,2}}, ::Flux.Tracker.TrackedReal{Float64}, ::TrackedArray{…,Array{Float64,1}}, ::TrackedArray{…,Array{Float64,1}})
Closest candidates are:
  Flux.LSTMCell(::A, ::A, ::V, !Matched::V, !Matched::V) where {A, V} at /home/ayush99/.julia/dev/Flux/src/layers/recurrent.jl:116

Stacktrace:
 [1] #LSTMCell#79(::typeof(Flux.glorot_uniform), ::Type, ::Int64, ::Int64) at /home/ayush99/.julia/dev/Flux/src/layers/recurrent.jl:125
 [2] Flux.LSTMCell(::Int64, ::Int64) at /home/ayush99/.julia/dev/Flux/src/layers/recurrent.jl:125
 [3] #LSTM#80(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::Int64, ::Vararg{Int64,N} where N) at /home/ayush99/.julia/dev/Flux/src/layers/recurrent.jl:160
 [4] LSTM(::Int64, ::Vararg{Int64,N} where N) at /home/ayush99/.julia/dev/Flux/src/layers/recurrent.jl:160
 [5] top-level scope at none:0

Switch all datasets to be using MLDatasets.jl

Following up on FluxML/Flux.jl#580,
it would be good to get the model zoo off of depending on Flux for datasets,
and instead depend on MLDatasets.jl.
Once that is done, the datasets in Flux can be deprecated.

This is a follow-up to #15 in part, now that MLDatasets has matured and been registered.
For reference, suppressing the accept-download prompt mentioned in #15 (comment)
is done by setting ENV["DATADEPS_ALWAYS_ACCEPT"]="true".
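For illustration, a minimal sketch of loading MNIST through MLDatasets (this follows the API of recent MLDatasets versions; older versions used MNIST.traindata() instead):

ENV["DATADEPS_ALWAYS_ACCEPT"] = "true"    # skip the interactive download prompt
using MLDatasets
trainset = MLDatasets.MNIST(split=:train)
xtrain, ytrain = trainset[:]              # features and targets for all samples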

CNN model

I wonder if Flux is ready for CNNs. If so, could you please add a CNN model to show us how it works? Thanks!
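(For readers landing here: a minimal sketch of what a LeNet-style CNN looks like in recent Flux. The layer sizes assume a 28×28×1 MNIST-like input; this is illustrative, not the zoo's implementation.)

using Flux
m = Chain(
    Conv((5, 5), 1 => 6, relu),   # 28×28×1 -> 24×24×6
    MaxPool((2, 2)),              # -> 12×12×6
    Conv((5, 5), 6 => 16, relu),  # -> 8×8×16
    MaxPool((2, 2)),              # -> 4×4×16
    Flux.flatten,                 # -> 256-element vector
    Dense(256 => 120, relu),
    Dense(120 => 84, relu),
    Dense(84 => 10))              # 10 class scores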

Source of housing data example unclear.

In the code it says: "# This replicates the housing data example from the Knet.jl readme. Although we could have reused more of Flux (see the mnist example), the library's abstractions are very lightweight and don't force you into any particular strategy."

This is not clear, as no link is provided.

GsoC Proposal 2019

Hi all! I am a GSoC'19 aspirant and have been contributing to JuliaLang and model-zoo for a while. I want to contribute to Flux during the summer project. I have an idea in mind which comes in three stages of work.

The first step would be to create good implementations of some toy RL problems, just to get to know Flux well. The next step is to implement sparsification of neural networks via sensitivity-driven regularization, following the NIPS paper
http://papers.nips.cc/paper/7644-learning-sparse-neural-networks-via-sensitivity-driven-regularization

This would be tested on some known sparsifiable networks, such as VGGNet on ImageNet, and the last step is to experiment with it on RL problems.
Also, another idea I had in mind was to port the fast.ai courses to Julia, which would be great fun!

Please provide suggestions, as I would like to start working on the proposal right away. @MikeInnes @ViralBShah @dhairyagandhi96 @staticfloat

Speech example not working

julia> include("01-speech-blstm.jl")
Loading files
ERROR: LoadError: ArgumentError: invalid value for Enum BSONType: 32
Stacktrace:
 [1] enum_argument_error(::Symbol, ::UInt8) at ./Enums.jl:29
 [2] Type at ./Enums.jl:134 [inlined]
 [3] read at ./Enums.jl:15 [inlined]
 [4] parse_pairs(::IOStream) at /home/jfsantos/.julia/packages/BSON/kxdIr/src/read.jl:41
 [5] parse_doc at /home/jfsantos/.julia/packages/BSON/kxdIr/src/read.jl:49 [inlined]
 [6] parse at /home/jfsantos/.julia/packages/BSON/kxdIr/src/read.jl:93 [inlined]
 [7] #open#310(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::typeof(BSON.parse), ::String) at ./iostream.jl:369
 [8] open at ./iostream.jl:367 [inlined]
 [9] parse at /home/jfsantos/.julia/packages/BSON/kxdIr/src/read.jl:94 [inlined]
 [10] load at /home/jfsantos/.julia/packages/BSON/kxdIr/src/read.jl:96 [inlined]
 [11] macro expansion at /home/jfsantos/.julia/packages/BSON/kxdIr/src/BSON.jl:51 [inlined]
 [12] readData(::String) at /home/jfsantos/src/model-zoo/audio/speech-blstm/01-speech-blstm.jl:95
 [13] main() at /home/jfsantos/src/model-zoo/audio/speech-blstm/01-speech-blstm.jl:134
 [14] top-level scope at none:0
 [15] include at ./boot.jl:326 [inlined]
 [16] include_relative(::Module, ::String) at ./loading.jl:1038
 [17] include(::Module, ::String) at ./sysimg.jl:29
 [18] include(::String) at ./client.jl:403
 [19] top-level scope at none:0
in expression starting at /home/jfsantos/src/model-zoo/audio/speech-blstm/01-speech-blstm.jl:180

I wasn't able to narrow down what is causing the issue, as I still have little experience with BSON. The script for data generation seems to have run fine and created the train and test folders with bson files in them.

GSOC Proposal : Computer Vision Networks

I wish to implement the following network architectures for my GSoC. They are listed in order of implementation:

  • VGG16
  • VGG19
  • ResNet50
  • Xception
  • Inception V3
  • Inception ResNet V2
  • MobileNet
  • DenseNet
  • NASNet
  • R-FCN
  • Fast R-CNN
  • Faster R-CNN
  • RetinaNet
  • Mask R-CNN (*)

@MikeInnes how does this sound? I wish to test these implementations on the tiny-imagenet-200 dataset or CIFAR-10.

Problem with broadcasting .== in MNIST Conv model from the zoo

I'm getting a GPU compilation error with the conv model from the zoo:


Argument 4 to your kernel function is of type [something long that I can't copy/paste]
That type is not isbits, and such arguments are only allowed when they are unused by the kernel

This is at the last line, `Flux.train!(loss, params(m), train, opt, cb = evalcb)`. Everything works on the CPU.

Edit:
I've narrowed it down to the line `accuracy(x, y) = mean(onecold(m(x)) .== onecold(y))`. If I remove the callback, `evalcb = throttle(() -> @show(accuracy(tX, tY)), 10)` -> `Flux.train!(loss, params(m), train, opt)` it works.
Edit:
Further narrowed it down to the `.==` in `onecold(m(tX)) .== onecold(tY)`
`mean(cpu(onecold(m(tX))) .== cpu(onecold(tY)))` works
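Spelled out, the workaround is to move both sides of the comparison to the CPU inside the accuracy function; a sketch, using the names from the example:

using Statistics: mean
using Flux: onecold, cpu
# Comparing a CuArray with a plain Array via .== fails to compile as a GPU
# kernel, so bring both onecold results back to the host first:
accuracy(x, y) = mean(cpu(onecold(m(x))) .== cpu(onecold(y)))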

julia mnist.jl has numerical issue (NaN params) after some training

$ julia mnist.jl                                       
WARNING: Array{T}(::Type{T}, m::Int, n::Int) is deprecated, use Array{T}(m, n) instead.                                                                
Stacktrace:                                                          
 [1] depwarn(::String, ::Symbol) at ./deprecated.jl:70               
 [2] Array(::Type{Float64}, ::Int64, ::Int64) at ./deprecated.jl:57  
 [3] traindata() at /home/phuoc/.julia/v0.6/MNIST/src/MNIST.jl:88    
 [4] include_from_node1(::String) at ./loading.jl:569                
 [5] include(::String) at ./sysimg.jl:14                             
 [6] process_options(::Base.JLOptions) at ./client.jl:305            
 [7] _start() at ./client.jl:371                                     
while loading /home/phuoc/git/model-zoo/mnist/mnist.jl, in expression starting on line 5                                                               
WARNING: Array{T}(::Type{T}, m::Int) is deprecated, use Array{T}(m) instead.                                                                           
Stacktrace:                                                          
 [1] depwarn(::String, ::Symbol) at ./deprecated.jl:70               
 [2] Array(::Type{Float64}, ::Int64) at ./deprecated.jl:57           
 [3] traindata() at /home/phuoc/.julia/v0.6/MNIST/src/MNIST.jl:89    
 [4] include_from_node1(::String) at ./loading.jl:569                
 [5] include(::String) at ./sysimg.jl:14                             
 [6] process_options(::Base.JLOptions) at ./client.jl:305            
 [7] _start() at ./client.jl:371                                     
while loading /home/phuoc/git/model-zoo/mnist/mnist.jl, in expression starting on line 5                                                               
loss(x, y) = param(0.089922)                                         
loss(x, y) = param(0.0890893)     
loss(x, y) = param(0.0865485)                                        
loss(x, y) = param(0.0804525)     
loss(x, y) = param(0.0710139)     
loss(x, y) = param(0.0596167)     
loss(x, y) = param(0.0461451)        
loss(x, y) = param(NaN)              
ERROR: LoadError: BoundsError: attempt to access 10-element UnitRange{Int64} at index [0]                                                              
Stacktrace:                          
 [1] throw_boundserror(::UnitRange{Int64}, ::Int64) at ./abstractarray.jl:433                                                                          
 [2] getindex at ./range.jl:477 [inlined]                                  
 [3] argmax(::TrackedArray{…,Array{Float64,1}}, ::UnitRange{Int64}) at /home/phuoc/.julia/v0.6/Flux/src/onehot.jl:27                                   
 [4] include_from_node1(::String) at ./loading.jl:569                      
 [5] include(::String) at ./sysimg.jl:14                                   
 [6] process_options(::Base.JLOptions) at ./client.jl:305                  
 [7] _start() at ./client.jl:371     
while loading /home/phuoc/git/model-zoo/mnist/mnist.jl, in expression starting on line 18  

Project Proposal For Contribution to GSoC'19

Hi, I am a GSoC'19 aspirant looking to contribute to the model-zoo of Flux. I have a few ideas in mind for my work and wanted to check their feasibility. I was planning to implement generative models such as CycleGAN, pix2pix, and neural image captioning, as well as an object detection pipeline such as YOLO-v3. I also see that object detection modules such as Faster-RCNN and MobileNet-SSD have not been implemented, and I can also work on them. I have prior experience with generative models and object detection using deep learning. @MikeInnes @ViralBShah

Switch from MNIST.jl to MLDatasets.jl?

Would it be OK to swap the data-providing package for the mnist example? MNIST.jl has deprecation warnings on 0.6, and it seems like MLDatasets is more active. This would also allow the mnist example to be run with the Fashion-MNIST dataset.

If there are no objections, I'll submit a PR.
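For what it's worth, with MLDatasets the swap to Fashion-MNIST is essentially a one-line change (a sketch against the current MLDatasets API):

using MLDatasets
xtrain, ytrain = MLDatasets.FashionMNIST(split=:train)[:]  # same shapes as MNIST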

How do you apply the cmu phones model?

The code in the phonemes example looks amazingly simple. However, I cannot figure out how to apply the model at test time, i.e., what would a function predict(model, word) look like?

function predict(model, word)
   Xs = tokenise(word, alphabet)
   ## apply the RNN
end
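One plausible completion, as a hedged sketch: Flux's stateful RNNs need their hidden state reset between words; then the one-hot characters are fed one at a time and the outputs decoded with onecold. Here tokenise, alphabet, and a phones list are assumed to come from the example script:

using Flux: reset!, onecold
function predict(model, word)
    reset!(model)                     # clear hidden state left over from the last word
    Xs = tokenise(word, alphabet)     # sequence of one-hot character vectors
    Ys = [model(x) for x in Xs]       # run the RNN over the sequence
    [phones[onecold(y)] for y in Ys]  # decode each output to a phoneme
end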

60-minute blitz out of date

I'm working through the 60-minute blitz, but it appears it is out of date:

  • derivative appears to be succeeded by gradient
  • Even so, taking the second derivative of the polynomial example doesn't work:
using Flux.Tracker: gradient
f(x) = 3x^2 + 2x + 1
df(x) = gradient(f, x)
df(5) # (32.0 (tracked),)
ddf(x) = gradient(df, x)
ddf(5) # expect 6

Last line results in

ERROR: Function output is not scalar
Stacktrace:
 [1] losscheck(::Tuple{Flux.Tracker.TrackedReal{Float64}}) at /home/janis/.julia/packages/Flux/8XpDt/src/tracker/back.jl:171
 [2] gradient_(::Function, ::Int64) at /home/janis/.julia/packages/Flux/8XpDt/src/tracker/back.jl:72
 [3] #gradient#24 at /home/janis/.julia/packages/Flux/8XpDt/src/tracker/back.jl:182 [inlined]
 [4] gradient at /home/janis/.julia/packages/Flux/8XpDt/src/tracker/back.jl:182 [inlined]
 [5] ddf(::Int64) at ./REPL[69]:1
 [6] top-level scope at none:0
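(For later readers: with the Zygote-based gradient in Flux ≥ 0.10, this nested example does work. A quick sketch:)

using Flux  # `gradient` comes from Zygote in Flux ≥ 0.10
f(x) = 3x^2 + 2x + 1
df(x) = gradient(f, x)[1]   # take the first component of the returned tuple
df(5.0)                     # 32.0
ddf(x) = gradient(df, x)[1] # differentiate the derivative
ddf(5.0)                    # 6.0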

Addition Of CycleGAN model

As suggested by @dhairyagandhi96, I have chalked out an implementation plan for the CycleGAN model:

  • Write the UNet architecture for the generator [For 256x256 and 128x128 images]

  • Write the discriminator. The paper's implementation reference would be followed.

  • Training would be done on the apples2oranges dataset first followed by the horses2zebras dataset.

  • Formulation of the identity loss for the generators and discriminators and the adversarial losses.

The code will be organised in a separate repository with the utility functions, I/O, model definitions and training files.

What more details are required?
@staticfloat

Go with differentiable programming

Since AlphaGo is already implemented here with Flux, it probably shouldn't be hard to implement a similar experiment with differentiable programming.

Would it make sense? Or is it too complex for differentiable programming?

MNIST is using MSE loss

MNIST is a classification problem, and the model has a softmax output layer.
It should be using cross-entropy loss, not MSE.
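As a sketch, the fix amounts to swapping the loss function (m being the model from the example):

using Flux: crossentropy
# Cross-entropy matches a softmax output; MSE gives weak gradients here.
loss(x, y) = crossentropy(m(x), y)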

Use CuArrays for mnist.jl example

I tried the following but it does not seem to be supported yet:

using CuArrays
...
m = Chain(
  Dense(28^2, 32, σ),
  Dense(32, 10),
  softmax)
m = cu.(m)

julia> m[1].W                
Tracked 32×784 Array{Float64,2}:   
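(Broadcasting cu over a Chain doesn't convert the parameters, as the output above shows. In current Flux the supported way is the gpu function; a sketch, assuming Flux ≥ 0.13 with CUDA.jl, which replaced CuArrays:)

using Flux, CUDA
m = Chain(Dense(28^2 => 32, σ), Dense(32 => 10), softmax) |> gpu
m[1].weight  # now a CuArray (the field was renamed from W to weight in Flux 0.12)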

MNIST example fails on gpu

When I uncomment the using CuArrays line in the MNIST example it fails (Ubuntu 14.04 - Google Cloud instance 104GB RAM, NVIDIA GPU) with the following error:

ERROR: LoadError: MethodError: no method matching ∇conv_data!(::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}; pad=(0, 0), stride=(1, 1), dilation=(1, 1), flipkernel=0)
Closest candidates are:
  ∇conv_data!(::CuArray{T<:Union{Float16, Float32, Float64},N} where N, ::CuArray{T<:Union{Float16, Float32, Float64},N} where N, ::CuArray{T<:Union{Float16, Float32, Float64},N} where N, ::CuArray{T<:Union{Float16, Float32, Float64},N} where N; pad, stride, mode, alpha, dilation, workspace, algo) where T<:Union{Float16, Float32, Float64} at /home/jamesnorton/.julia/packages/CuArrays/f4Eke/src/dnn/nnlib.jl:115 got unsupported keyword argument "flipkernel"
  ∇conv_data!(::AbstractArray{T,4}, ::AbstractArray{T,4}, ::AbstractArray{T,4}, ::AbstractArray{T,4}; pad, stride, dilation, flipkernel) where T at /home/jamesnorton/.julia/packages/NNlib/x0XUf/src/conv.jl:84

The strange thing is that the second candidate looks like a match to me.

Stacktrace:

 [1] kwerr(::NamedTuple{(:pad, :stride, :dilation, :flipkernel),Tuple{Tuple{Int64,Int64},Tuple{Int64,Int64},Tuple{Int64,Int64},Int64}}, ::Function, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at ./error.jl:97
 [2] (::getfield(NNlib, Symbol("#kw##∇conv_data!")))(::NamedTuple{(:pad, :stride, :dilation, :flipkernel),Tuple{Tuple{Int64,Int64},Tuple{Int64,Int64},Tuple{Int64,Int64},Int64}}, ::typeof(NNlib.∇conv_data!), ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at ./none:0
 [3] #∇conv_data#54(::Tuple{Int64,Int64}, ::Tuple{Int64,Int64}, ::Tuple{Int64,Int64}, ::Int64, ::Function, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/NNlib/x0XUf/src/conv.jl:39
 [4] (::getfield(NNlib, Symbol("#kw##∇conv_data")))(::NamedTuple{(:stride, :pad, :dilation),Tuple{Tuple{Int64,Int64},Tuple{Int64,Int64},Tuple{Int64,Int64}}}, ::typeof(NNlib.∇conv_data), ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at ./none:0
 [5] (::getfield(Flux.Tracker, Symbol("##434#435")){Base.Iterators.Pairs{Symbol,Tuple{Int64,Int64},Tuple{Symbol,Symbol,Symbol},NamedTuple{(:stride, :pad, :dilation),Tuple{Tuple{Int64,Int64},Tuple{Int64,Int64},Tuple{Int64,Int64}}}},TrackedArray{…,CuArray{Float32,4}},TrackedArray{…,CuArray{Float32,4}}})(::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/array.jl:349
 [6] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##434#435")){Base.Iterators.Pairs{Symbol,Tuple{Int64,Int64},Tuple{Symbol,Symbol,Symbol},NamedTuple{(:stride, :pad, :dilation),Tuple{Tuple{Int64,Int64},Tuple{Int64,Int64},Tuple{Int64,Int64}}}},TrackedArray{…,CuArray{Float32,4}},TrackedArray{…,CuArray{Float32,4}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,4}},Flux.Tracker.Tracked{CuArray{Float32,4}}}}, ::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:23
 [7] back(::Flux.Tracker.Tracked{CuArray{Float32,4}}, ::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [8] foreach at ./abstractarray.jl:1836 [inlined]
 [9] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("#back#451")){2,getfield(Base.Broadcast, Symbol("##26#28")){getfield(Base.Broadcast, Symbol("##27#29")){typeof(+),getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##11#12"))}},getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##15#16"))}},getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##3#4"))}}},typeof(relu)},Tuple{TrackedArray{…,CuArray{Float32,4}},TrackedArray{…,CuArray{Float32,4}}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,4}},Flux.Tracker.Tracked{CuArray{Float32,4}}}}, ::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [10] back(::Flux.Tracker.Tracked{CuArray{Float32,4}}, ::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [11] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##438#439")){Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}},TrackedArray{…,CuArray{Float32,4}},Tuple{Int64,Int64},CuArray{Float32,4}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,4}},Nothing}}, ::CuArray{Float32,4}) at ./abstractarray.jl:1836
 [12] back(::Flux.Tracker.Tracked{CuArray{Float32,4}}, ::CuArray{Float32,4}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [13] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##385#386")){TrackedArray{…,CuArray{Float32,4}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,4}},Nothing}}, ::CuArray{Float32,2}) at ./abstractarray.jl:1836
 [14] back(::Flux.Tracker.Tracked{CuArray{Float32,2}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [15] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##424#425")){TrackedArray{…,CuArray{Float32,2}},TrackedArray{…,CuArray{Float32,2}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,2}},Flux.Tracker.Tracked{CuArray{Float32,2}}}}, ::CuArray{Float32,2}) at ./abstractarray.jl:1836
 [16] back(::Flux.Tracker.Tracked{CuArray{Float32,2}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [17] foreach(::Function, ::Tuple{Flux.Tracker.Tracked{CuArray{Float32,2}},Flux.Tracker.Tracked{CuArray{Float32,1}}}, ::Tuple{CuArray{Float32,2},CuArray{Float32,1}}) at ./abstractarray.jl:1836
 [18] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("#back#451")){2,getfield(Base.Broadcast, Symbol("##26#28")){getfield(Base.Broadcast, Symbol("##27#29")){typeof(+),getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##11#12"))}},getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##15#16"))}},getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##3#4"))}}},typeof(identity)},Tuple{TrackedArray{…,CuArray{Float32,2}},TrackedArray{…,CuArray{Float32,1}}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,2}},Flux.Tracker.Tracked{CuArray{Float32,1}}}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [19] back(::Flux.Tracker.Tracked{CuArray{Float32,2}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [20] foreach at ./abstractarray.jl:1836 [inlined]
 [21] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##426#427")){TrackedArray{…,CuArray{Float32,2}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,2}}}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [22] back(::Flux.Tracker.Tracked{CuArray{Float32,2}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [23] foreach(::Function, ::Tuple{Nothing,Flux.Tracker.Tracked{CuArray{Float32,2}},Nothing}, ::Tuple{CuArray{Float32,2},CuArray{Float32,2},Float32}) at ./abstractarray.jl:1836
 [24] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("#back#451")){3,getfield(Base.Broadcast, Symbol("##26#28")){getfield(Base.Broadcast, Symbol("##27#29")){typeof(*),getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##11#12"))}},getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##15#16"))}},getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##27#29")){typeof(CUDAnative.log),getfield(Base.Broadcast, Symbol("##9#10")){getfield(Base.Broadcast, Symbol("##11#12"))},getfield(Base.Broadcast, Symbol("##13#14")){getfield(Base.Broadcast, Symbol("##15#16"))},getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##5#6")){getfield(Base.Broadcast, Symbol("##3#4"))}}}}},typeof(*)},Tuple{Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}},TrackedArray{…,CuArray{Float32,2}},Int64}},Tuple{Nothing,Flux.Tracker.Tracked{CuArray{Float32,2}},Nothing}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [25] back(::Flux.Tracker.Tracked{CuArray{Float32,2}}, ::CuArray{Float32,2}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:45
 [26] foreach at ./abstractarray.jl:1836 [inlined]
 [27] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##397#398")){TrackedArray{…,CuArray{Float32,2}}},Tuple{Flux.Tracker.Tracked{CuArray{Float32,2}}}}, ::Float32) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [28] back(::Flux.Tracker.Tracked{Float32}, ::Float32) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:43
 [29] foreach at ./abstractarray.jl:1836 [inlined]
 [30] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##178#179")),Tuple{Flux.Tracker.Tracked{Float32}}}, ::Float32) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [31] back(::Flux.Tracker.Tracked{Float32}, ::Float64) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:43
 [32] foreach at ./abstractarray.jl:1836 [inlined]
 [33] back_(::Flux.Tracker.Call{getfield(Flux.Tracker, Symbol("##203#206")){Int64},Tuple{Flux.Tracker.Tracked{Float32},Nothing}}, ::Float32) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:26
 [34] back(::Flux.Tracker.Tracked{Float32}, ::Int64) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:43
 [35] back!(::Flux.Tracker.TrackedReal{Float32}) at /home/jamesnorton/.julia/packages/Flux/oN61x/src/tracker/back.jl:62
 [36] #train!#121(::getfield(Flux, Symbol("#throttled#18")){getfield(Flux, Symbol("##throttled#10#14")){Bool,Bool,getfield(Main, Symbol("##11#12")),Int64}}, ::Function, ::Function, ::Array{Tuple{CuArray{Float32,4},Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}}},1}, ::getfield(Flux.Optimise, Symbol("##43#47"))) at /home/jamesnorton/.julia/packages/Juno/46C8i/src/progress.jl:111
 [37] (::getfield(Flux.Optimise, Symbol("#kw##train!")))(::NamedTuple{(:cb,),Tuple{getfield(Flux, Symbol("#throttled#18")){getfield(Flux, Symbol("##throttled#10#14")){Bool,Bool,getfield(Main, Symbol("##11#12")),Int64}}}}, ::typeof(Flux.Optimise.train!), ::Function, ::Array{Tuple{CuArray{Float32,4},Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}}},1}, ::Function) at ./none:0
 [38] top-level scope at none:0
 [39] include at ./boot.jl:317 [inlined]
 [40] include_relative(::Module, ::String) at ./loading.jl:1044
 [41] include(::Module, ::String) at ./sysimg.jl:29
 [42] exec_options(::Base.JLOptions) at ./client.jl:231
 [43] _start() at ./client.jl:425
in expression starting at /home/jamesnorton/cats_n_dogs/mnist.jl:39


cifar10 fails on GPU

If I add using CuArrays to the top of the script, I get the following error:

$ julia vision/cifar10/cifar10.jl 
ERROR: LoadError: GPU compilation of #23(CuArrays.CuKernelState, CUDAnative.CuDeviceArray{Bool,1,CUDAnative.AS.Global}, Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CUDAnative.CuDeviceArray{Int64,1,CUDAnative.AS.Global},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}) failed
KernelError: passing and using non-bitstype argument

Argument 4 to your kernel function is of type Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CUDAnative.CuDeviceArray{Int64,1,CUDAnative.AS.Global},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}.
That type is not isbits, and such arguments are only allowed when they are unused by the kernel.

Stacktrace:
 [1] check_invocation(::CUDAnative.CompilerContext, ::LLVM.Function) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/validation.jl:35
 [2] compile(::CUDAnative.CompilerContext) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:94
 [3] #compile#109(::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::VersionNumber, ::Any, ::Any) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:45
 [4] compile at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:43 [inlined]
 [5] #compile#108(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::CUDAdrv.CuDevice, ::Function, ::Any) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:18
 [6] compile at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:16 [inlined]
 [7] macro expansion at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:269 [inlined]
 [8] #cufunction#123(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(CUDAnative.cufunction), ::getfield(GPUArrays, Symbol("##23#24")), ::Type{Tuple{CuArrays.CuKernelState,CUDAnative.CuDeviceArray{Bool,1,CUDAnative.AS.Global},Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CUDAnative.CuDeviceArray{Int64,1,CUDAnative.AS.Global},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}}}) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:240
 [9] cufunction(::Function, ::Type) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:240
 [10] macro expansion at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:208 [inlined]
 [11] macro expansion at ./gcutils.jl:87 [inlined]
 [12] macro expansion at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:205 [inlined]
 [13] _gpu_call(::CuArrays.CuArrayBackend, ::Function, ::CuArray{Bool,1}, ::Tuple{CuArray{Bool,1},Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CuArray{Int64,1},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}}, ::Tuple{Tuple{Int64},Tuple{Int64}}) at /home/tyler/.julia/packages/CuArrays/qZCAt/src/gpuarray_interface.jl:59
 [14] gpu_call(::Function, ::CuArray{Bool,1}, ::Tuple{CuArray{Bool,1},Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CuArray{Int64,1},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}}, ::Int64) at /home/tyler/.julia/packages/GPUArrays/t8tJB/src/abstract_gpu_interface.jl:151
 [15] gpu_call at /home/tyler/.julia/packages/GPUArrays/t8tJB/src/abstract_gpu_interface.jl:128 [inlined]
 [16] copyto! at /home/tyler/.julia/packages/GPUArrays/t8tJB/src/broadcast.jl:48 [inlined]
 [17] copyto! at ./broadcast.jl:797 [inlined]
 [18] copy(::Base.Broadcast.Broadcasted{Base.Broadcast.ArrayStyle{CuArray},Tuple{Base.OneTo{Int64}},typeof(==),Tuple{CuArray{Int64,1},Array{Int64,1}}}) at ./broadcast.jl:773
 [19] materialize(::Base.Broadcast.Broadcasted{Base.Broadcast.ArrayStyle{CuArray},Nothing,typeof(==),Tuple{CuArray{Int64,1},Array{Int64,1}}}) at ./broadcast.jl:753
 [20] accuracy(::CuArray{Float32,4}, ::Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}}) at /home/tyler/code/model-zoo/vision/cifar10/cifar10.jl:114
 [21] macro expansion at ./show.jl:555 [inlined]
 [22] (::getfield(Main, Symbol("##33#34")))() at /home/tyler/code/model-zoo/vision/cifar10/cifar10.jl:118
 [23] (::getfield(Flux, Symbol("##throttled#10#14")){Bool,Bool,getfield(Main, Symbol("##33#34")),Int64})(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function) at /home/tyler/.julia/packages/Flux/lz7S9/src/utils.jl:120
 [24] throttled at /home/tyler/.julia/packages/Flux/lz7S9/src/utils.jl:116 [inlined]
 [25] macro expansion at /home/tyler/.julia/packages/Flux/lz7S9/src/optimise/train.jl:75 [inlined]
 [26] macro expansion at /home/tyler/.julia/packages/Juno/TfNYn/src/progress.jl:133 [inlined]
 [27] #train!#12(::getfield(Flux, Symbol("#throttled#18")){getfield(Flux, Symbol("##throttled#10#14")){Bool,Bool,getfield(Main, Symbol("##33#34")),Int64}}, ::Function, ::Function, ::Tracker.Params, ::Array{Tuple{CuArray{Float32,4},Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}}},1}, ::ADAM) at /home/tyler/.julia/packages/Flux/lz7S9/src/optimise/train.jl:69
 [28] (::getfield(Flux.Optimise, Symbol("#kw##train!")))(::NamedTuple{(:cb,),Tuple{getfield(Flux, Symbol("#throttled#18")){getfield(Flux, Symbol("##throttled#10#14")){Bool,Bool,getfield(Main, Symbol("##33#34")),Int64}}}}, ::typeof(Flux.Optimise.train!), ::Function, ::Tracker.Params, ::Array{Tuple{CuArray{Float32,4},Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}}},1}, ::ADAM) at ./none:0
 [29] top-level scope at none:0
 [30] include at ./boot.jl:326 [inlined]
 [31] include_relative(::Module, ::String) at ./loading.jl:1038
 [32] include(::Module, ::String) at ./sysimg.jl:29
 [33] exec_options(::Base.JLOptions) at ./client.jl:267
 [34] _start() at ./client.jl:436
in expression starting at /home/tyler/code/model-zoo/vision/cifar10/cifar10.jl:124

Edit: hm, same problem in vision/mnist/conv.jl. The code does hit the GPU for a couple of seconds, though, according to watch nvidia-smi.

$ julia vision/mnist/conv.jl 
[ Info: activating new environment at ~/code/model-zoo/cuda.
  Updating registry at `~/.julia/registries/General`
  Updating git-repo `https://github.com/JuliaRegistries/General.git`
 Resolving package versions...
[ Info: Loading data set
[ Info: Constructing model...
[ Info: Beginning training loop...
ERROR: LoadError: GPU compilation of #23(CuArrays.CuKernelState, CUDAnative.CuDeviceArray{Bool,1,CUDAnative.AS.Global}, Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CUDAnative.CuDeviceArray{Int64,1,CUDAnative.AS.Global},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}) failed
KernelError: passing and using non-bitstype argument

Argument 4 to your kernel function is of type Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CUDAnative.CuDeviceArray{Int64,1,CUDAnative.AS.Global},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}.
That type is not isbits, and such arguments are only allowed when they are unused by the kernel.

Stacktrace:
 [1] check_invocation(::CUDAnative.CompilerContext, ::LLVM.Function) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/validation.jl:35
 [2] compile(::CUDAnative.CompilerContext) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:94
 [3] #compile#109(::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::VersionNumber, ::Any, ::Any) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:45
 [4] compile at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:43 [inlined]
 [5] #compile#108(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::CUDAdrv.CuDevice, ::Function, ::Any) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:18
 [6] compile at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/compiler/driver.jl:16 [inlined]
 [7] macro expansion at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:269 [inlined]
 [8] #cufunction#123(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(CUDAnative.cufunction), ::getfield(GPUArrays, Symbol("##23#24")), ::Type{Tuple{CuArrays.CuKernelState,CUDAnative.CuDeviceArray{Bool,1,CUDAnative.AS.Global},Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CUDAnative.CuDeviceArray{Int64,1,CUDAnative.AS.Global},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}}}) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:240
 [9] cufunction(::Function, ::Type) at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:240
 [10] macro expansion at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:208 [inlined]
 [11] macro expansion at ./gcutils.jl:87 [inlined]
 [12] macro expansion at /home/tyler/.julia/packages/CUDAnative/PFgO3/src/execution.jl:205 [inlined]
 [13] _gpu_call(::CuArrays.CuArrayBackend, ::Function, ::CuArray{Bool,1}, ::Tuple{CuArray{Bool,1},Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CuArray{Int64,1},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}}, ::Tuple{Tuple{Int64},Tuple{Int64}}) at /home/tyler/.julia/packages/CuArrays/qZCAt/src/gpuarray_interface.jl:59
 [14] gpu_call(::Function, ::CuArray{Bool,1}, ::Tuple{CuArray{Bool,1},Base.Broadcast.Broadcasted{Nothing,Tuple{Base.OneTo{Int64}},typeof(==),Tuple{Base.Broadcast.Extruded{CuArray{Int64,1},Tuple{Bool},Tuple{Int64}},Base.Broadcast.Extruded{Array{Int64,1},Tuple{Bool},Tuple{Int64}}}}}, ::Int64) at /home/tyler/.julia/packages/GPUArrays/t8tJB/src/abstract_gpu_interface.jl:151
 [15] gpu_call at /home/tyler/.julia/packages/GPUArrays/t8tJB/src/abstract_gpu_interface.jl:128 [inlined]
 [16] copyto! at /home/tyler/.julia/packages/GPUArrays/t8tJB/src/broadcast.jl:48 [inlined]
 [17] copyto! at ./broadcast.jl:797 [inlined]
 [18] copy(::Base.Broadcast.Broadcasted{Base.Broadcast.ArrayStyle{CuArray},Tuple{Base.OneTo{Int64}},typeof(==),Tuple{CuArray{Int64,1},Array{Int64,1}}}) at ./broadcast.jl:773
 [19] materialize(::Base.Broadcast.Broadcasted{Base.Broadcast.ArrayStyle{CuArray},Nothing,typeof(==),Tuple{CuArray{Int64,1},Array{Int64,1}}}) at ./broadcast.jl:753
 [20] accuracy(::CuArray{Float32,4}, ::Flux.OneHotMatrix{CuArray{Flux.OneHotVector,1}}) at /home/tyler/code/model-zoo/vision/mnist/conv.jl:85
 [21] top-level scope at /home/tyler/code/model-zoo/vision/mnist/conv.jl:100 [inlined]
 [22] top-level scope at ./none:0
 [23] include at ./boot.jl:326 [inlined]
 [24] include_relative(::Module, ::String) at ./loading.jl:1038
 [25] include(::Module, ::String) at ./sysimg.jl:29
 [26] exec_options(::Base.JLOptions) at ./client.jl:267
 [27] _start() at ./client.jl:436
in expression starting at /home/tyler/code/model-zoo/vision/mnist/conv.jl:94

I'm not sure if this is a problem in the model zoo or elsewhere. I'm on CUDA 10.0 and CUDNN 7.3.1. Here are the test results from CuArrays:

Test Summary:                          | Pass  Error  Total
CuArrays                               | 4011      4   4015
  GPUArrays test suite                 | 1023          1023
  Memory                               |    5             5
  Array                                |   19            19
  Adapt                                |    2             2
  Broadcast                            |   10            10
  Cufunc                               |    6             6
  Ref Broadcast                        |    1             1
  Broadcast Fix                        |    4             4
  Reduce                               |    6             6
  0D                                   |    2             2
  Slices                               |   17            17
  Reshape                              |    1             1
  LinearAlgebra.triu! with diagonal -2 |    1             1
  LinearAlgebra.triu! with diagonal -1 |    1             1
  LinearAlgebra.triu! with diagonal 0  |    1             1
  LinearAlgebra.triu! with diagonal 1  |    1             1
  LinearAlgebra.triu! with diagonal 2  |    1             1
  LinearAlgebra.tril! with diagonal -2 |    1             1
  LinearAlgebra.tril! with diagonal -1 |    1             1
  LinearAlgebra.tril! with diagonal 0  |    1             1
  LinearAlgebra.tril! with diagonal 1  |    1             1
  LinearAlgebra.tril! with diagonal 2  |    1             1
  Utilities                            |    2             2
  accumulate                           |    8             8
  logical indexing                     |   15            15
  CUDNN                                |   41            41
  CUBLAS                               | 1164          1164
  CUSPARSE                             | 1140          1140
  CUSOLVER                             |  269      4    273
    elty = Float32                     |   67      1     68
      Cholesky (po)                    |    8             8
      getrf!                           |           1      1
      getrs!                           |    3             3
      geqrf!                           |    1             1
      ormqr!                           |    4             4
      orgqr!                           |    2             2
      sytrf!                           |    4             4
      gebrd!                           |    4             4
      syevd!                           |    3             3
      sygvd!                           |    5             5
      syevj!                           |    4             4
      svd with QRAlgorithm method      |    7             7
      svd with QRAlgorithm method      |    1             1
      svd with JacobiAlgorithm method  |    7             7
      svd with JacobiAlgorithm method  |    7             7
      qr                               |    7             7
    elty = Float64                     |   67      1     68
      Cholesky (po)                    |    8             8
      getrf!                           |           1      1
      getrs!                           |    3             3
      geqrf!                           |    1             1
      ormqr!                           |    4             4
      orgqr!                           |    2             2
      sytrf!                           |    4             4
      gebrd!                           |    4             4
      syevd!                           |    3             3
      sygvd!                           |    5             5
      syevj!                           |    4             4
      svd with QRAlgorithm method      |    7             7
      svd with QRAlgorithm method      |    1             1
      svd with JacobiAlgorithm method  |    7             7
      svd with JacobiAlgorithm method  |    7             7
      qr                               |    7             7
    elty = Complex{Float32}            |   67      1     68
      Cholesky (po)                    |    8             8
      getrf!                           |           1      1
      getrs!                           |    3             3
      geqrf!                           |    1             1
      ormqr!                           |    4             4
      orgqr!                           |    2             2
      sytrf!                           |    4             4
      gebrd!                           |    4             4
      syevd!                           |    3             3
      sygvd!                           |    5             5
      syevj!                           |    4             4
      svd with QRAlgorithm method      |    7             7
      svd with QRAlgorithm method      |    1             1
      svd with JacobiAlgorithm method  |    7             7
      svd with JacobiAlgorithm method  |    7             7
      qr                               |    7             7
    elty = Complex{Float64}            |   67      1     68
      Cholesky (po)                    |    8             8
      getrf!                           |           1      1
      getrs!                           |    3             3
      geqrf!                           |    1             1
      ormqr!                           |    4             4
      orgqr!                           |    2             2
      sytrf!                           |    4             4
      gebrd!                           |    4             4
      syevd!                           |    3             3
      sygvd!                           |    5             5
      syevj!                           |    4             4
      svd with QRAlgorithm method      |    7             7
      svd with QRAlgorithm method      |    1             1
      svd with JacobiAlgorithm method  |    7             7
      svd with JacobiAlgorithm method  |    7             7
      qr                               |    7             7
  CUFFT                                |  150           150
  CURAND                               |   32            32
  CUSPARSE + CUSOLVER                  |   84            84
ERROR: LoadError: Some tests did not pass: 4011 passed, 0 failed, 4 errored, 0 broken.
in expression starting at /home/tyler/.julia/packages/CuArrays/qZCAt/test/runtests.jl:25
ERROR: Package CuArrays errored during testing

And here are the test results for Flux:

[ Info: Testing Flux/CUDNN
batch_size = 1: Error During Test at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:7
  Got exception outside of a @test
  CUDNNError(code 3, CUDNN_STATUS_BAD_PARAM)
  Stacktrace:
   [1] macro expansion at /home/tyler/.julia/packages/CuArrays/qZCAt/src/dnn/error.jl:19 [inlined]
   [2] cudnnRNNBackwardData(::Flux.CUDA.RNNDesc{Float32}, ::Int64, ::Array{CuArrays.CUDNN.TensorDesc,1}, ::CuArray{Float32,1}, ::Array{CuArrays.CUDNN.TensorDesc,1}, ::CuArray{Float32,1}, ::CuArrays.CUDNN.TensorDesc, ::CuArray{Float32,1}, ::Ptr{Nothing}, ::CUDAdrv.CuPtr{Nothing}, ::CuArrays.CUDNN.FilterDesc, ::CuArray{Float32,1}, ::CuArrays.CUDNN.TensorDesc, ::CuArray{Float32,1}, ::Ptr{Nothing}, ::CUDAdrv.CuPtr{Nothing}, ::Array{CuArrays.CUDNN.TensorDesc,1}, ::CuArray{Float32,1}, ::CuArrays.CUDNN.TensorDesc, ::CuArray{Float32,1}, ::Ptr{Nothing}, ::CUDAdrv.CuPtr{Nothing}, ::CuArray{UInt8,1}, ::CuArray{UInt8,1}) at /home/tyler/.julia/packages/Flux/lz7S9/src/cuda/curnn.jl:170
   [3] backwardData(::Flux.CUDA.RNNDesc{Float32}, ::CuArray{Float32,1}, ::CuArray{Float32,1}, ::CuArray{Float32,1}, ::Nothing, ::CuArray{Float32,1}, ::Nothing, ::CuArray{UInt8,1}) at /home/tyler/.julia/packages/Flux/lz7S9/src/cuda/curnn.jl:187
   [4] backwardData(::Flux.CUDA.RNNDesc{Float32}, ::CuArray{Float32,1}, ::CuArray{Float32,1}, ::CuArray{Float32,1}, ::CuArray{Float32,1}, ::CuArray{UInt8,1}) at /home/tyler/.julia/packages/Flux/lz7S9/src/cuda/curnn.jl:195
   [5] (::getfield(Flux.CUDA, Symbol("##8#9")){Flux.GRUCell{TrackedArray{…,CuArray{Float32,2}},TrackedArray{…,CuArray{Float32,1}}},TrackedArray{…,CuArray{Float32,1}},TrackedArray{…,CuArray{Float32,1}},CuArray{UInt8,1},Tuple{CuArray{Float32,1},CuArray{Float32,1}}})(::Tuple{CuArray{Float32,1},CuArray{Float32,1}}) at /home/tyler/.julia/packages/Flux/lz7S9/src/cuda/curnn.jl:306
   [6] back_(::Tracker.Call{getfield(Flux.CUDA, Symbol("##8#9")){Flux.GRUCell{TrackedArray{…,CuArray{Float32,2}},TrackedArray{…,CuArray{Float32,1}}},TrackedArray{…,CuArray{Float32,1}},TrackedArray{…,CuArray{Float32,1}},CuArray{UInt8,1},Tuple{CuArray{Float32,1},CuArray{Float32,1}}},Tuple{Tracker.Tracked{CuArray{Float32,1}},Tracker.Tracked{CuArray{Float32,1}},Tracker.Tracked{CuArray{Float32,2}},Tracker.Tracked{CuArray{Float32,2}},Tracker.Tracked{CuArray{Float32,1}}}}, ::Tuple{CuArray{Float32,1},CuArray{Float32,1}}, ::Bool) at /home/tyler/.julia/packages/Tracker/6wcYJ/src/back.jl:35
   [7] back(::Tracker.Tracked{Tuple{CuArray{Float32,1},CuArray{Float32,1}}}, ::Tuple{CuArray{Float32,1},Int64}, ::Bool) at /home/tyler/.julia/packages/Tracker/6wcYJ/src/back.jl:58
   [8] (::getfield(Tracker, Symbol("##13#14")){Bool})(::Tracker.Tracked{Tuple{CuArray{Float32,1},CuArray{Float32,1}}}, ::Tuple{CuArray{Float32,1},Int64}) at /home/tyler/.julia/packages/Tracker/6wcYJ/src/back.jl:38
   [9] foreach(::Function, ::Tuple{Tracker.Tracked{Tuple{CuArray{Float32,1},CuArray{Float32,1}}},Nothing}, ::Tuple{Tuple{CuArray{Float32,1},Int64},Nothing}) at ./abstractarray.jl:1867
   [10] back_(::Tracker.Call{getfield(Tracker, Symbol("##356#358")){Tracker.TrackedTuple{Tuple{CuArray{Float32,1},CuArray{Float32,1}}},Int64},Tuple{Tracker.Tracked{Tuple{CuArray{Float32,1},CuArray{Float32,1}}},Nothing}}, ::CuArray{Float32,1}, ::Bool) at /home/tyler/.julia/packages/Tracker/6wcYJ/src/back.jl:38
   [11] back(::Tracker.Tracked{CuArray{Float32,1}}, ::CuArray{Float32,1}, ::Bool) at /home/tyler/.julia/packages/Tracker/6wcYJ/src/back.jl:58
   [12] back!(::TrackedArray{…,CuArray{Float32,1}}, ::CuArray{Float32,1}) at /home/tyler/.julia/packages/Tracker/6wcYJ/src/back.jl:77
   [13] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:23
   [14] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
   [15] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:7
   [16] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
   [17] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
   [18] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1083
   [19] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
   [20] include at ./boot.jl:326 [inlined]
   [21] include_relative(::Module, ::String) at ./loading.jl:1038
   [22] include(::Module, ::String) at ./sysimg.jl:29
   [23] include(::String) at ./client.jl:403
   [24] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/cuda.jl:45
   [25] include at ./boot.jl:326 [inlined]
   [26] include_relative(::Module, ::String) at ./loading.jl:1038
   [27] include(::Module, ::String) at ./sysimg.jl:29
   [28] include(::String) at ./client.jl:403
   [29] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/runtests.jl:30
   [30] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1083
   [31] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/runtests.jl:11
   [32] include at ./boot.jl:326 [inlined]
   [33] include_relative(::Module, ::String) at ./loading.jl:1038
   [34] include(::Module, ::String) at ./sysimg.jl:29
   [35] include(::String) at ./client.jl:403
   [36] top-level scope at none:0
   [37] eval(::Module, ::Any) at ./boot.jl:328
   [38] exec_options(::Base.JLOptions) at ./client.jl:243
   [39] _start() at ./client.jl:436
batch_size = 5: Test Failed at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:26
  Expression: ((rnn.cell).Wi).grad ≈ collect(((curnn.cell).Wi).grad)
   Evaluated: Float32[0.0264221 0.0311623 … 0.0401529 0.0481648; -0.00580488 -0.00684245 … -0.000633113 -0.00512767; … ; -0.488995 -0.397307 … -0.543058 -0.593239; -1.59764 -1.72794 … -1.24722 -2.0625] ≈ Float32[0.0163969 0.0200817 … 0.0357488 0.036319; -0.00383122 -0.00466102 … 0.000233916 -0.00279561; … ; -0.655613 -0.581466 … -0.616253 -0.790114; -1.32217 -1.42347 … -1.1262 -1.737]
Stacktrace:
 [1] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:26
 [2] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [3] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:7
 [4] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [5] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
 [6] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1083
 [7] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
batch_size = 5: Test Failed at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:27
  Expression: ((rnn.cell).Wh).grad ≈ collect(((curnn.cell).Wh).grad)
   Evaluated: Float32[-0.0155654 0.00326925 … -0.0217709 -0.00794538; 0.00331486 -0.000961465 … 0.00208863 -0.000682507; … ; 0.0600955 -0.0379895 … 0.0877803 0.056216; 0.13519 -0.049922 … 0.1269 0.0464798] ≈ Float32[-0.0102084 0.00063507 … -0.0177858 -0.00776742; 0.00226024 -0.000442876 … 0.00130408 -0.000717541; … ; 0.0800566 -0.047805 … 0.10263 0.0568791; 0.120198 -0.04255 … 0.115747 0.0459818]
Stacktrace:
 [1] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:27
 [2] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [3] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:7
 [4] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [5] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
 [6] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1083
 [7] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
batch_size = 5: Test Failed at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:28
  Expression: ((rnn.cell).b).grad ≈ collect(((curnn.cell).b).grad)
   Evaluated: Float32[0.0431294, -0.00524064, 0.00328069, 0.0383973, -0.0277038, -0.190451, 0.378964, 0.113537, -1.10465, -0.109885, -0.380812, -0.806893, -0.591657, -0.840546, -2.16732] ≈ Float32[0.0309576, -0.0028444, 0.00784941, 0.0508012, -0.0251556, -0.116478, 0.196908, 0.156512, -1.30342, -0.109, -0.0338723, -0.537076, -0.840815, -1.04284, -1.83286]
Stacktrace:
 [1] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:28
 [2] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [3] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:7
 [4] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [5] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
 [6] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1083
 [7] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
batch_size = 5: Test Failed at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:29
  Expression: ((rnn.cell).h).grad ≈ collect(((curnn.cell).h).grad)
   Evaluated: Float32[-0.0128217, -0.706767, -0.358611, -1.41596, -0.547122] ≈ Float32[0.0667142, -0.127551, -0.454483, -1.79769, -0.414044]
Stacktrace:
 [1] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:29
 [2] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [3] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:7
 [4] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1156
 [5] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
 [6] top-level scope at /build/source/usr/share/julia/stdlib/v1.1/Test/src/Test.jl:1083
 [7] top-level scope at /home/tyler/.julia/packages/Flux/lz7S9/test/cuda/curnn.jl:4
Test Summary:        | Pass  Fail  Error  Total
Flux                 |  246     4      1    251
  Throttle           |   11                  11
  Jacobian           |    1                   1
  Initialization     |   12                  12
  Params             |    2                   2
  Basic Stacking     |    1                   1
  Precision          |    6                   6
  Stacking           |    3                   3
  onecold            |    4                   4
  Optimise           |   11                  11
  Optimiser          |    3                   3
  Training Loop      |    2                   2
  basic              |   25                  25
  Dropout            |    8                   8
  BatchNorm          |   14                  14
  InstanceNorm       |   16                  16
  GroupNorm          |   16                  16
  losses             |   30                  30
  Pooling            |    2                   2
  CNN                |    1                   1
  Depthwise Conv     |    4                   4
  Tracker            |    4                   4
  CuArrays           |    8                   8
  CUDNN BatchNorm    |   10                  10
  RNN                |   40     4      1     45
    R = Flux.RNN     |   16                  16
    R = Flux.GRU     |    6     4      1     11
      batch_size = 1 |    2            1      3
      batch_size = 5 |    4     4             8
    R = Flux.LSTM    |   18                  18
ERROR: LoadError: Some tests did not pass: 246 passed, 4 failed, 1 errored, 0 broken.
in expression starting at /home/tyler/.julia/packages/Flux/lz7S9/test/runtests.jl:9
ERROR: Package Flux errored during testing

This appears to be a known issue: FluxML/Flux.jl#267

conversion to pointer not defined for CuArray{Float32,2}

With the line `using CuArrays` uncommented, I get the following error:

# julia conv.jl 
ERROR: LoadError: conversion to pointer not defined for CuArray{Float32,2}
Stacktrace:
 [1] #conv2d!#43(::Tuple{Int64,Int64}, ::Tuple{Int64,Int64}, ::Int64, ::Float32, ::Function, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at /root/.julia/v0.6/NNlib/src/impl/conv.jl:174
 [2] (::NNlib.#kw##conv2d!)(::Array{Any,1}, ::NNlib.#conv2d!, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at ./<missing>:0
 [3] (::NNlib.#kw##conv!)(::Array{Any,1}, ::NNlib.#conv!, ::CuArray{Float32,4}, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at ./<missing>:0
 [4] #conv#53(::Tuple{Int64,Int64}, ::Tuple{Int64,Int64}, ::Function, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at /root/.julia/v0.6/NNlib/src/conv.jl:29
 [5] (::NNlib.#kw##conv)(::Array{Any,1}, ::NNlib.#conv, ::CuArray{Float32,4}, ::CuArray{Float32,4}) at ./<missing>:0
 [6] track(::Flux.Tracker.Call{Flux.Tracker.#_conv,Tuple{CuArray{Float32,4},TrackedArray{…,CuArray{Float32,4}},Tuple{Int64,Int64},Tuple{Int64,Int64}}}) at /root/.julia/v0.6/Flux/src/tracker/Tracker.jl:41
 [7] #conv#23(::Tuple{Int64,Int64}, ::Tuple{Int64,Int64}, ::Function, ::CuArray{Float32,4}, ::TrackedArray{…,CuArray{Float32,4}}) at /root/.julia/v0.6/Flux/src/tracker/array.jl:252
 [8] (::NNlib.#kw##conv)(::Array{Any,1}, ::NNlib.#conv, ::CuArray{Float32,4}, ::TrackedArray{…,CuArray{Float32,4}}) at ./<missing>:0
 [9] (::Flux.Conv{2,NNlib.#relu,TrackedArray{…,CuArray{Float32,4}},TrackedArray{…,CuArray{Float32,1}}})(::CuArray{Float32,4}) at /root/.julia/v0.6/Flux/src/layers/conv.jl:39
 [10] mapfoldl_impl(::Base.#identity, ::Flux.##81#82, ::CuArray{Float32,4}, ::Array{Any,1}, ::Int64) at ./reduce.jl:43
 [11] (::Flux.Chain)(::CuArray{Float32,4}) at /root/.julia/v0.6/Flux/src/layers/basic.jl:31
 [12] include_from_node1(::String) at ./loading.jl:576
 [13] include(::String) at ./sysimg.jl:14
 [14] process_options(::Base.JLOptions) at ./client.jl:305
 [15] _start() at ./client.jl:371
while loading model-zoo/mnist/conv.jl, in expression starting on line 30

Better output of test prediction in "Housing" model

When the housing.jl file is run, the last line is not printed to the console (it outputs correctly when run line by line).
Also, the test example is currently the first record in the dataset; I think it would be better to use a random record from the dataset for the prediction.
All of this can be implemented in 2–3 lines of code, as sketched below.
I can work on this if no one else is already working on it.
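
A minimal sketch of those lines (the names `model`, `test_x`, and `test_y` are my assumptions, not necessarily those used in housing.jl):

# pick a random test record and print the prediction explicitly,
# so it also shows when the file is run as a script
i = rand(1:size(test_x, 2))
println("prediction: ", model(test_x[:, i]), "  actual: ", test_y[i])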

kl-divergence formula in vae seems wrong

The KL divergence between q = N(μ, σ²) and the standard normal p = N(0, 1) should be

kl_q_p(μ, logσ) = 0.5 * sum(σ.^2 .+ μ.^2 .- 1 .- log.(σ.^2))

but the formula in vae is

kl_q_p(μ, logσ) = 0.5 * sum(exp.(2 .* logσ) + μ.^2 .- 1 .+ logσ.^2)

it should be

kl_q_p(μ, logσ) = 0.5 * sum(exp.(2 .* logσ) + μ.^2 .- 1 .- 2 .* logσ) 

i.e., the last term should change from `+ logσ.^2` to `- 2 .* logσ`.
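
A quick numeric sanity check of the corrected formula (my addition, plain Julia): compare the closed form against a Monte Carlo estimate of KL(q ‖ p).

using Statistics

μ, logσ = 0.7, -0.3
σ = exp(logσ)

# corrected closed form: 0.5 * (σ² + μ² - 1 - 2logσ)
kl_closed = 0.5 * (exp(2logσ) + μ^2 - 1 - 2logσ)

# Monte Carlo estimate of E_q[log q(z) - log p(z)], with z ~ N(μ, σ²)
z = μ .+ σ .* randn(10^6)
logq = @. -0.5 * ((z - μ) / σ)^2 - logσ - 0.5 * log(2π)
logp = @. -0.5 * z^2 - 0.5 * log(2π)
kl_mc = mean(logq - logp)

@show kl_closed kl_mc   # these agree to a few decimal places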

GSoC Proposal

I want to participate in GSoC, make a contribution to the Flux.jl model zoo, and implement the following algorithms:

  • RBMs
  • Siamese nets
  • SARSA(λ)
  • TD(λ)
  • An implementation of a sequence-to-sequence network
  • PixelRNN
  • Attention network for generating captions
  • GRUs
  • An easy-to-call implementation of all the learning updates
  • VAE
  • DCGANs
  • DenseNet

And apart from model implementations, I would like to propose a tutorial repo under Flux.jl in which each implementation will be explained in a Jupyter notebook.

@MikeInnes is this a good enough project for GSoC? I would be glad to hear any feedback from you!

word based language model with word embeddings

Hello,

I want to build a word-level language model, so I was reading the char-rnn code in the zoo. In that code, one-hot encodings are used to represent the characters. I was wondering how we would create word embeddings and feed them to the same network, and how the softmax would handle word-embedding vectors.

Here is the code that I "wrote":

using Flux
using Flux: batchseq, chunk
using Base.Iterators: partition

lines = readlines("input.txt")  # assumed corpus, one sentence per line

@time sentences = [map(word -> String(word), split(line, " ")) for line in lines]
@time vocabulary = sort(unique(vcat(sentences...)))
# adding 'eos' as end-of-sentence tag
in("eos", vocabulary) || push!(vocabulary, "eos")

# if we create one-hot vectors then we will have a 65k x 252k binary matrix! Too much!
# @time hots = map(word -> Flux.onehot(word, vocabulary), vcat(sentences...));
# the alternative is to create word embeddings!

N = length(vocabulary)
embedSize = 128

dictionary = Dict{String,Int32}()
@time for (v, k) in enumerate(vocabulary)
    dictionary[k] = v
end
wordEmbeddings = rand(Float32, embedSize, N)  # each column refers to one word in the vocabulary
# wordEmbeddings[:, dictionary[word]] gives the embedding of that specific word

seqlen = 50
nbatch = 50

eos = wordEmbeddings[:, dictionary["eos"]]

@time text = map(sentence -> map(word -> wordEmbeddings[:, dictionary[word]], sentence), sentences)
text = vcat(text...)

Xs = collect(partition(batchseq(chunk(text, nbatch), eos), seqlen))
Ys = collect(partition(batchseq(chunk(text[2:end], nbatch), eos), seqlen))

# this is where we create the model, a one-layer GRU; note that its input
# size must be embedSize (the embedding dimension), not N
m = Chain(
  GRU(embedSize, 128),
  Dense(128, N),
  softmax)
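
One possible way to make the embeddings trainable (my sketch, not an answer from this thread, assuming an older Tracker-era Flux where `param` marks trainable arrays): multiplying the embedding matrix by a one-hot vector selects a column, so the matrix acts as a lookup table, and the final Dense(128, N) plus softmax still produces a distribution over the N vocabulary words.

using Flux
using Flux: onehot

embed = param(randn(Float32, embedSize, N))   # trainable embedding matrix
m = Chain(x -> embed * x,                     # one-hot input selects a column
          GRU(embedSize, 128),
          Dense(128, N),
          softmax)

m(onehot("eos", vocabulary))                  # a distribution over N words
# NB: when training, `embed` must be included in the parameter list,
# since `params(m)` cannot see inside the closure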

Run everything on GPUs

There should be an easy way to run the whole model zoo on GPUs: some kind of flag at the top of each file, or perhaps an environment variable.
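
A rough sketch of what such a switch might look like (my illustration; MODEL_ZOO_GPU is a made-up variable name):

using Flux

# opt in to GPU execution via an environment variable; Flux's `gpu`
# falls back to a no-op when CUDA support isn't loaded
const USE_GPU = get(ENV, "MODEL_ZOO_GPU", "false") == "true"
maybegpu(x) = USE_GPU ? gpu(x) : x

m = maybegpu(Chain(Dense(28^2, 32, relu), Dense(32, 10), softmax))
x = maybegpu(rand(Float32, 28^2, 16))
m(x)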

Bad results from char-rnn after CPU training

I might be doing things the wrong way, but after playing with the char-rnn example for some time, I wonder whether other people can get good performance from it. It is based on a blog post that shows impressive text-generation capabilities after training, and char-rnn.jl seems to be trying to replicate them, but my results are nowhere near those of the blog post. Are people tweaking the example code to get better results, or can anyone get good results with the "vanilla" settings?

GSoC Proposal 2018

I am interested in contributing to the model zoo as part of GSoC. Below are the models I want to implement over the course of the summer:

1. KNN
2. Apriori
3. Naive Bayes Classification
4. Association Rules
5. Q-Learning
6. K-means
7. Deep Q Network
8. Genetic algorithm

@MikeInnes Is this project good enough for GSoC?

The MNIST model needs nursing

When running the mnist.jl example (by simply copying everything into the REPL), I get the following:

julia> using Flux, MNIST

julia> using Flux: onehotbatch, argmax, mse, throttle

julia> using Base.Iterators: repeated

julia> x, y = traindata()
WARNING: Array{T}(::Type{T}, m::Int, n::Int) is deprecated, use Array{T}(m, n) instead.
Stacktrace:
 [1] depwarn(::String, ::Symbol) at ./deprecated.jl:70
 [2] Array(::Type{Float64}, ::Int64, ::Int64) at ./deprecated.jl:57
 [3] traindata() at /home/troels/.julia/v0.6/MNIST/src/MNIST.jl:88
 [4] eval(::Module, ::Any) at ./boot.jl:235
 [5] eval_user_input(::Any, ::Base.REPL.REPLBackend) at ./REPL.jl:66
 [6] macro expansion at ./REPL.jl:97 [inlined]
 [7] (::Base.REPL.##1#2{Base.REPL.REPLBackend})() at ./event.jl:73
while loading no file, in expression starting on line 0
WARNING: Array{T}(::Type{T}, m::Int) is deprecated, use Array{T}(m) instead.
Stacktrace:
 [1] depwarn(::String, ::Symbol) at ./deprecated.jl:70
 [2] Array(::Type{Float64}, ::Int64) at ./deprecated.jl:57
 [3] traindata() at /home/troels/.julia/v0.6/MNIST/src/MNIST.jl:89
 [4] eval(::Module, ::Any) at ./boot.jl:235
 [5] eval_user_input(::Any, ::Base.REPL.REPLBackend) at ./REPL.jl:66
 [6] macro expansion at ./REPL.jl:97 [inlined]
 [7] (::Base.REPL.##1#2{Base.REPL.REPLBackend})() at ./event.jl:73
while loading no file, in expression starting on line 0
([0.0 0.0 … 0.0 0.0; 0.0 0.0 … 0.0 0.0; … ; 0.0 0.0 … 0.0 0.0; 0.0 0.0 … 0.0 0.0], [5.0, 0.0, 4.0, 1.0, 9.0, 2.0, 1.0, 3.0, 1.0, 4.0  …  9.0, 2.0, 9.0, 5.0, 1.0, 8.0, 3.0, 5.0, 6.0, 8.0])

julia> y = onehotbatch(y, 0:9)
10×60000 Flux.OneHotMatrix:
 false   true  false  false  false  false  false  …  false  false  false  false  false  false  false
 false  false  false   true  false  false   true     false   true  false  false  false  false  false
 false  false  false  false  false   true  false     false  false  false  false  false  false  false
 false  false  false  false  false  false  false     false  false  false   true  false  false  false
 false  false   true  false  false  false  false     false  false  false  false  false  false  false
  true  false  false  false  false  false  false  …   true  false  false  false   true  false  false
 false  false  false  false  false  false  false     false  false  false  false  false   true  false
 false  false  false  false  false  false  false     false  false  false  false  false  false  false
 false  false  false  false  false  false  false     false  false   true  false  false  false   true
 false  false  false  false   true  false  false     false  false  false  false  false  false  false

julia> m = Chain(
                Dense(28^2, 32, σ),
                Dense(32, 10),
                softmax)
Chain(Dense(784, 32, NNlib.σ), Dense(32, 10), NNlib.softmax)

julia> loss(x, y) = mse(m(x), y)
loss (generic function with 1 method)

julia> Flux.train!(loss, repeated((x,y), 1000), SGD(params(m), 0.1),
                          cb = throttle(() -> @show(loss(x, y)), 5))
loss(x, y) = param(0.0899937)
loss(x, y) = param(0.0896123)
loss(x, y) = param(0.0887763)
loss(x, y) = param(0.0872013)
loss(x, y) = param(0.084333)
loss(x, y) = param(0.0793369)
loss(x, y) = param(0.0737908)
loss(x, y) = param(0.0664382)
loss(x, y) = param(0.0584718)
loss(x, y) = param(0.0506686)
loss(x, y) = param(0.0428273)
loss(x, y) = param(NaN)
loss(x, y) = param(NaN)
loss(x, y) = param(NaN)

... after which I terminated the run.

As you can see (in addition to the warnings), the training somehow fails: the loss collapses to NaN. (This happens after some time, every time I've repeated the process.)
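
For what it's worth, a possible mitigation (my guess, not a confirmed diagnosis): mse on softmax outputs can be numerically fragile here, and pairing softmax with crossentropy is the more usual choice. Continuing the REPL session above:

using Flux: crossentropy

# same training loop as above, but with a loss better matched to softmax outputs
loss(x, y) = crossentropy(m(x), y)
Flux.train!(loss, repeated((x, y), 1000), SGD(params(m), 0.1),
            cb = throttle(() -> @show(loss(x, y)), 5))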

spell out acronyms in README.md

A friendly suggestion: Could you spell out the acronyms in the README.md? Like "RNN (Recurrent Neural Net)" or vice versa?

This would make the page much more comprehensible to people who are not already steeped in machine learning.

GSoC Proposal

I am interested in contributing to the model zoo of FluxML as part of GSoC'18. Below are the models which I want to implement over the course of summer:

@MikeInnes Does it sound like a good project to work on?

Next steps for notebooks and tutorials

With #59 we have good infrastructure for generating notebooks. There are a couple more steps we need:

  • Content. This could be taken from existing open-source tutorials (#68) and we should also port existing notebooks like these and these.
  • Auto build notebooks – we can do this on our CI, like Documenter, and push them to a branch of this repo.
  • JuliaBox – really just the same as the above, but push them to the JuliaBoxTutorials repo, which gets synced to JuliaBox users' drives.
  • Flux Website – we should have a tutorials section on the website, built from these notebooks. We'll need to build HTML from the notebooks we have (ideally with niceties like ToC, but that's secondary) and push them to the Flux website's github pages.

Useful resources:

Dataset, DataLoaders, transforms for julia

Hi,
I am a GSoC-19 aspirant. While going through some of the code in the model zoo, I noticed that data is currently loaded in an ad-hoc manner. I see that MLDatasets tries to improve upon this, but I was wondering if we could have something like PyTorch's Datasets, DataLoaders, and Transforms. I can work on that, and I can also contribute to the zoo by adding examples from the zero/few-shot learning domain and by creating wrappers around some common generative models, such as mixture models and VAEs, plus a general wrapper for GAN/WGAN/WGAN-GP that one could call with custom generators and discriminators.

I see this coming off as a single tutorial demonstrating the use of the loaders and the train/test APIs (with some interesting few-shot learning examples).

Thoughts?

@ViralBShah @staticfloat @avik-pal @dhairyagandhi96
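
(Flux did later gain Flux.Data.DataLoader, and MLUtils now provides DataLoader too.) As a sketch of the idea, a toy PyTorch-style loader in plain Julia could look like this (my illustration, not an existing API):

using Random

# an iterator over shuffled mini-batches of (X, Y), samples stored column-wise
struct DataLoader{X,Y}
    x::X
    y::Y
    batchsize::Int
end

function Base.iterate(d::DataLoader, idxs = randperm(size(d.x, 2)))
    isempty(idxs) && return nothing
    n = min(d.batchsize, length(idxs))
    batch, rest = idxs[1:n], idxs[n+1:end]
    (d.x[:, batch], d.y[:, batch]), rest
end

# usage: for (xb, yb) in DataLoader(X, Y, 128) ... end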

No decrease in loss when using model without chain

I am using the same code as in mlp.jl, but I kept the same model without using Chain; it looks like this:

l1 = Dense(28^2,32, relu)
l2 = Dense(32, 10)
m(x) = softmax(l2(l1(x)))

Now whenever I run the model, the loss doesn't decrease during training.
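
A likely cause (my assumption; not confirmed in the thread): m here is a plain function, so params(m) cannot discover the weights of l1 and l2, and the optimiser has nothing to update. Collecting the parameters from the layers themselves should restore training:

using Flux

l1 = Dense(28^2, 32, relu)
l2 = Dense(32, 10)
m(x) = softmax(l2(l1(x)))

# collect the trainable parameters from the layers directly, since the
# closure `m` is opaque to `params`
ps = Flux.params(l1, l2)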

In MNIST MLP, `argmax` and `mean` problems

In model-zoo/vision/mnist/mlp.jl

accuracy(x, y) = mean(argmax(m(x)) .== argmax(y))

Throws a warning:

Warning: `argmax(...)` is deprecated, use `onecold(...)` instead.

The suggested `onecold` replacement is missing. And calling it throws an error:

ERROR: UndefVarError: mean not defined

Adding `using Statistics` fixes this issue; perhaps it is lacking somewhere?
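
Putting both fixes together, the accuracy definition becomes:

using Statistics: mean
using Flux: onecold

accuracy(x, y) = mean(onecold(m(x)) .== onecold(y))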

ForwardDiff 0.5.0 required by Flux.jl, but version 0.10.1 used in Manifest.toml

In cifar10/Manifest.toml, the ForwardDiff version is 0.10.1, but the Flux package requires version 0.5.0 of it to work. Running the project code (after instantiating the project with `instantiate`, which of course uses Manifest.toml) gives an error saying Float32() cannot be run on a ForwardDiff object.
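
For anyone reproducing this, a small sketch (standard Pkg commands, not taken from the report) to confirm which ForwardDiff version the manifest pins:

using Pkg
Pkg.activate(".")            # from inside the cifar10 folder
Pkg.instantiate()            # installs exactly what Manifest.toml pins
Pkg.status("ForwardDiff")    # should report v0.10.1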
