Giter VIP home page Giter VIP logo

neurops's Introduction

NeurOps

Neuron representations (Adapted from a Maile et al. (2022)) This repository contains an implementation of neural operations: growing and pruning for architectural optimization of neural networks. It extends the basic operations and functionality of https://github.com/k8lion/Neurogenesis from Julia to PyTorch, in addition to implementing techniques from other architectural optimization works. It allows for growing and pruning of basic neural network layers, Linear and Conv2d, from both a masking approach as well as by changing the weight tensor shapes. It also handles normalization layers and optimizer states to enable iterative architectural optimization in addition to standard training. Various metrics and initialization strategies are implemented for informing the neural operations. These techniques are extended to larger models, including deep sequential models and transformers.

The goal of this repository is to streamline and unify architectural optimization techniques to permit further research. We hope to continue to extend the utility of this repository in future updates, including more languages and frameworks (namely Julia/Flux and Jax/Flax), more growing and pruning algorithms, more architecture backbones, more structures to grow and prune, and more subfields of architectural optimization such as neural architecture search. We welcome use of this repository as well as contributions. See pytorch/README.md for further details, as well as tutorial.ipynb for example usage.

neurops's People

Contributors

k8lion avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

rhit-luttredn

neurops's Issues

To_add mostly showing 0 when using VGG-11 architecture

So majorly VGG-11 is showing 0 in front of "neurons to add". Could it be please confirmed whether grow algorithm running is fine or not in this because such an issue was not observed for LeNet.

Just to add, these are few results of last 2 iterations:

Validation: Average loss: 0.0144, Accuracy: 3467/5000 (69.34%)
Layer 0 score: 21/25, neurons to add: 0
Layer 1 score: 34/36, neurons to add: 0
Layer 2 score: 63/67, neurons to add: 0
Layer 3 score: 61/65, neurons to add: 0
Layer 4 score: 124/133, neurons to add: 0
Layer 5 score: 112/128, neurons to add: 0
Layer 6 score: 105/128, neurons to add: 0
Layer 7 score: 78/128, neurons to add: 0
Layer 8 score: 304/1024, neurons to add: 0
Layer 9 score: 475/1024, neurons to add: 0
The grown model now has 1787097 effective parameters.
Validation after growing: Average loss: 0.0143, Accuracy: 3467/5000 (69.34%)
Train Epoch: 0 [0/45000 (0%)] Loss: 0.047333
Train Epoch: 0 [12800/45000 (28%)] Loss: 0.073766
Train Epoch: 0 [25600/45000 (57%)] Loss: 0.026595
Train Epoch: 0 [38400/45000 (85%)] Loss: 0.014527

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.