Giter VIP home page Giter VIP logo

pytorch-splitnet's Introduction

SplitNet PyTorch Implementation

PyTorch implementation of SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Parallelization, ICML 2017

model-architecture

Results

Learned Convolution Filters of g2-b1-w3 (left) and g3-b1-w3 (right).

Learned Group Indicators of g2-b2-p (left) and g3-b2-p (right).

Installation

$ git clone https://github.com:kuc2477/dl-papers && cd dl-papers
$ pip install -r requirements.txt

CLI

Implementation CLI is provided by main.py.

Usage

$ ./main.py --help
$ usage: SplitNet torch implementation [-h] [--dataset {cifar10,mnist,cifar100}]
                                 [--total-block-number TOTAL_BLOCK_NUMBER]
                                 [--widen-factor WIDEN_FACTOR]
                                 [--baseline-strides BASELINE_STRIDES [BASELINE_STRIDES ...]]
                                 [--baseline-channels BASELINE_CHANNELS [BASELINE_CHANNELS ...]]
                                 [--split-sizes SPLIT_SIZES [SPLIT_SIZES ...]]
                                 [--gamma1 GAMMA1] [--gamma2 GAMMA2]
                                 [--gamma3 GAMMA3]
                                 [--weight-decay WEIGHT_DECAY]
                                 [--dropout-prob DROPOUT_PROB] [--lr LR]
                                 [--lr-decay LR_DECAY]
                                 [--lr-decay-epochs LR_DECAY_EPOCHS [LR_DECAY_EPOCHS ...]]
                                 [--epochs EPOCHS] [--batch-size BATCH_SIZE]
                                 [--test-size TEST_SIZE]
                                 [--eval-log-interval EVAL_LOG_INTERVAL]
                                 [--loss-log-interval LOSS_LOG_INTERVAL]
                                 [--weight-log-interval WEIGHT_LOG_INTERVAL]
                                 [--checkpoint-interval CHECKPOINT_INTERVAL]
                                 [--model-dir MODEL_DIR]
                                 [--resume-best | --resume-latest] [--best]
                                 [--no-gpus] (--test | --train)
 

Train

$ python -m visom.server &
$ ./main.py --train [--resume-latest|--resume-best]

Test

$ ./main.py --test

Note

  • Block diagonal weight utilizing test phase is not implemented yet.

Reference

Author

Ha Junsoo / @kuc2477 / MIT License

pytorch-splitnet's People

Contributors

kuc2477 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.