CS 294-082

Public Repository for CS 294-082 - Experimental Design for Machine Learning on Multimedia Data (Graduate) - Coursework (Spring '19)

Hello! I'm glad you're here. Here is an overview of the project from the original report:

"We use the VGG16Net model developed by the Visual Geometry Group at the University of Oxford and repeatedly train it on the MNIST dataset while attempting to reduce its parameters without significantly reducing model accuracy."

Prerequisites

Clone the repository and create a new virtual environment with Anaconda or virtualenv (I used Anaconda), then install all the needed dependencies.

pip install -r requirements.txt

If deepcompressor isn't installed yet, try pip install -i https://test.pypi.org/simple/ deepcompressor.

Deep Compression Pipeline

Step 1

Run import deepcompressor and create a new CapacityEstimator: ce = CapacityEstimator(). Note: if these two lines don't work even after installing from test.pypi.org, please contact me ASAP. Try pip show deepcompressor to see the root directory. There should be one __init__.py file and 4 other .py files.
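The file check described above can also be done from Python instead of pip show. The following is a minimal sketch using only the standard library; the helper name list_package_files is hypothetical and is not part of deepcompressor.

```python
import importlib.util
from pathlib import Path


def list_package_files(package_name):
    """Return the sorted .py file names in an installed package's directory.

    Returns None if the package cannot be found. This is a hypothetical
    helper for illustration, not part of deepcompressor.
    """
    try:
        spec = importlib.util.find_spec(package_name)
    except (ImportError, ValueError):
        return None
    if spec is None or not spec.submodule_search_locations:
        return None  # not installed, or a single-module distribution
    package_dir = Path(next(iter(spec.submodule_search_locations)))
    return sorted(p.name for p in package_dir.glob("*.py"))


if __name__ == "__main__":
    files = list_package_files("deepcompressor")
    if files is None:
        print("deepcompressor is not installed in this environment")
    else:
        # The README expects __init__.py plus 4 other .py files.
        print(len(files), "files:", files)
```

If the printed list does not contain __init__.py and four other .py files, the install is probably incomplete.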

Step 2

Create the model.

vgg16 = model('vgg16', True)
vgg16.set_capacity_estimator(ce)
vgg16.parallelize()
vgg16.prepare_dataset()

Step 3

Set the training parameters.

num_epochs = 6
max_epochs_stop = 3
num_classes = 10
batch_size = 10
learning_rate = 0.001
print_every = 1

Step 4

Create a DeepCompressor with a Model and its Trainer.

training_params = [num_epochs, max_epochs_stop, num_classes, batch_size, learning_rate, print_every]
deep_compressor = deep_compressor(vgg16, trainer(vgg16, training_params))
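Because training_params is a plain positional list, the order of its entries matters. One way to keep the names attached to the values is a small named tuple that is converted to a list in the same order as in Step 4. This is only a sketch: TrainingParams is a hypothetical helper, and it assumes the trainer accepts a plain list in exactly that order.

```python
from typing import NamedTuple


class TrainingParams(NamedTuple):
    """Hypothetical container; field order mirrors the list in Step 4."""
    num_epochs: int
    max_epochs_stop: int
    num_classes: int
    batch_size: int
    learning_rate: float
    print_every: int


params = TrainingParams(
    num_epochs=6,
    max_epochs_stop=3,
    num_classes=10,
    batch_size=10,
    learning_rate=0.001,
    print_every=1,
)

# Convert back to the positional list used when building the trainer.
training_params = list(params)
```

The resulting training_params list holds the same values, in the same order, as the hand-written one.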

Step 5

Repeatedly train, test and squeeze the network.

deep_compressor.train(dropout=100)
deep_compressor.test() # ce.estimate(25088, 4096, 4096, 256, 10)
deep_compressor.squeeze(100)
deep_compressor.train(dropout=100)
deep_compressor.test() # ce.estimate(25088, 4096-100, 4096, 256, 10)
deep_compressor.squeeze(100)
deep_compressor.train(dropout=100)
deep_compressor.test() # ce.estimate(25088, 4096-200, 4096, 256, 10)
deep_compressor.squeeze(100)
deep_compressor.train(dropout=100)
deep_compressor.test() # ce.estimate(25088, 4096-300, 4096, 256, 10)
deep_compressor.squeeze(100)
deep_compressor.train(dropout=100)
deep_compressor.test() # ce.estimate(25088, 4096-400, 4096, 256, 10)
deep_compressor.squeeze(100)
deep_compressor.train(dropout=100)
deep_compressor.test() # ce.estimate(25088, 4096-500, 4096, 256, 10)
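The six cycles above follow one pattern: train, test, then squeeze before the next cycle, with no squeeze after the final test. A loop can express the same sequence. The sketch below assumes only the three method calls shown above (train(dropout=...), test() and squeeze(...)); run_cycles and estimator_args are hypothetical helpers, and the tuple returned by estimator_args simply reproduces the arguments written in the ce.estimate(...) comments, where the second value drops by 100 per squeeze.

```python
def run_cycles(compressor, num_cycles=6, dropout=100, squeeze_by=100):
    """Train and test `num_cycles` times, squeezing between cycles.

    `compressor` is any object exposing train(dropout=...), test() and
    squeeze(n), as the deep_compressor object in Step 5 does.
    """
    for cycle in range(num_cycles):
        compressor.train(dropout=dropout)
        compressor.test()
        if cycle < num_cycles - 1:  # no squeeze after the final test
            compressor.squeeze(squeeze_by)


def estimator_args(cycle, squeeze_by=100):
    """Arguments from the ce.estimate(...) comments for a 0-based cycle."""
    return (25088, 4096 - cycle * squeeze_by, 4096, 256, 10)
```

With the objects from the earlier steps, this would be called as run_cycles(deep_compressor).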

Step 6

Plot Generalization vs Capacity and save the figure if you want to.

import pandas as pd
import seaborn as sns

df = pd.DataFrame(data=deep_compressor.model.history)
sns.set(style="darkgrid")
# sns.lineplot returns a matplotlib Axes, not a Figure
ax = sns.lineplot(x='cap', y='acc', data=df)
ax.set(ylabel='Generalization', xlabel='Memory Equivalent Capacity')
ax.get_figure().savefig('GC_curve.jpg')

Look at FinalScript.pdf to see what the entire process looks like with 2 epochs per training cycle.

Versioning

We use PyPI for versioning.

Authors

Varun Murthy ([email protected])

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Acknowledgments

Hat tip to Dr. Gerald Friedland ([email protected]) of the Department of EECS, University of California at Berkeley.
