We first implement from scratch the Frankle and Carbin's Lottery Ticket hypothesis for CNNs with CIFAR10 dataset. We then make a few interesting observations about lottery tickets.
Lottery ticket hypothesis: A randomly-initialized, dense neural network contains a sub-network that is initialized such that when trained in isolation it can match the test accuracy of the original network after training for at most the same number of iterations. These sub-networks are also referred to as winning tickets.
- Trainability on unseen data: Train the network on half of the dataset and find a winning ticket and then re-train on the other half and check the test accuracy. Repeat for 4,10,40,80% splits. We observe that winning tickets identified on the first half exhibit lottery ticket pattern on the second unseen half.
- Faster winning tickets: Using subsets of training data will provide fast and efficient ways to identify winning tickets which continue to exhibit the lottery ticket behavior that matches the behavior for a winning ticket discovered using the full dataset. Furthermore, a winning ticket found from a subset of dataset will converge faster than that found from the entire dataset.
The full report is here.