Light

adam9500370 / kaggle-hpa Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 1.0 17 KB

Kaggle Human Protein Atlas Image Classification Challenge (top 7%)

Home Page: https://www.kaggle.com/c/human-protein-atlas-image-classification

Python 100.00%

pytorch kaggle-competition multi-label-image-classification

kaggle-hpa's Introduction

Kaggle-HPA

PyTorch implementation for Kaggle Human Protein Atlas Image Classification Challenge

Training

Model
- ResNet18 backbone (pretrained on ImageNet)
- Small decoder for image reconstruction
Input size: (512, 512, 4) (RGBY)
Augmentation
- Random horizontal flip
- Random vertical flip
- Random affine transformation (rotation, translation, scale)
Batch size: 40
Optimizer: SGD
Learning rate: cosine annealing from 5e-2 to 4e-4 (1 cycle)
Weight decay: 1e-4
Loss functions
- Binary Cross Entropy (BCE)
  - Log-damped class frequency pos_weights, without oversampling rare data
- Mean Square Error (MSE) for image reconstruction (unsupervised)
  - To learn a good feature representation as an auxiliary signal
- Prediction for # of class co-occurrence (COOC)
  - Get higher confidence of predictions with higher threshold
5 folds (random split), 20 epochs for each fold

Final prediction

The mean of the predictions of 5-fold model with 8 TTA (transpose + flip)
Threshold: 0.4
Score (with data leak): public LB 0.568 / private LB 0.516 (138th / 2236)

Requirements

pytorch 0.4.1
torchvision 0.2.1
numpy
opencv
scikit-learn
pandas
tqdm

pip install -r requirements.txt

Usage

Data

Download data from Kaggle HPA competition
Extract train/test.zip
Download HPAv18 external data and HPAv18RGBY_WithoutUncertain_wodpl.csv
Put the external data into the corresponding external directory
Modify the path appropriately in config.json
Run PYTHONPATH=. python loaders/hpa_loader.py first to generate training/validation data lists

To train/test the model, create final submission

python [train, test, merge].py -h for more details

kaggle-hpa's People

Contributors

Stargazers

Watchers

Forkers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.