RetinaNet_pytorch

A Python3.5/Pytroch implementation of RetinaNet: Focal Loss for Dense Object Detection. And the official implementations are available here. Besides, special thanks for those two repositories：

Prerequisites

python 3.5.x
pytorch 0.4.1
tensorboardX
pillow
scipy
numpy
matplotlib
easydict

Results

mAP

Backbone: ResNet50

VOC2007	LUNA16
76.6	65.4

Acc and Loss

The training loss and accuracy :

Detection Results

VOC2007 detection results:

LUNA16: lung nodules detection. The mAP for different anchor scale and aspect ratios:

Anchor_Scale	Anchor_Size	Aspect_ratio	mAP
4	(32, 64, 128, 256, 512)	(1.0, 2.0, 0.5)	46.2
1	(8, 16, 32, 64, 128)	(1.0, 2.0, 0.5)	54.1
1	(8, 16, 32, 64, 128)	(1.0)	56.5
1	(8, 16, 32, 64, 128)	(1.0, 1.2, 0.8)	65.4

LUNA16 detection results:

Repo Organization

RetinaNet: neural networks and components that form parts of RetinaNet.
config: define configuration information of Faster RCNN.
data: scripts for creating, downloading, organizing datasets.
loss: implementation of focal loss.
pretrained_model: get and store pretrained ResNet model.
targets: generate anchors and calculate targets.
utils: tools package, containing some necessary functions.

Installation

Clone this repository (RetinaNet_pytorch):

 git clone --recursive https://github.com/Jacqueline121/RetinaNet_pytorch.git

Install dependencies:

 cd RetinaNet_pytorch 
 pip install -r requirements.txt

Train

Prepare the Data

For PASCAL VOC, you can follow the instructions in this repository to download the data. And then, you can store date according the following structure:

|+-- data    
|   |+-- dataset    
|       |+-- VOC2007    
|           |+-- Annotations    
|               |+-- xxxx.xml    
|           |+-- Cache    
|           |+-- ImageSets    
|           |+-- JPEGImages    
|           |+-- Results    
|       |+-- VOC2012    
|           |+-- Annotations    
|               |+-- xxxx.xml    
|           |+-- Cache    
|           |+-- ImageSets    
|           |+-- JPEGImages    
|           |+-- Results

Annotations: store annotaion information(.xml file) for each images.
Cache: store annotaion cache.
ImageSets: store training dataset and testing dataset(.txt file) with the format:
JPEGImages: store images.
Results: store detection results.

You can also use your own dataset as long as you follow the file structure desribed above to store the data.

Get pretrained model

Download the pretrained ResNet model: ResNet50, ResNet101.
Put the pretrained model in $PROJECT/pretrained_model
cd $PROJECT/pretrained_model
```
 python get_pretrained_model.py
```
It will produce a 'model.pth' file.

Train

python train.py --dataset='Dataset_Name'

For example: python train.py --dataset='VOC2007'

Test

python test.py --dataset='VOC2007'

If you want to visualize the detection result, you can use:

python test.py --vis

jacqueline121 / retinanet_pytorch Goto Github PK