Giter VIP home page Giter VIP logo

archsegmentation's Introduction

ArchSegmentation

This is my architecture for semantic segmentation.The proposed model EAFPN adds more attention on the boundary of objects during training.My experiments show that this model is able to improve the segmentation result and suitable for man-made objects with straight boundary.The advantage of EAFPN is it adds no extra computation during inferencing.You may reference to the result of my model to see the difference.

Note: This repo is copy of EAFPN(without model implementation,the original private repo will be released after my paper released)

results

result_with_edge is the output of my proposed model EAFPN, result_without_edge is the output of original FPN."edgexx" means edges detected by net work in different stage and egde4 is the comfusion of each stage. Both model use ResNet-101 as backbone. EAFPN adds no extra computation in inferencing phase

environment

  • pytorch 1.0.0
  • tensorflow
  • tensorboard
  • tensorboardX
  • opencv

performance

RoadSign (Non open dataset)

tested on 2988 1920*1020 images.Our FPN model was first trained on cityscape, then finetuned on our own data collected on Beijing-Shanghai Expressway contains 24000+ 1920*1080 annotated images. EASwiftnet may get a better result after more training.

Methods Backbone TrainSet EvalSet Mean Accuracy Mean IoU IOU back ground IOU roadsign
FPN resnet-50 train val 91.37% 87.93% 99.85% 75.75%
FPN resnet-18 train val 89.57% 84.16% 99.795% 68.525%
EAFPN resnet-50 train val 95.35% 93.10% 99.92% 86.285%
EAFPN resnet-18 tran val 95.11% 92.30%
EASwiftnet* resnet18 train val 92.25% *90.192% 99.92% 80.46%

Inria Aerial Image dataset:

The training set contains 180 color image tiles of size 5000ร—5000, covering a surface of 1500 m ร— 1500 m each (at a 30 cm resolution). The test set contains the same amount of tiles as the training set (but the reference data is not disclosed).

I crop the training data to 500*500 for training, and 1000 500*500 images for validate.All experiments's results are listed as below.

Methods Backbone TrainSet EvalSet Mean Accuracy Mean IoU
FPN ResNet-101 train val 89.60% 83.55%
EAFPN ResNet-101 train val 92.84% 87.60%
EAFPN ResNext 101 train val 92.31% 87.14%
EFPN(edge detection with ASPP) ResNet101 train val 93.40% 88.32%
stcked FPN* ResNet-101 train val 96.77% 89.80%

FPN is the original model of FPN, EAFPN detects boundary to strengthen the segmentation result.

EAFPN(edge detection with ASPP) uses ASPP module to capture multi scale feature map, this model is training. Stacked FPN is a validating experiment to validate if edge detection is helpful to improve the segmentation result.

archsegmentation's People

Contributors

citygity avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.