Giter VIP home page Giter VIP logo

thesis_capsulenetwork_satelliteimage's Introduction

Application of Capsule Network for land cover classification in satellite images

Aim

The work aims at exploration of one of the interesting current advances in neural networks to land cover classification in satellite images.

Overview

  1. Capsule net or CapsNet is a new deep learning technique for image classification.
  2. Sara Sabour, Nicholas Frost and Geoffrey Hinton proposed it in a paper titled “Dynamic Routing Between Capsules”.
  3. It solves several problems Convolutional Neural Networks or CNN cannot solve.
  4. CNN is good for classifying images that are close to the training set.
  5. CapsNet gives relatively better results in MNIST dataset.

Data

We extracted data from Sentinel-2 space mission of European Space Agency(ESA). We used Sentinel-1C, top-of-atmosphere reflectances in cartografic geometry. Data belong to zone code UTM zone 34N and acquired in Sentinel-2 tiling grid T34UED. Official tile id is S2A_OPER_MSI_L1C_TL_SGS__20161017T133123_A006897_T34UED_N02.04 and data strip id is S2A_OPER_MSI_L1C_DS_SGS__20161017T133123_S20161017T094431_N02.04. The bands originally having resolution coarser than 10m (i.e. 20 and 60m) had been oversampled to 10m.

  • 13 bands(channels) in total
  • 2 :Blue
  • 3 Green
  • 4 Red
  • Fewer data than regular (404 for training and 793 for testing)
  • Pixel values range between 0 and 28000
  • Needed to be cut
  • Image sizes of training data is 9x9 and 32x32
  • Both 3 band and 13 band are used in training and testing

Algortihms :

We used VGGNet and AlexNet to compare results. Due to image sizes, paddings are set to same. For Capsule Network, we only changed kernel sizes in convolutions. We droped it to 3.

Results :

Highest accuracy we got is 90.41% with CapsNet trained with 9x9 image sizes with all 13 bands while closest accuracy received is 89.66% with AlexNet trained with 9x9 image sizes with all 13 bands. When we look at the“EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification” paper, we see that highest accuracy they received is with RGB bands but in out tests we’ve observed that, 13 bands gives better accuracy. They received 98.57% with ResNet-50. Our dataset is considarably small compare to theies. Ours has 1197 images with 4 labels while theirs has 28000 images with 10 labels.

Architectures 9x9x3 32x32x3 9x9x13 32x32x13
AlexNet 78.18 72.76 89.66 85.37
VGGNet A 78.56 79.19 87.89 86.00
VGGNet D 78.81 73.14 88.27 85.88
VGGNet E 76.80 79.32 86.51 84.74
CapsNet 78.31 76.29 90.41 86.25

CapsNet Confusion matrix :

alt text

Few Sources to read

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION ImageNet Classification with Deep Convolutional Neural Networks “Dynamic Routing Between Capsules”. Capsnet-keras Capsnet-tensorflow “EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification”

thesis_capsulenetwork_satelliteimage's People

Contributors

huseyinutkuaslan avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.