Giter VIP home page Giter VIP logo

deep_learning_for_camera_trap_images's Introduction

deep_learning_for_camera_trap_images

This repository contains the code used for the following paper:

Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning

Authors: Mohammad Sadegh Norouzzadeh, Anh Nguyen, Margaret Kosmala, Ali Swanson, Meredith Palmer, Craig Packer, Jeff Clune

If you use this code in an academic article, please cite the following paper:

	@article {Norouzzadeh201719367,
		author = {Norouzzadeh, Mohammad Sadegh and Nguyen, Anh and Kosmala, Margaret and Swanson, Alexandra and Palmer, Meredith S. and Packer, Craig and Clune, Jeff},
		title = {Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning},
		year = {2018},
		doi = {10.1073/pnas.1719367115},
		publisher = {National Academy of Sciences},
		issn = {0027-8424},
		URL = {http://www.pnas.org/content/early/2018/06/04/1719367115},
		eprint = {http://www.pnas.org/content/early/2018/06/04/1719367115.full.pdf},
		journal = {Proceedings of the National Academy of Sciences}
	}

Most of the code in this repository is taken from here

This repository has four independent parts:

1- The code used for Task I: Detecting Images That Contain Animals (phase1 folder)

2- The code used for Task II,III, and IV: identifying, counting, and describing animals in images (phase 2 folder)

3- The code used for Task II only, (all the transfer learning experiments for Task II used this part of the repo) (phase2_recognition_only folder)

4- resize.py is used for resizing the input images for all the other parts

For more information on how to use this repo please refer to the base repo at this link

1. Requirements

Requirements

To use this code, you will need to install the following:

  • Python 2.7
  • Tenorflow
  • NumPy
  • SciPy
  • MatPlot Lib

2. Running

Pre-trained models could be found at the following links:

  • Phase 1 (VGG architecture):

http://www.cs.uwyo.edu/~mnorouzz/share/pretrained/phase1.zip

  • Phase 2 (ResNet-152 architecture):

http://www.cs.uwyo.edu/~mnorouzz/share/pretrained/phase2.zip

  • Phase 2 recognition only (ResNet-152 architecture):

http://www.cs.uwyo.edu/~mnorouzz/share/pretrained/phase2_recognition_only.zip

3. Licenses

This code is licensed under MIT License.

4. Questions?

For questions/suggestions, feel free to email, tweet to @arashnorouzzade or create a github issue.

deep_learning_for_camera_trap_images's People

Contributors

arashno avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.