Giter VIP home page Giter VIP logo

srn_multilabel's Introduction

Spatial Regularization Network

This repository contains testing code and trained models for

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang, "Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification", CVPR 2017. pdf.

Directories and Files

  • caffe/: an early version of Yuanjun Xiong's caffe, with OpenMPI-based Multi-GPU support.
  • tools/: demo code for model testing and evaluation.
  • run_test.sh: script for model testing.
  • evaluation.m: matlab code for classification results evaluation.

Prepare data

  • Download train/test split files for NUS-WIDE, MS-COCO and WIDR-Attribute, and extract it to datasets/.
  • Download dataset images
    • NUS-WIDE: this dataset contains many untagged images, and some download links are invalid now. By removing invalid and untagged images, we finally get 119,986 images for training and 80,283 images for testing.
    • MS-COCO_2014: 82,783 images in "train2014" for training, and 40,504 images in "val2014" for testing.
    • WIDER-Attribute: original images are provided here, cropped images for each human bounding box can be downloaded here. 28,340 cropped images in "train" and "val" for training, 29,177 cropped images in "test" for testing.
  • Download trained models, and extract it to models/
    • models, containing:
      • trained models for NUS-WIDE, MS-COCO and WIDR-Attribute.
      • a ResNet-101 model pretrained on ImageNet.
  • (Optional) Download reference classification results, and extract it to results/.

Build

See Yuanjun Xiong's github for building this version of caffe.

Run Test

  • Edit run_test.sh
    • uncomment to specify settings of one dataset.
    • modify variable "ROOT": the root directory holding images of each dataset.
    • modify parameters of "--gpus" to specify available gpus for testing.
  • Edit tools/model_test.py
    • add "path to your caffe" to the search path of python at line 4.

Evaluation

srn_multilabel's People

Watchers

wyz avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.