Giter VIP home page Giter VIP logo

mixturedenseregression's Introduction

Mixture Dense Regression for Object Detection and Human Pose Estimation (CVPR 2020)

Abstract

Mixture models are well-established learning approaches that, in computer vision, have mostly been applied to inverse or ill-defined problems. However, they are general-purpose divide-and-conquer techniques, splitting the input space into relatively homogeneous subsets in a data-driven manner. Not only ill-defined but also well-defined complex problems should benefit from them. To this end, we devise a framework for spatial regression using mixture density networks. We realize the framework for object detection and human pose estimation. For both tasks, a mixture model yields higher accuracy and divides the input space into interpretable modes. For object detection, mixture components focus on object scale, with the distribution of components closely following that of ground truth the object scale. This practically alleviates the need for multi-scale testing, providing a superior speed-accuracy trade-off. For human pose estimation, a mixture model divides the data based on viewpoint and uncertainty -- namely, front and back views, with back view imposing higher uncertainty. We conduct experiments on the MS COCO dataset and do not face any mode collapse.

For questions, please contact me at [email protected].

Acknoledgement

Our repo is forked from the amazing codebase of the Object as Points paper

Installation

1- Fiest use mixturedense.yml to reproduce the exact Anaconda environment that we have used for our experiments:

conda env create -f mixturedense.yml

To activate the environment:

source activate mixturedense

2- Install COCOAPI

3- Compile deformable convolutional conda env create -f environment.yml(from DCNv2).

cd src/lib/models/networks/DCNv2
./make.sh

Train

To train models from scratch see sample comands at experiments

Tets

To test the models for detction and pose estimation on a images (stored in a directory) use the inference_ctdet.py and inference_pose.py scripts, respectively

License

Citation

@article{varamesh2019mixture,
  title={Mixture Dense Regression for Object Detection and Human Pose Estimation},
  author={Varamesh, Ali and Tuytelaars, Tinne},
  journal={arXiv preprint arXiv:1912.00821},
  year={2019}
}

mixturedenseregression's People

Contributors

alivaramesh avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.