Giter VIP home page Giter VIP logo

hassod's Introduction

HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

This is the official PyTorch implementation of our NeurIPS 2023 paper:

HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

[Project Page] [Paper-arXiv] [Paper-OpenReview] [Video-YouTube] [Video-Bilibili]

Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang

πŸ”Ž Overview

HASSOD-gif

HASSOD is a fully self-supervised approach for object detection and instance segmentation, demonstrating a significant improvement over the previous state-of-the-art methods by discovering a more comprehensive range of objects. Moreover, HASSOD understands the part-to-whole object composition like humans do, while previous methods cannot. Notably, we improve class-agnostic Mask AR from 20.2 to 22.5 on LVIS, and from 17.0 to 26.0 on SA-1B.

πŸ› οΈ Instructions

To use our code and reproduce the results, please follow these detailed documents step by step:

  • Preparation: Prepare the environment, data, and pre-trained models
  • Reproduction: Produce pseudo-labels and train the object detector (download links included for our pseudo-labels and model)
  • Demo: Once the preparation is finished, you can try out the demo code and test our model on any image.

πŸ™ Acknowledgements

Our code is developed based on the following repositories:

We greatly appreciate their open-source work!

βš–οΈ License

This project is released under the Apache 2.0 license. Other codes from open source repository follows the original distributive licenses.

🌟 Citation

If you find our research interesting or use our code, data, or model in your research, please consider citing our work.

@inproceedings{cao2023hassod,
    title={{HASSOD}: Hierarchical Adaptive Self-Supervised Object Detection},
    author={Cao, Shengcao and Joshi, Dhiraj and Gui, Liangyan and Wang, Yu-Xiong},
    booktitle={NeurIPS},
    year={2023}
}

hassod's People

Contributors

shengcao-cao avatar

Stargazers

 avatar JichengYUAN avatar Anda Chen avatar  avatar  avatar  avatar Tianyi Xu avatar aiwym avatar spyex avatar Andrey Babaev avatar Siarhei Anfilets avatar  avatar An-zhi WANG avatar  avatar Ecneics avatar Varun Ganjigunte Prakash avatar Evgenii Akhmetgareev avatar  avatar β€Ž  avatar Titrom25 avatar Paolo Manchisi avatar  avatar Mehti Musa(y)ev avatar Vladislav Sorokin avatar Raul G avatar Georgy Gunkin avatar Francesco avatar Γ€lex avatar pascal-maker avatar  avatar Jie X avatar Dhruv Kool Rajamani avatar JYOTHIR ROSE K J avatar Daniel Puglisi avatar Makdoud avatar JiangYingLian avatar tm avatar Jefferson Silva avatar Umit Kacar, PhD avatar Ashwin Venkat avatar VARUN SAKUNIA avatar james.yookh avatar Dickson Neoh avatar Pooya Mohammadi Kazaj avatar Gordon avatar Mauro Sciancalepore avatar Xiyu Zhang avatar

Watchers

Kostas Georgiou avatar  avatar

hassod's Issues

Inference script

Hey nice job! Results seem very interesting.

Is it possible to have an inference script or a gradio demo with all the necessary to run the model?

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.