SAM-RBox

This is an implementation of SAM (Segment Anything Model) for generating rotated bounding boxes with MMRotate, which is a comparison method of H2RBox-v2: Boosting HBox-supervised Oriented Object Detection via Symmetric Learning.

NOTE: This project has been involved into OpenMMLab's new repo PlayGround. For more details, please refer to this.

Recently, SAM has demonstrated strong zero-shot capabilities by training on the largest segmentation dataset to date. Thus, we use a trained horizontal FCOS detector to provide HBoxes into SAM as prompts, so that corresponding Masks can be generated by zero-shot, and finally the rotated RBoxes are obtained by performing the minimum circumscribed rectangle operation on the predicted Masks. Thanks to the powerful zero-shot capability, SAM-RBox based on ViT-B has achieved 63.94%. However, it is also limited to the time-consuming post-processing, only 1.7 FPS during inference.

Prepare Env

The code is based on MMRotate 1.x and official API of SAM.

Here is the installation commands of recommended environment.

pip install torch==1.10.1+cu111 torchvision==0.11.2+cu111 -f https://download.pytorch.org/whl/cu111/torch_stable.html

pip install openmim
mim install mmengine 'mmcv>=2.0.0rc0' 'mmrotate>=1.0.0rc0'

pip install git+https://github.com/facebookresearch/segment-anything.git
pip install opencv-python pycocotools matplotlib onnxruntime onnx

Note

Prepare DOTA data set according to MMRotate doc.
Download the detector weight from MMRotate model zoo.
python main_sam_dota.py prompts SAM with HBox obtained from annotation file (such as DOTA trainval).
python main_rdet-sam_dota.py prompts SAM with HBox predicted by a well-trained detector for non-annotated data (such as DOTA test).
Many configs, including pipeline (i.e. transforms), dataset, dataloader, evaluator, visualizer, are set in data.py.
You can change the detector config and the corresponding weight path in main_rdet-sam_dota.py to any detector that can be built with MMRotate.

Citation

@article{yu2023h2rboxv2,
  title={H2RBox-v2: Boosting HBox-supervised Oriented Object Detection via Symmetric Learning},
  author={Yu, Yi and Yang, Xue and Li, Qingyun and Zhou, Yue and Zhang, Gefan and Yan, Junchi and Da, Feipeng},
  journal={arXiv preprint arXiv:2304.04403},
  year={2023}
}

@inproceedings{yang2023h2rbox,
  title={H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection},
  author={Yang, Xue and Zhang, Gefan and Li, Wentong and Wang, Xuehui and Zhou, Yue and Yan, Junchi},
  booktitle={International Conference on Learning Representations},
  year={2023}
}

@article{kirillov2023segany,
  title={Segment Anything}, 
  author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C. and Lo, Wan-Yen and Doll{\'a}r, Piotr and Girshick, Ross},
  journal={arXiv:2304.02643},
  year={2023}
}

Other awesome SAM projects:

Grounded-Segment-Anything
Zero-Shot Anomaly Detection
EditAnything: ControlNet + StableDiffusion based on the SAM segmentation mask
IEA: Image Editing Anything
sam-with-mmdet (mmdet 3.0.0, provide RTMDet)
Prompt-Segment-Anything (mmdet 3.0.0, H-DETR, DINO, Focal backbone) ......

li-qingyun / sam-mmrotate Goto Github PK

sam-mmrotate's Introduction

SAM-RBox

Prepare Env

Note

Citation

Other awesome SAM projects:

sam-mmrotate's People

Contributors

Stargazers

Watchers

Forkers

sam-mmrotate's Issues

Suggestion - Integrate MobileSAM into the pipeline for lightweight and faster inference

这个可以加上tacking吗？

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent