Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Claudio Gennaro, Fabrizio Falchi. The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding. CVPR 2024. [paper][code]
2023
MIC: Zhao Wang, Aoxue Li, Fengwei Zhou, Zhenguo Li, Qi Dou. Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization. BMVC 2023. [paper]
EdaDet: Cheng Shi, Sibei Yang. EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment. ICCV 2023. [paper]
Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy. Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. ICCV 2023. [paper][code]
Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng. What Makes Good Open-Vocabulary Detector: A Disassembling Perspective. KDD workshop 2023. [paper]
Prannay Kaul, Weidi Xie, Andrew Zisserman. Multi-Modal Classifiers for Open-Vocabulary Object Detection. ICML 2023. [paper][code]
OpenSeeD: Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang. A Simple Framework for Open-Vocabulary Segmentation and Detection. arXiv 2023. [paper][code]
Relja Arandjelović, Alex Andonian, Arthur Mensch, Olivier J. Hénaff, Jean-Baptiste Alayrac, Andrew Zisserman. Three Ways to Improve Feature Alignment for Open Vocabulary Eetection. arXiv 2023. [paper]
MEDet: Peixian Chen, Kekai Sheng, Mengdan Zhang, Yunhang Shen, Ke Li, Chunhua Shen. Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization. arXiv 2022. [paper][code]
LocOV: Maria A. Bravo, Sudhanshu Mittal, Thomas Brox. Localized Vision-Language Matching for Open-vocabulary Object Detection. DAGM German Conference on Pattern Recognition (GCPR) 2022. [paper][code]
Object-Centric-OVD: Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, Fahad Shahbaz Khan. Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection. NeurIPS 2022. [paper][code]
VL-PLM: Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, Vijay Kumar B.G, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris Metaxas. Exploiting Unlabeled Data with Vision and Language Models for Object Detection. ECCV 2022. [paper][code]
PromptDet: Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma. PromptDet: Towards Open-vocabulary Detection using Uncurated Images. ECCV 2022. [paper][code]