Giter VIP home page Giter VIP logo

3d-vl-sot's Introduction

3D-VL-SOT

Currently, tasks in this repository include 3D Object Tracking (3DOT) and Vision Language Tracking (VL) and Un/Self-Supervised.

3D Object Tracking

Preprints

  • PTTR++: Zhipeng Luo, Changqing Zhou, Liang Pan, Gongjie Zhang, Tianrui Liu, Yueru Luo, Haiyu Zhao, Ziwei Liu, Shijian Lu. PTTR++: Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer.[Paper] [Code]
  • PCET: Pan Wang, Liangliang Ren, Shengkai Wu, Jinrong Yang, En Yu, Hangcheng Yu, Xiaoping Li. Implicit and Efficient Point Cloud Completion for 3D Single Object Tracking. [Paper] [Code]
  • MBPTrack: Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang. Improving 3D Point Cloud Tracking with Memory Networks and Box Priors. [Paper] [[Code]]
  • StreamTrack: Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Zhonghua Wu, Qingyi Tao, Lewei Lu, Shijian Lu. Modeling Continuous Motion for 3D Point Cloud Object Tracking. [Paper] [[Code]]
  • OPSNet: Kaijie Zhao, Haitao Zhao, Zhongze Wang, Jingchao Peng, Zhengwei Hu. Object Preserving Siamese Network for Single Object Tracking on Point Clouds. [Paper] [[Code]]

2023

  • OSP2B: Jiahao Nie, Zhiwei He, Yuxiang Yang, Zhengyi Bao, Mingyu Gao, Jing Zhang. OSP2B: One-Stage Point-to-Box Network for 3D Siamese Tracking. In IJCAI, 2023. [Paper] [[Code]]
  • CXTrack: Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang. Improving 3D Point Cloud Tracking with Contextual Information. In CVPR, 2023. [Paper] [Code]
  • GLT-T: Jiahao Nie, Zhiwei He, Yuxiang Yang, Mingyu Gao, Jing Zhang. Global-Local Transformer Voting for 3D Single Object Tracking in Point Clouds. In AAAI, 2023. [Paper][Code]

2022

  • CMT: Zhiyang Guo, Yunyao Mao, Wengang Zhou, Min Wang, Houqiang Li. Context-Matching-Guided Transformer for 3D Tracking in Point Clouds. In ECCV, 2022. [Paper] [Code]
  • STNet: Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang. 3D Siamese Transformer Network for Single Object Tracking on Point Clouds. In ECCV, 2022. [Paper] [Code]
  • M2-Track: Chaoda Zheng, Xu Yan, Haiming Zhang, Baoyuan Wang, Shenghui Cheng, Shuguang Cui, Zhen Li. Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds. In CVPR, 2022 oral. [Paper] [Code]
  • PTTR: Changqing Zhou, Zhipeng Luo, Yueru Luo, Tianrui Liu, Liang Pan, Zhongang Cai, Haiyu Zhao, Shijian Lu. PTTR: Relational 3D Point Cloud Object Tracking with Transformer. In CVPR, 2022. [Paper] [Code]
  • GPT: Minseong Park, Hongje Seong, Wonje Jang, Euntai Kim. Graph-Based Point Tracker for 3D Object Tracking in Point Clouds. In AAAI, 2022. [Paper] [Code]
  • 3D DetecTrack: Junho Koh, Jaekyum Kim, Jinhyuk Yoo, Yecheol Kim, Dongsuk Kum, Jun Won Choi. Joint 3D Object Detection and Tracking Using Spatio-Temporal Representation of Camera Image and LiDAR Point Clouds. In AAAI, 2022. [Paper] [Code]

2021

  • MLVSNet: Zhoutao Wang, Qian Xie, Yu-Kun Lai, Jing Wu, Kun Long, Jun Wang. MLVSNet: Multi-level Voting Siamese Network for 3D Visual Tracking. In ICCV, 2021. [Paper] [Code]
  • BAT: Chaoda Zheng, Xu Yan, Jiantao Gao, Weibing Zhao, Wei Zhang, Zhen Li, Shuguang Cui. Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds. In ICCV, 2021. [Paper] [Code]
  • CenterPoint: Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl. Center-based 3D Object Detection and Tracking. In CVPR, 2021. [Paper] [Code]
  • PTT: Jiayao Shan, Sifan Zhou, Zheng Fang, Yubo Cui. PTT: Point-Track-Transformer Module for 3D Single Object Tracking in Point Clouds. In IROS, 2021. [Paper] [Code]
  • lttr: Yubo Cui, Zheng Fang, Jiayao Shan, Zuoxu Gu, Sifan Zhou. 3D Object Tracking with Transformer. In BMVC, 2021. [Paper] [Code]

2020

  • Peiliang Li, Jieqi Shi, Shaojie Shen. Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking. In CVPR, 2020. [Paper] [Code]
  • P2B: Haozhe Qi, Chen Feng, Zhiguo Cao, Feng Zhao, Yang Xiao. P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds. In CVPR, 2020. [Paper] [Code]
  • 3D-ZeF: Malte Pedersen, Joakim Bruslund Haurum, Stefan Hein Bengtson, Thomas B. Moeslund. 3D-ZeF: A 3D Zebrafish Tracking Benchmark Dataset. In CVPR, 2020. [Paper] [Project] [Challenge]
  • F-Siamese: Hao Zou, Jinhao Cui, Xin Kong, Chujuan Zhang, Yong Liu, Feng Wen, Wanlong Li. F-Siamese Tracker: A Frustum-based Double Siamese Network for 3D Single Object Tracking. In IROS, 2020. [Paper] [Code]

Vision Language Tracking

Preprints

stay tuned

2023

  • TransVLT: Haojie Zhao, Xiao Wang, Dong Wang, Huchuan Lu, Xiang Ruan. Transformer vision-language tracking via proxy token guided cross-modal fusion. In PR Letters, 2023. [Paper]

2022

  • ModaMixer: Mingzhe Guo, Zhipeng Zhang, Heng Fan, Liping Jing. Divert More Attention to Vision-Language Tracking. In NeurIPS, 2022. [Paper] [Code]
  • CTRNLT: Yihao Li, Jun Yu, Zhongpeng Cai, and Yuwen Pan. Cross-modal target retrieval for tracking by natural language. In CVPRW, 2022. [paper]

2021

  • CapsuleTNL: Ding Ma, Xiangqian Wu. Capsule-based Object Tracking with Natural Language. In ACMMM, 2021.[Paper]
  • SNLT: Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff. Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers. In CVPR, 2021. [Paper] [Code]
  • TNL2K: Xiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, YaoWei Wang, Yonghong Tian, Feng Wu. Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark. In CVPR, 2021. [Paper] [Code]
  • GTI: Zhengyuan Yang, Tushar Kumar, Tianlang Chen, Jinsong Su, Jiebo Luo. Grounding-tracking-integration. In TCSVT, 2021. [paper]

2020

  • RTNL: Qi Feng, Vitaly Ablavsky, Qinxun Bai, Guorong Li, Stan Sclaroff. Real-time Visual Object Tracking with Natural Language Description. In WACV, 2020. [Paper] [Code]

2017

  • LPN: Zhenyang Li, Ran Tao, Efstratios Gavves, Cees G. M. Snoek, Arnold W.M. Smeulders. Tracking by Natural Language Specification. In CVPR, 2017.[Paper] [Code]

Un-Self-Supervised

Preprints

  • TASST: Xin Li, Wenjie Pei, Zikun Zhou, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang. Self-Supervised Tracking via Target-Aware Data Synthesis. [Paper] [Code]

2022

  • ULAST: Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang. Unsupervised Learning of Accurate Siamese Tracking. In CVPR, 2022. [Paper] [Code]
  • UDAT: Junjie Ye, Changhong Fu, Guangze Zheng, Danda Pani Paudel, Guang Chen. Unsupervised Domain Adaptation for Nighttime Aerial Tracking. In CVPR, 2022. [Paper] [Code]

2021

  • PUL: Qiangqiang Wu, Jia Wan, Antoni B. Chan. Progressive Unsupervised Learning for Visual Object Tracking. In CVPR, 2021. [Paper] [Code]

  • EMUT: Adam W. Harley, Yiming Zuo, Jing Wen, Ayush Mangal, Shubhankar Potdar, Ritwick Chaudhry, Katerina Fragkiadaki. Track, Check, Repeat: An EM Approach to Unsupervised Tracking. In CVPR, 2021. [Paper] [Code]

2020

  • MAST: Zihang Lai, Erika Lu, Weidi Xie. MAST: A Memory-Augmented Self-supervised Tracker. In CVPR, 2020. [Paper] [Code]

2019

  • UDT: Ning Wang, Yibing Song, Chao Ma, Wengang Zhou, Wei Liu, Houqiang Li. Unsupervised Deep Tracking. In CVPR, 2019. [Paper] [Code]
  • LUDT:Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Wei Liu, Houqiang Li. Unsupervised Deep Representation Learning for Real-Time Tracking. In International Journal of Computer Vision Volume 129 Issue 2 Feb 2021. [Paper] [Code]

3d-vl-sot's People

Contributors

haooozi avatar laisimiao avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.