pointmcd's Introduction

PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition

This is the official implementation of PointMCD (TMM 2023), which boosts deep 3D point cloud encoders by distilling discriminative cross-modal visual knowledge, extracted from multi-view rendered images, for a variety of 3D shape analysis and recognition applications.

This code has been tested with Python 3.9, PyTorch 1.10.1, CUDA 11.1 and cuDNN 8.0.5 on Ubuntu 20.04.

Usage

[Datasets] Download our pre-processed datasets and put them under the data folder. Alternatively, use our pre-processing code to render multi-view images and compute the corresponding point-wise visibility for your own data.
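For readers preparing their own data, the following is a minimal sketch of how a ring of camera positions around an object could be generated before rendering or visibility estimation. It is not taken from the PointMCD pre-processing code: the function name `camera_positions`, the number of views, the elevation, the distance, and the z-up convention are all illustrative assumptions.

```python
import math

def camera_positions(num_views=12, elevation_deg=30.0, distance=2.0):
    """Return xyz camera locations evenly spaced in azimuth around the origin.

    All defaults are illustrative assumptions, not PointMCD's actual settings.
    The z axis is assumed to point up.
    """
    elev = math.radians(elevation_deg)
    positions = []
    for k in range(num_views):
        azim = 2.0 * math.pi * k / num_views
        # Spherical-to-Cartesian conversion at a fixed radius and elevation.
        x = distance * math.cos(elev) * math.cos(azim)
        y = distance * math.cos(elev) * math.sin(azim)
        z = distance * math.sin(elev)
        positions.append((x, y, z))
    return positions

views = camera_positions()
```

Each tuple is a world-space xyz location, which is the form expected by hidden-point-removal routines such as Open3D's `PointCloud.hidden_point_removal(camera_location, radius)`. Whether these positions match a given set of rendered images depends on the renderer's own camera conventions.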

[Scripts] We provide training scripts for the different teacher networks and release their pre-trained model parameters under the ckpt/teacher folder. The teacher knowledge is exported in advance; it can be downloaded from here and should be placed under the expt folder. Trained student models are stored under the ckpt/student folder.
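To make the roles of the exported teacher knowledge and the point-wise visibility more concrete, here is a rough sketch of one plausible way to compare a student's per-point features against per-view teacher features. It is an assumption for illustration, not the loss implemented in this repository: the functions `view_descriptors` and `distillation_loss`, the masked max-pooling, the mean squared error, and all array shapes are hypothetical, and NumPy stands in for PyTorch to keep the example light.

```python
import numpy as np

def view_descriptors(point_feats, visibility):
    """Masked max-pool per-point features into one descriptor per view.

    point_feats: (N, C) float array of student features (hypothetical).
    visibility:  (V, N) bool array, True where a point is visible in a view.
    Returns a (V, C) array.
    """
    masked = np.where(visibility[:, :, None], point_feats[None, :, :], -np.inf)
    pooled = masked.max(axis=1)
    # A view that sees no points would yield -inf; fall back to zeros.
    return np.where(np.isfinite(pooled), pooled, 0.0)

def distillation_loss(student_views, teacher_views):
    """Mean squared error between student and teacher view descriptors."""
    return float(np.mean((student_views - teacher_views) ** 2))

rng = np.random.default_rng(0)
point_feats = rng.normal(size=(1024, 64))       # stand-in student features
visibility = rng.random(size=(12, 1024)) > 0.5  # stand-in visibility masks
teacher_views = rng.normal(size=(12, 64))       # stand-in exported teacher features
loss = distillation_loss(view_descriptors(point_feats, visibility), teacher_views)
```

In a real training loop the same idea would be written with differentiable tensor operations so that the loss can be backpropagated into the student encoder.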

As a universal plug-in component for generic deep set architectures, our approach can be easily integrated into various deep point cloud encoders, such as PointNet++ and CurveNet, as experimented in our paper. Further efforts to apply PointMCD to more powerful point cloud backbones and richer downstream tasks are warmly welcomed.

Citation

If you find our work useful in your research, please consider citing:

@article{zhang2023pointmcd,
  title={PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition},
  author={Zhang, Qijian and Hou, Junhui and Qian, Yue},
  journal={IEEE Transactions on Multimedia},
  year={2023}
}

pointmcd's People

Contributors

keeganhk

pointmcd's Issues

A question about hidden point removal

Hi, I recently read your work "PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition" and found it very inspiring. I have a question about the hidden point removal algorithm: how is the camera viewpoint designed? The viewpoint setting [0,0,diameter] used by the hidden point removal algorithm in Open3D (in neither xyz nor yaw, pitch, roll form) does not match the rendered image parameters, so I would like to know whether you redesigned the camera parameter inputs. I look forward to your reply. Thank you very much.

If only MVCNN is used, will the result be better?

First of all, your work is very creative!
But I have a question. In 3D tasks, multi-view methods are better than other methods. In your work, the teacher network boosts the student network. If I use only the trained teacher network for testing, will the result be better than that of the student network?
Thanks!
