PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition

This is the official implementation of [PointMCD] (TMM 2023), which boosts deep 3D point cloud encoders by distilling discriminative cross-modal visual knowledge, extracted from multi-view rendered images, for a variety of 3D shape analysis and recognition applications.

This code has been tested with Python 3.9, PyTorch 1.10.1, CUDA 11.1 and cuDNN 8.0.5 on Ubuntu 20.04.
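
A quick way to compare a local setup against the tested configuration above is sketched below. The function name `report_environment` and the `TESTED` table are illustrative and not part of the repository; PyTorch is imported lazily so the script still runs when it is not installed.

```python
import platform

# Versions listed above as tested; used only for side-by-side comparison.
TESTED = {"python": "3.9", "torch": "1.10.1", "cuda": "11.1"}


def report_environment():
    """Return the locally detected versions (None when unavailable)."""
    found = {"python": platform.python_version(), "torch": None, "cuda": None}
    try:
        import torch  # optional: only needed for the torch/CUDA entries
        found["torch"] = torch.__version__
        found["cuda"] = torch.version.cuda  # None for CPU-only builds
    except ImportError:
        pass
    return found


if __name__ == "__main__":
    for name, version in report_environment().items():
        print(f"{name}: found {version}, tested with {TESTED[name]}")
```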

Usage

[Datasets] Download our pre-processed datasets and put them under the data folder. Alternatively, use our pre-processing code to render multi-view images and compute the corresponding point-wise visibility for your own data.

[Scripts] We provide the training scripts for the different teacher networks and release their pre-trained model parameters under the ckpt/teacher folder. The teacher knowledge is exported in advance; download it from here and put it under the expt folder. Trained student models are stored under the ckpt/student folder.
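
The folder layout described above can be checked before training with a small helper like the one below. This is a minimal sketch: the folder names come from this README, while `missing_dirs` and `ensure_layout` are hypothetical helper names and the paths are assumed to be relative to the repository root.

```python
from pathlib import Path

# Folder names taken from the README; assumed relative to the repository root.
EXPECTED_DIRS = ("data", "ckpt/teacher", "ckpt/student", "expt")


def missing_dirs(root="."):
    """Return the expected folders that do not exist under `root`."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


def ensure_layout(root="."):
    """Create any missing folders so downloads have a place to go."""
    for d in missing_dirs(root):
        (Path(root) / d).mkdir(parents=True, exist_ok=True)
```

Calling `missing_dirs()` from the repository root lists whatever still needs to be downloaded or created.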

PointMCD is a universal plug-in component for generic deep set architectures and can be easily integrated into various deep point cloud encoders, such as PointNet++ and CurveNet, as experimented in our paper. Further efforts to apply PointMCD to more powerful point cloud backbones and richer downstream tasks are warmly welcomed.
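
This README does not spell out the distillation objective. As a rough illustration of how point-wise visibility could tie a point cloud encoder to per-view teacher features, the NumPy sketch below max-pools per-point student features over the points visible in each view and measures a mean squared error against teacher features of the same shape. The function names, array shapes, and the choice of max pooling and MSE are all assumptions made for illustration; consult the paper and the repository for the actual formulation.

```python
import numpy as np


def view_pool(point_feats, visibility):
    """Max-pool per-point features over the points visible in each view.

    point_feats: (N, C) float array of student per-point features.
    visibility:  (V, N) boolean array, True where point n is visible in view v.
    Returns a (V, C) array; views with no visible point yield zeros.
    """
    # Broadcast to (V, N, C) and mask hidden points with -inf before the max.
    masked = np.where(visibility[:, :, None], point_feats[None, :, :], -np.inf)
    pooled = masked.max(axis=1)
    # Replace -inf entries (views with no visible point) with zeros.
    return np.where(np.isfinite(pooled), pooled, 0.0)


def alignment_loss(student_views, teacher_views):
    """Mean squared error between (V, C) student and teacher view features."""
    return float(np.mean((student_views - teacher_views) ** 2))
```

Because the pooling only needs per-point features and a visibility mask, a sketch like this is independent of the backbone that produced the features.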

Citation

If you find our work useful in your research, please consider citing:

@article{zhang2023pointmcd,
  title={PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition},
  author={Zhang, Qijian and Hou, Junhui and Qian, Yue},
  journal={IEEE Transactions on Multimedia},
  year={2023}
}

pointmcd's Issues

Could you please offer the PointNet++ distiller code?

Hello, I'm currently adapting your method to other tasks, but your project only provides PointNet and DGCNN.
Could you please offer the PointNet++ distiller code?
Thanks very much!

A question about hidden point removal

Hi, I recently read your work "PointMCD: Boosting Deep Point Cloud Encoders via Multi-view Cross-modal Distillation for 3D Shape Recognition" and found it very inspiring. I have a question about the hidden point removal algorithm: how is the camera viewpoint designed? The viewpoint setting [0,0,diameter] of the hidden point removal algorithm in Open3D (in neither xyz nor yaw, pitch, roll form) does not match the rendered image parameters, so I would like to know whether you redesigned the camera parameter inputs. I look forward to your reply. Thank you very much.
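
For context, Open3D's `PointCloud.hidden_point_removal(camera_location, radius)` takes the camera as a 3D position in the point cloud's coordinate frame, so a viewpoint specified by angles has to be converted to a position first. The sketch below does that conversion with the standard library only. The z-up convention, the angle names, and the helper `camera_position` are assumptions for illustration; they do not describe the authors' rendering setup.

```python
import math


def camera_position(azimuth_deg, elevation_deg, distance):
    """Convert a viewing direction given as angles into an xyz camera position.

    Assumes a z-up frame with the object centred at the origin: azimuth is
    measured in the xy-plane from the +x axis, elevation from the xy-plane.
    """
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    x = distance * math.cos(el) * math.cos(az)
    y = distance * math.cos(el) * math.sin(az)
    z = distance * math.sin(el)
    return [x, y, z]


# Twelve evenly spaced views at 30 degrees elevation (a hypothetical setup).
cameras = [camera_position(30.0 * k, 30.0, 2.0) for k in range(12)]
```

With an elevation of 90 degrees the result reduces to approximately [0, 0, distance], the same form as the [0,0,diameter] setting mentioned in the question.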

If only MVCNN is used, will the result be better?

First of all, your work is very creative!
But I have a question. In 3D tasks, multi-view methods are better than other methods. In your work, the teacher network boosts the student network. If I only use the trained teacher network for testing, will the result be better than that of the student network?
Thanks!
