Giter VIP home page Giter VIP logo

pcgcv2's Introduction

Multiscale Point Cloud Geometry Compression

โ€‹ We apply an end-to-end learning framework to compress the 3D point cloud geometry (PCG) efficiently. Leveraging the sparsity nature of point cloud, we introduce the multiscale structure to represent native PCG compactly, offering the hierarchical reconstruction capability via progressive learnt re-sampling. Under this framework, we devise the sparse convolution-based autoencoder for feature analysis and aggregation. At the bottleneck layer, geometric occupancy information is losslessly encoded with a very small percentage of bits consumption, and corresponding feature attributes are lossy compressed.

News

  • 2021.1.1 Our paper has been accepted by DCC2021! [paper] [presentation]
  • 2021.2.25 We have updated MinkowskiEngine to v0.5. The bug on GPU is fixed. And the encoding and decoding runtime is reduced.

Framework

introduction

Multiscale PCG Compression: (a) "Conv c*n^3" denotes the sparse convolution with 'c' output channels and n^3 kernel size, "Q" stands for Quantization, "AE" and "AD" are Arithmetic Encoder and Decoder respectively. "IRN" is Inception-Residual Network used for efficient feature aggregation. (a) network structure of IRN unit, (c) adaptive contexts conditioned on autoregressive priors. introduction

โ€‹ Binary classification based hierarchical reconstruction: The top part shows the encoding process: (a), (b), (c), (d) are exemplified from a size of 32^3 to 4^3, by halving each geometric dimension scale step-by-step; The bottom part illustrates corresponding hierarchical reconstructions, geometric models are upscaled and classified gradually from the rightmost to the leftmost position: (e), (g), (i) are convolutionally upscaled from lower scales with size of 8^3, 16^3 and 32^3. Different colors are used to differentiate the probabilities of voxel-being-occupied (i.e., the greener, the closer to 1, and the bluer, the closer to 0); and (f), (h), (j) are the reconstructions after classification with green blocks for true classified voxels, blue for false positive, and red for false negative voxels.

Requirments

Usage

Training

 python train.py --dataset='training_dataset/' --dataset_8i = 'testdata/8iVFB/'

Testing

sudo chmod 777 utils/tmc3
python eval.py --filedir='testdata/8iVFB/redandblack_vox10_1550.ply' --ckptdir='ckpts/c8_a2_32000.pth'

or test all data

python eval.py --test_all

Examples

demo.ipynb

Comparison

Objective Comparison

See results.ipynb

Qualitative Evaluation

introduction

Update

  • 2020.06 paper submission.
  • 2020.10.29 open source code.
  • 2021.02.25 bug fixed.

Authors

These files are provided by Nanjing University Vision Lab. And thanks for the help from Prof. Dandan Ding from Hangzhou Normal University and Prof. Zhu Li from University of Missouri at Kansas. Please contact us ([email protected] and [email protected]) if you have any questions.

pcgcv2's People

Contributors

yydlmzyz avatar xtorker avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.