CVPR'17 PointNet Slides

PointNet is a deep learning model that can directly take unordered point clouds as input and output classifications or segmentations. It achieves permutation invariance through symmetric functions and learns an embedding that aligns different shapes. PointNet achieves state-of-the-art results on 3D classification and segmentation benchmarks while being robust to missing data. It provides a unified framework for various 3D tasks like classification, segmentation, and scene parsing.


PointNet: Deep Learning on Point Sets for

3D Classification and Segmentation

Charles R. Qi*
Hao Su*
Kaichun Mo
Leonidas J. Guibas
Big Data + Deep Representation Learning

Robot Perception (source: Scott J Grunewald) · Augmented Reality (source: Google Tango) · Shape Design (source: solidsolutions)

Emerging 3D Applications
Need for 3D Deep Learning!


3D Representations

Point Cloud · Mesh · Volumetric · Projected View · RGB(D)
3D Representation: Point Cloud

Point cloud is close to raw sensor data (LiDAR, depth sensors).

Point cloud is canonical.

[figure: a point cloud alongside mesh, volumetric, and depth-map representations]
Previous Works
Most existing point cloud features are handcrafted for specific tasks.

Source: https://github.com/PointCloudLibrary/pcl/wiki/Overview-and-Comparison-of-Features
Previous Works

Point cloud is converted to other representations before it's fed to a deep neural network.

Conversion            → Deep Net
Voxelization          → 3D CNN
Projection/Rendering  → 2D CNN
Feature extraction    → Fully Connected


Research Question:

Can we achieve effective feature learning directly on point clouds?
Our Work: PointNet

End-to-end learning for scattered, unordered point data

Unified framework for various tasks:
  Object Classification
  Object Part Segmentation
  Semantic Scene Parsing
  ...
Challenges

Unordered point set as input


Model needs to be invariant to N! permutations.

Invariance under geometric transformations


Point cloud rotations should not alter classification results.
Unordered Input

Point cloud: N orderless points, each represented by a D-dim vector.

[figure: two N×D arrays with permuted rows represent the same point set]

Model needs to be invariant to N! permutations.


Permutation Invariance: Symmetric Function

f(x_1, x_2, …, x_n) ≡ f(x_{π(1)}, x_{π(2)}, …, x_{π(n)}),  x_i ∈ ℝ^D
Examples:

f(x_1, x_2, …, x_n) = max{x_1, x_2, …, x_n}
f(x_1, x_2, …, x_n) = x_1 + x_2 + … + x_n

How can we construct a family of symmetric functions by neural networks?
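The two example functions can be checked numerically. A minimal sketch, using the four example points that recur throughout these slides:

```python
import numpy as np

# The four example points used throughout these slides (n = 4, D = 3).
points = np.array([[1, 2, 3], [1, 1, 1], [2, 3, 2], [2, 3, 4]], dtype=float)
perm = np.random.default_rng(0).permutation(len(points))
shuffled = points[perm]

# Element-wise max and sum over the point axis are symmetric functions:
# reordering the rows cannot change the output.
assert np.array_equal(points.max(axis=0), shuffled.max(axis=0))
assert np.array_equal(points.sum(axis=0), shuffled.sum(axis=0))
print(points.max(axis=0))  # -> [2. 3. 4.]
```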
Permutation Invariance: Symmetric Function

Observe:
f(x_1, x_2, …, x_n) = γ ∘ g(h(x_1), …, h(x_n)) is symmetric if g is symmetric.

[figure: points (1,2,3), (1,1,1), (2,3,2), (2,3,4) pass through a shared h; a simple symmetric function g aggregates them; γ maps the result to the output — PointNet (vanilla)]

Permutation Invariance: Symmetric Function

What symmetric functions can be constructed by PointNet?

[figure: the functions realizable by PointNet (vanilla) shown within the set of symmetric functions]
Universal Set Function Approximator

Theorem:
A Hausdorff continuous symmetric function f : 2^X → ℝ can be arbitrarily approximated by PointNet.

[figure: a point set S ⊆ ℝ^d fed into PointNet (vanilla)]
Basic PointNet Architecture

Empirically, we use multi-layer perceptron (MLP) and max pooling:

[figure: each point (1,2,3), (1,1,1), (2,3,2), (2,3,4) goes through a shared MLP (h); max pooling serves as g; a final MLP serves as γ — PointNet (vanilla)]
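The vanilla architecture above can be sketched in a few lines. Randomly initialized weights stand in for trained ones here, and the layer sizes (3→64→1024 for h, 1024→256→40 for γ) are illustrative rather than the exact configuration:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp(x, weights):
    # Fully connected layers with ReLU in between, shared across points.
    for W in weights[:-1]:
        x = np.maximum(x @ W, 0.0)
    return x @ weights[-1]

# Illustrative, randomly initialized weights (not the paper's trained model).
h_weights = [rng.normal(size=s) for s in [(3, 64), (64, 1024)]]
g_weights = [rng.normal(size=s) for s in [(1024, 256), (256, 40)]]

def pointnet_vanilla(points):
    per_point = mlp(points, h_weights)   # h: shared per-point embedding
    global_feat = per_point.max(axis=0)  # g: max pooling (the symmetric part)
    return mlp(global_feat, g_weights)   # gamma: global feature -> class scores

pts = rng.normal(size=(128, 3))
scores = pointnet_vanilla(pts)
shuffled_scores = pointnet_vanilla(pts[rng.permutation(len(pts))])
assert np.allclose(scores, shuffled_scores)  # invariant to point order
```

Because the only interaction across points is the max pooling, permuting the input rows provably cannot change the output.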


Challenges

Unordered point set as input


Model needs to be invariant to N! permutations.

Invariance under geometric transformations


Point cloud rotations should not alter classification results.
Input Alignment by Transformer Network

Idea: data-dependent transformation for automatic alignment

[figure: a T-Net predicts transform params from the N×3 input; a transform module applies them, yielding transformed N×3 data]
Input Alignment by Transformer Network

The transformation is just matrix multiplication!

[figure: the T-Net outputs 3×3 transform params; the N×3 input is matrix-multiplied by them to give the transformed data]
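A sketch of the transform step. In PointNet the 3×3 matrix is predicted from the input by the T-Net; a fixed random matrix stands in for that prediction here:

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for the T-Net output: in PointNet this 3x3 matrix is predicted
# from the input cloud itself by a small network.
transform = rng.normal(size=(3, 3))

points = rng.normal(size=(1024, 3))  # N x 3 input cloud
transformed = points @ transform     # the whole alignment step is one matmul
assert transformed.shape == (1024, 3)
```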
Embedding Space Alignment

The same idea applies in feature space: a T-Net predicts a 64×64 transform, and the input embeddings (N×64) are matrix-multiplied by it to give transformed embeddings (N×64).

Regularization: keep the 64×64 transform matrix A close to orthogonal:

L_reg = ‖I − A Aᵀ‖²_F
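The orthogonality regularizer can be sketched directly from its formula (shown on 2×2 matrices for brevity):

```python
import numpy as np

def orthogonality_penalty(A):
    # Frobenius-norm regularizer ||I - A A^T||_F^2 pushing A toward orthogonality.
    diff = np.eye(A.shape[0]) - A @ A.T
    return float((diff * diff).sum())

# A rotation (orthogonal) incurs essentially zero penalty...
theta = 0.3
rotation = np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])
assert orthogonality_penalty(rotation) < 1e-12
# ...while a non-orthogonal scaling is penalized: ||I - 4I||_F^2 = 18.
assert orthogonality_penalty(2 * np.eye(2)) == 18.0
```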
PointNet Classification Network

[figure: full classification architecture — input transform, shared MLPs, feature transform, max pooling into a global feature, and a final MLP producing k class scores]
Extension to PointNet Segmentation Network

[figure: per-point local embeddings are concatenated with the global feature; shared MLPs then produce per-point scores]
Results
Results on Object Classification

[figure: accuracy comparison with prior methods, including 3D CNNs]

dataset: ModelNet40; metric: 40-class classification accuracy (%)

Results on Object Part Segmentation

dataset: ShapeNetPart; metric: mean IoU (%)


Results on Semantic Scene Parsing

[figure: input point clouds and output semantic labels]

dataset: Stanford 2D-3D-S (Matterport scans)


Robustness to Data Corruption

Less than 2% accuracy drop with 50% missing data.

dataset: ModelNet40; metric: 40-class classification accuracy (%)

[figure: accuracy vs. amount of missing data, compared with a 3D CNN]

Why is PointNet so robust to missing data?
Visualizing Global Point Cloud Features

[figure: n×3 input → shared MLP → n×1024 per-point features → max pool → 1024-dim global feature]

Which input points are contributing to the global feature? (critical points)
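Because max pooling picks one contributing point per feature dimension, the critical point set can be read off with an argmax. A sketch with a randomly initialized shared layer standing in for the trained MLP (sizes are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical per-point features: a random shared layer stands in for the
# trained MLP; 512 points, 64 feature dims (illustrative sizes).
points = rng.normal(size=(512, 3))
W = rng.normal(size=(3, 64))
features = np.maximum(points @ W, 0.0)  # ReLU features, 512 x 64

# Max pooling selects, per feature dim, a single contributing point.
global_feature = features.max(axis=0)
critical_idx = np.unique(features.argmax(axis=0))  # the critical point set

# Removing a non-critical point leaves the global feature unchanged.
noncritical = np.setdiff1d(np.arange(len(points)), critical_idx)
mask = np.ones(len(points), dtype=bool)
mask[noncritical[0]] = False
assert np.allclose(features[mask].max(axis=0), global_feature)
```

This is why dropping points rarely hurts: only the (small) critical set determines the global feature.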
Visualizing Global Point Cloud Features

[figure: original shapes and their critical point sets]
Visualizing Global Point Cloud Features

Which points won't affect the global feature?


Visualizing Global Point Cloud Features

[figure: original shapes, their critical point sets, and their upper-bound sets]
Visualizing Global Point Cloud Features (OOS)

[figure: further shapes with their critical point sets and upper-bound sets]

Conclusion
• PointNet is a novel deep neural network that directly consumes point clouds.
• A unified approach to various 3D recognition tasks.
• Rich theoretical analysis and experimental results.

Code & Data Available!


http://stanford.edu/~rqi/pointnet

See you at Poster 9!


Thank you!
THE END
Speed and Model Size

Inference time: 11.6 ms / 25.3 ms (GTX 1080, batch size 8)


Permutation Invariance: How about Sorting?

"Sort" the points before feeding them into a network.

Unfortunately, there is no canonical order in high-dimensional space.

Example: lexsorting (1,2,3), (1,1,1), (2,3,2), (2,3,4) gives (1,1,1), (1,2,3), (2,3,2), (2,3,4) before the MLP.

Multi-Layer Perceptron (ModelNet shape classification):
  Unordered Input      12% accuracy
  Lexsorted Input      40% accuracy
  PointNet (vanilla)   87% accuracy
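The lexsort in the example above can be reproduced with numpy (keys are given least-significant first):

```python
import numpy as np

points = np.array([[1, 2, 3], [1, 1, 1], [2, 3, 2], [2, 3, 4]])

# Lexicographic sort: order rows by the first coordinate, breaking ties with
# later coordinates. np.lexsort takes keys from least to most significant.
order = np.lexsort((points[:, 2], points[:, 1], points[:, 0]))
print(points[order].tolist())
# -> [[1, 1, 1], [1, 2, 3], [2, 3, 2], [2, 3, 4]]
```

Sorting yields *some* canonical order in this toy case, but as the accuracy table shows, it is a poor substitute for a symmetric architecture.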
Permutation Invariance: How about RNNs?

Train an RNN with permutation augmentation.

However, the RNN forgets and order matters.

[figure: an LSTM consuming shared-MLP embeddings of (1,2,3), (1,1,1), (2,3,2), (2,3,4) in sequence]

LSTM Network (ModelNet shape classification):
  LSTM                 75% accuracy
  PointNet (vanilla)   87% accuracy
PointNet Classification Network

ModelNet40 Accuracy:
  PointNet (vanilla)        87.1%
  + input 3x3               87.9%
  + feature 64x64           86.9%
  + feature 64x64 + reg     87.4%
  + both                    89.2%
Visualizing Point Functions

Compact view: a point function maps a 1×3 point through FCs to a 1×1024 feature.
Expanded view: FC(64) → FC(64) → FC(64) → FC(128) → FC(1024)

Which input point will activate neuron X?
Find the top-K points in a dense volumetric grid that activate neuron X.

[figure: activation regions of individual point-function neurons]
