0% found this document useful (0 votes)

14 views51 pages

Lecture PointNet-part1-summary-2

Uploaded by

omargohary2608

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views51 pages

Lecture PointNet-part1-summary-2

Uploaded by

omargohary2608

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 51

PointNet: Deep Learning on Point Sets

for 3D Classification and

Segmentation- Part 1
Big Data+DeepRepresentation
Learning
Robot Perception Augmented Reality Shape
Design

source: Scott J source: Google source:

Grunewald Tango solidsolutions

Emerging 3D Applications
Big Data+DeepRepresentation
Learning
Robot Perception Augmented Reality Shape
Design

source: Scott J source: Google source:

Grunewald Tango solidsolutions

Need for 3D Deep Learning!

3D Representation: Point
Cloud
Point cloud is close to raw sensor data

Point cloud is canonical

Mes
h

LiDA
Volumetr
R
ic

Depth
Point Cloud
Sensor
Depth
Map
Previous
Works
Most existing point cloud features are
handcrafted
towards specific tasks

Source: https://github.com/PointCloudLibrary/pcl/wiki/Overview-and-Comparison-of-Features
Question:

Can we achieve effective feature

learning directly on point
clouds?
Our Work:
PointNet
End-to-end learning for scattered, unordered point
data

PointNet
Our Work:
PointNet
End-to-end learning for scattered, unordered point
data
Unified framework for various tasks

Object Classification

PointNet Object Part Segmentation

Semantic Scene Parsing
...
Our Work:
PointNet
End-to-end learning for scattered, unordered point
data
Unified framework for various tasks
Challenge
s

Unordered point set as input

Model needs to be invariant to N! permutations.

Invariance under geometric transformations

Point cloud rotations should not alter classification
results.
Challenge
s

Unordered point set as input

Model needs to be invariant to N! permutations.

Invariance under geometric transformations

Point cloud rotations should not alter classification
results.
Unordered
Input
Point cloud: N orderless points, each represented
by a D dim vector
D

N
Unordered
Input
Point cloud: N orderless points, each represented
by a D dim vector
D D

N represents the same set N

as
Unordered
Input
Point cloud: N orderless points, each represented
by a D dim vector
D D

N represents the same set N

Model needs to be invariant to N! permutations

Permutation Invariance: Symmetric
Function
f (x1 , x2 ,, xn )  , x ,, x ), xi D

f (x ℝ
1 2 n
Permutation Invariance: Symmetric
Function
f (x1 , x2 ,, xn )  , x ,, x ), xi D

f (x ℝ
1 2 n

Examples:
f (x1 , x2 ,, xn )  max{x1 , x2 ,,
xn }
f (x1 , x2 ,, xn )  x1  x2  xn
…
Permutation Invariance: Symmetric
Function
f (x1 , x2 ,, xn )  , x ,, x ), xi D

f (x ℝ
1 2 n

Examples:
f (x1 , x2 ,, xn )  max{x1 , x2 ,, xn }
f (x1 , x2 ,, xn )  x1  x2  xn
…
How can we construct a family of symmetric
functions by neural networks?
Permutation Invariance: Symmetric
Function
Observe:

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

symmetric
Permutation Invariance: Symmetric
Function
Observe:

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

symmetric

h
(1,2,3)
(1,1,1)
…

(2,3,2)
Permutation Invariance: Symmetric
Function
Observe:

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

symmetric
(1,2,3 simple symmetric
) h function
(1,1,1 g
)
(2,3,2
)
…

(2,3,4
Permutation Invariance: Symmetric
Function
Observe:

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

symmetric
(1,2,3 simple symmetric
) h function
(1,1,1 g 
)
(2,3,2
)
…

(2,3,4 PointNet (vanilla)

)
Permutation Invariance: Symmetric
Function
What symmetric functions can be constructed by
PointNet?

Symmetric functions

PointNet

(vanilla)
Universal Set Function
Approximator
Theorem:
A Hausdorff continuous symmetric function f : 2 X 
ℝ can be arbitrarily approximated by PointNet.

S  ℝd PointNet (vanilla)
Basic PointNet
Architecture
Empirically, we use multi-layer perceptron (MLP) and max
pooling:
h
(1,2,3 MLP
) g
(1,1,1 MLP 
) ma MLP
(2,3,2 MLP
x
)
…

(2,3,4 MLP PointNet (vanilla)

)
Challenge
s

Unordered point set as input

Model needs to be invariant to N! permutations.

Invariance under geometric transformations

Point cloud rotations should not alter classification
results.
Input Alignment by Transformer
Network
Idea: Data dependent transformation for automatic
alignment
3 T- transfor 3
Net m
params
N Transfor N
m

Transform
Dat
ed
a
Data
Input Alignment by Transformer
Network
Idea: Data dependent transformation for automatic
alignment
3 T- transfor 3
Net m
params
N Transfor N
m

Transform
Dat
ed
a
Data
Input Alignment by Transformer
Network
The transformation is just matrix
multiplication!
3 T- transfor 3
Net params:
m
3x3
Matrix
N
Mult.

Transform
Dat
ed
a
Data
Embedding Space
Alignment

transform
T- params:
Net 64x64
Matri
x
Mult
.
Input Transform
embedding ed
s: Nx64 embedding
s: Nx64
Embedding Space
Alignment

transform
T- params: Regularization:
Net 64x64
Matri Transform matrix A
x 64x64 close to
Mult orthogonal:
.
Input Transform
embedding ed
s: Nx64 embedding
s: Nx64
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
Extension to
PointNet Segmentation
Network

local embedding global feature

Extension to
PointNet Segmentation
Network

local embedding global feature

Result
s
Results on Semantic Scene
Parsing
Input
Output

dataset: Stanford 2D-3D-S (Matterport scans)

Visualizing Global Point
Cloud Features
3 MLP 102
4 maxpo
ol
n
share global feature
d

Which input points are contributing to the global

feature?
(critical points)
Visualizing Global Point
Cloud Features
Original
Shape:

Critical Point
Sets:
Visualizing Global Point
Cloud Features
3 MLP 102
4 maxpo
ol
n
share global feature
d

Which points won’t affect the global

feature?
Conclusio
n
• PointNet is a novel deep neural network that
directly consumes point cloud.
• A unified approach to various 3D recognition
tasks.
• Rich theoretical analysis and experimental
results.
Code & Data Available!
http://stanford.edu/~rqi/poi
ntnet

See you at Poster 9!

Permutation Invariance:
How about Sorting?
“Sort” the points before feeding them into a
network.
Unfortunately, there is no canonical order in high
dim space.
lexsorte
(1,2,3 d
) (1,1,1)
(1,1,1 (2,3,2
(1,2,3) MLP
) )
(2,3,2 (2,3,4
) )
(2,3,4
)
Permutation Invariance:
How about Sorting?
“Sort” the points before feeding them into a
network.
Unfortunately, there is no canonical order in high
dim space. Multi-Layer Perceptron
lexsorte (ModelNet shape
d classification)
(1,2,3 Accuracy
) (1,1,1)
(1,1,1 (2,3,2
(1,2,3) MLP Unordered 12%
) ) Input
(2,3,2 (2,3,4 Lexsorted Input 40%
) ) PointNet 87%
(2,3,4 (vanilla)
)
Permutation Invariance:
How about RNNs?
Train RNN with permutation
augmentation. However, RNN
forgets and order matters.
LSTM Network
… (ModelNet shape
LSTM LSTM LSTM LSTM
classification)
Accuracy
MLP MLP MLP … MLP LSTM 75%

PointNet 87%
(1,2,3 (1,1,1 (2,3,2 (2,3,4 (vanilla)
) ) ) )
PointNet Classification
Network

ModelNet40 Accuracy
PointNet (vanilla) 87.1%
+ input 3x3 87.9%
+ feature 64x64 86.9%
+ feature 64x64 + 87.4%
reg
+ both 89.2%
VisualizingPoint
Functions
1x 1x102
Compact 3 FCs 4
View:
1x 1x102
Expanded 3 FC FC FC FC FC 4
View: 6 6 6 12
4 4 4 8

Which input point will activate neuron X?

Find the top-K points in a dense volumetric grid that activates
neuron X.

NNDL Lab Manual
No ratings yet
NNDL Lab Manual
41 pages
Webpdf
No ratings yet
Webpdf
671 pages
Nnet - Ug 1 150 PDF
No ratings yet
Nnet - Ug 1 150 PDF
150 pages
Gen AI Unit 3
No ratings yet
Gen AI Unit 3
52 pages
ML CT Question Paper 2023 24
No ratings yet
ML CT Question Paper 2023 24
2 pages
Neural Network Project Report.
No ratings yet
Neural Network Project Report.
12 pages
AML - Lecture - 11 - 19nov24
No ratings yet
AML - Lecture - 11 - 19nov24
103 pages
Ann Unit 1
No ratings yet
Ann Unit 1
26 pages
2021mini Alexnet
No ratings yet
2021mini Alexnet
73 pages
SNNS and ANNS - Notes
No ratings yet
SNNS and ANNS - Notes
14 pages
التعلم العميق
No ratings yet
التعلم العميق
100 pages
Lecture 3
No ratings yet
Lecture 3
62 pages
GNN - PEter
No ratings yet
GNN - PEter
96 pages
Fei 2024
No ratings yet
Fei 2024
52 pages
Activation Function: Deep Neural Networks
No ratings yet
Activation Function: Deep Neural Networks
47 pages
MPCT Multiscale Point Cloud Transformer With A Residual Network
No ratings yet
MPCT Multiscale Point Cloud Transformer With A Residual Network
12 pages
Thesis Z Ai
No ratings yet
Thesis Z Ai
46 pages
Slides For 'Large Language Model: From Theory To Implementations', Chapter 1
No ratings yet
Slides For 'Large Language Model: From Theory To Implementations', Chapter 1
40 pages
DL Tutorial NIPS2015 PDF
No ratings yet
DL Tutorial NIPS2015 PDF
133 pages
Cvpr17 Pointnet Slides
No ratings yet
Cvpr17 Pointnet Slides
68 pages
LSTM
No ratings yet
LSTM
123 pages
3D Semantic Novelty Detection
No ratings yet
3D Semantic Novelty Detection
21 pages
2022ADeepLearning BasedModelforDateFruitClassification
No ratings yet
2022ADeepLearning BasedModelforDateFruitClassification
17 pages
Chapter 6 Deep Learning Knowledge
No ratings yet
Chapter 6 Deep Learning Knowledge
24 pages
2023 IEEE TNNLS A Survey On Evolutionary Neural Architecture Search
No ratings yet
2023 IEEE TNNLS A Survey On Evolutionary Neural Architecture Search
21 pages
Szegedy - Intriguing Properties of Neural Networks
No ratings yet
Szegedy - Intriguing Properties of Neural Networks
10 pages
SO Net
No ratings yet
SO Net
17 pages
UNIT 4 (MCQS)
No ratings yet
UNIT 4 (MCQS)
13 pages
2021 Deep Polynomial Neural Networks
No ratings yet
2021 Deep Polynomial Neural Networks
14 pages
Pointconv: Deep Convolutional Networks On 3D Point Clouds
No ratings yet
Pointconv: Deep Convolutional Networks On 3D Point Clouds
10 pages
A Closer Look at Rotation-Invariant Deep Point Cloud Analysis
No ratings yet
A Closer Look at Rotation-Invariant Deep Point Cloud Analysis
10 pages
DAC: Deep Autoencoder-Based Clustering, A General Deep Learning Framework of Representation Learning
No ratings yet
DAC: Deep Autoencoder-Based Clustering, A General Deep Learning Framework of Representation Learning
12 pages
Hassani Unsupervised Multi-Task Feature Learning On Point Clouds ICCV 2019 Paper
No ratings yet
Hassani Unsupervised Multi-Task Feature Learning On Point Clouds ICCV 2019 Paper
12 pages
2207.05209 Fourier Neural Operator With Learned Deformations
No ratings yet
2207.05209 Fourier Neural Operator With Learned Deformations
17 pages
Rfnet-4D++: Joint Object Reconstruction and Flow Estimation From 4D Point Clouds With Cross-Attention Spatio-Temporal Features
No ratings yet
Rfnet-4D++: Joint Object Reconstruction and Flow Estimation From 4D Point Clouds With Cross-Attention Spatio-Temporal Features
14 pages
Corr Net 3 D
No ratings yet
Corr Net 3 D
10 pages
44 Variational Point Encoding Def
No ratings yet
44 Variational Point Encoding Def
10 pages
Dual Transformer For Point Cloud Analysis: Xian-Feng Han Yi-Fei Jin
No ratings yet
Dual Transformer For Point Cloud Analysis: Xian-Feng Han Yi-Fei Jin
8 pages
MicroNet ICCV2021
No ratings yet
MicroNet ICCV2021
10 pages
1 s2.0 S0950705122010772 Main
No ratings yet
1 s2.0 S0950705122010772 Main
10 pages
Final Visit PPT
No ratings yet
Final Visit PPT
14 pages
Learning General and Distinctive 3D Local Deep Descriptors For Point Cloud Registration
No ratings yet
Learning General and Distinctive 3D Local Deep Descriptors For Point Cloud Registration
7 pages
Applied Sciences: Go Wider: An Efficient Neural Network For Point Cloud Analysis Via Group Convolutions
No ratings yet
Applied Sciences: Go Wider: An Efficient Neural Network For Point Cloud Analysis Via Group Convolutions
15 pages
Triplet Loss
No ratings yet
Triplet Loss
30 pages
Artificial Neural Network Unit-3
No ratings yet
Artificial Neural Network Unit-3
2 pages
Shakibajahromi RIMeshGNN A Rotation-Invariant Graph Neural Network For Mesh Classification WACV 2024 Paper
No ratings yet
Shakibajahromi RIMeshGNN A Rotation-Invariant Graph Neural Network For Mesh Classification WACV 2024 Paper
11 pages
Pointconv: Deep Convolutional Networks On 3D Point Clouds
No ratings yet
Pointconv: Deep Convolutional Networks On 3D Point Clouds
10 pages
Clustering-Enhanced Pointcnn For Point Cloud Classification Learning
No ratings yet
Clustering-Enhanced Pointcnn For Point Cloud Classification Learning
6 pages
GoogleNET and ResNet v4 With Nin and Bias
No ratings yet
GoogleNET and ResNet v4 With Nin and Bias
82 pages
Symplectic Networks For Identifying Hamiltonian Systems.
No ratings yet
Symplectic Networks For Identifying Hamiltonian Systems.
18 pages
Pointnet: Deep Learning On Point Sets For 3D Classification and Segmentation
No ratings yet
Pointnet: Deep Learning On Point Sets For 3D Classification and Segmentation
19 pages
Multi Distance Metric Network For Few Shot Learning: Farong Gao Lijie Cai Zhangyi Yang Shiji Song Cheng Wu
No ratings yet
Multi Distance Metric Network For Few Shot Learning: Farong Gao Lijie Cai Zhangyi Yang Shiji Song Cheng Wu
12 pages
3D U-Net Learning Dense Volumetric Segmentation FR
No ratings yet
3D U-Net Learning Dense Volumetric Segmentation FR
9 pages
1.fan - A Point Set Generation Network For 3D Object Reconstruction From A Single Image - CVPR - 2017 - Paper
No ratings yet
1.fan - A Point Set Generation Network For 3D Object Reconstruction From A Single Image - CVPR - 2017 - Paper
9 pages
Group Equivariant Convolutional Networks: Taco S. Cohen
No ratings yet
Group Equivariant Convolutional Networks: Taco S. Cohen
12 pages
Week 6 Prev & Current Assignments
No ratings yet
Week 6 Prev & Current Assignments
21 pages
1.8.citedby - Fusing The Old With The New: Learning Relative Camera Pose With Geometry-Guided Uncertainty - 2104.08278
No ratings yet
1.8.citedby - Fusing The Old With The New: Learning Relative Camera Pose With Geometry-Guided Uncertainty - 2104.08278
11 pages
Zhang Deep Graphical Feature Learning For The Feature Matching Problem ICCV 2019 Paper
No ratings yet
Zhang Deep Graphical Feature Learning For The Feature Matching Problem ICCV 2019 Paper
10 pages
Tabnet: Attentive Interpretable Tabular Learning: Sercan O. Arık Tomas Pfister
No ratings yet
Tabnet: Attentive Interpretable Tabular Learning: Sercan O. Arık Tomas Pfister
12 pages
Pointnet++: Deep Hierarchical Feature Learning On Point Sets in A Metric Space
No ratings yet
Pointnet++: Deep Hierarchical Feature Learning On Point Sets in A Metric Space
14 pages
Point Transformer
No ratings yet
Point Transformer
11 pages
Point Transformers
No ratings yet
Point Transformers
11 pages
Deep Learning On Point Clouds and Its Application - A Survey
No ratings yet
Deep Learning On Point Clouds and Its Application - A Survey
22 pages
Guidelines - Deep Learning
No ratings yet
Guidelines - Deep Learning
2 pages
Build Deep Learning NN Models
No ratings yet
Build Deep Learning NN Models
6 pages
MPCT Multiscale Point Cloud Transformer With A Residual Network
No ratings yet
MPCT Multiscale Point Cloud Transformer With A Residual Network
12 pages
Understanding Deep Convolutional Networks: Review
No ratings yet
Understanding Deep Convolutional Networks: Review
16 pages
Deep Learning M2-T1-Student Question Bank
No ratings yet
Deep Learning M2-T1-Student Question Bank
2 pages
International Baccalaureate (IB) : Artificial Neural Networks - #3
No ratings yet
International Baccalaureate (IB) : Artificial Neural Networks - #3
13 pages
3 D Point Cloud Reviews
No ratings yet
3 D Point Cloud Reviews
22 pages
Lecture 16 Hao
No ratings yet
Lecture 16 Hao
56 pages
Pointwise Convolutional Neural Networks
No ratings yet
Pointwise Convolutional Neural Networks
10 pages
Point Transformer
No ratings yet
Point Transformer
10 pages
Pan 3D Object Detection With Pointformer CVPR 2021 Paper
No ratings yet
Pan 3D Object Detection With Pointformer CVPR 2021 Paper
10 pages
Learning Efficient Point Cloud Generation For Dense 3D Object Reconstruction
No ratings yet
Learning Efficient Point Cloud Generation For Dense 3D Object Reconstruction
8 pages
Embedding Expansion: Augmentation in Embedding Space For Deep Metric Learning
No ratings yet
Embedding Expansion: Augmentation in Embedding Space For Deep Metric Learning
14 pages
20IT7301 - Deep Learning Syllabus
No ratings yet
20IT7301 - Deep Learning Syllabus
3 pages
Handwritten Character Recognition From Images Using CNN-ECOC Handwritten Character Recognition From Images Using CNN-ECOC
No ratings yet
Handwritten Character Recognition From Images Using CNN-ECOC Handwritten Character Recognition From Images Using CNN-ECOC
7 pages
RNN, NLP
No ratings yet
RNN, NLP
2 pages
Quiz - Review On Machine Learning
No ratings yet
Quiz - Review On Machine Learning
6 pages
Unit Iv DL
No ratings yet
Unit Iv DL
26 pages
Deep Learning Syllabus
No ratings yet
Deep Learning Syllabus
2 pages
Perceptron - Wikipedia
No ratings yet
Perceptron - Wikipedia
9 pages
ANN MODULE 1 Part2
No ratings yet
ANN MODULE 1 Part2
58 pages
Deep Tensor Convolution On Multicores
No ratings yet
Deep Tensor Convolution On Multicores
10 pages
DL Unit-3
No ratings yet
DL Unit-3
9 pages
CNN Architectures: Lenet, Alexnet, VGG, Googlenet, Resnet and More
No ratings yet
CNN Architectures: Lenet, Alexnet, VGG, Googlenet, Resnet and More
9 pages
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Bilinear Interpolation: Enhancing Image Resolution and Clarity through Bilinear Interpolation
From Everand
Bilinear Interpolation: Enhancing Image Resolution and Clarity through Bilinear Interpolation
Fouad Sabry
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lecture PointNet-part1-summary-2

Uploaded by

Lecture PointNet-part1-summary-2

Uploaded by

PointNet: Deep Learning on Point Sets

for 3D Classification and

source: Scott J source: Google source:

source: Scott J source: Google source:

Need for 3D Deep Learning!

Point cloud is canonical

Can we achieve effective feature

PointNet Object Part Segmentation

Unordered point set as input

Invariance under geometric transformations

Unordered point set as input

Invariance under geometric transformations

N represents the same set N

N represents the same set N

Model needs to be invariant to N! permutations

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

f (x1, x2 ,, xn )   ∘ g(h(x1 ),, h(xn )) is symmetric if g is

(2,3,4 PointNet (vanilla)

(2,3,4 MLP PointNet (vanilla)

Unordered point set as input

Invariance under geometric transformations

local embedding global feature

local embedding global feature

dataset: Stanford 2D-3D-S (Matterport scans)

Which input points are contributing to the global

Which points won’t affect the global

See you at Poster 9!

Which input point will activate neuron X?

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.