Lecture PointNet-part1-summary-2
Lecture PointNet-part1-summary-2
Emerging 3D Applications
Big Data+DeepRepresentation
Learning
Robot Perception Augmented Reality Shape
Design
LiDA
Volumetr
R
ic
Depth
Point Cloud
Sensor
Depth
Map
Previous
Works
Most existing point cloud features are
handcrafted
towards specific tasks
Source: https://github.com/PointCloudLibrary/pcl/wiki/Overview-and-Comparison-of-Features
Question:
PointNet
Our Work:
PointNet
End-to-end learning for scattered, unordered point
data
Unified framework for various tasks
Object Classification
N
Unordered
Input
Point cloud: N orderless points, each represented
by a D dim vector
D D
as
Unordered
Input
Point cloud: N orderless points, each represented
by a D dim vector
D D
as
f (x ℝ
1 2 n
Permutation Invariance: Symmetric
Function
f (x1 , x2 ,, xn ) , x ,, x ), xi D
f (x ℝ
1 2 n
Examples:
f (x1 , x2 ,, xn ) max{x1 , x2 ,,
xn }
f (x1 , x2 ,, xn ) x1 x2 xn
…
Permutation Invariance: Symmetric
Function
f (x1 , x2 ,, xn ) , x ,, x ), xi D
f (x ℝ
1 2 n
Examples:
f (x1 , x2 ,, xn ) max{x1 , x2 ,, xn }
f (x1 , x2 ,, xn ) x1 x2 xn
…
How can we construct a family of symmetric
functions by neural networks?
Permutation Invariance: Symmetric
Function
Observe:
symmetric
Permutation Invariance: Symmetric
Function
Observe:
symmetric
h
(1,2,3)
(1,1,1)
…
(2,3,2)
Permutation Invariance: Symmetric
Function
Observe:
symmetric
(1,2,3 simple symmetric
) h function
(1,1,1 g
)
(2,3,2
)
…
(2,3,4
Permutation Invariance: Symmetric
Function
Observe:
symmetric
(1,2,3 simple symmetric
) h function
(1,1,1 g
)
(2,3,2
)
…
Symmetric functions
PointNet
(vanilla)
Universal Set Function
Approximator
Theorem:
A Hausdorff continuous symmetric function f : 2 X
ℝ can be arbitrarily approximated by PointNet.
S ℝd PointNet (vanilla)
Basic PointNet
Architecture
Empirically, we use multi-layer perceptron (MLP) and max
pooling:
h
(1,2,3 MLP
) g
(1,1,1 MLP
) ma MLP
(2,3,2 MLP
x
)
…
Transform
Dat
ed
a
Data
Input Alignment by Transformer
Network
Idea: Data dependent transformation for automatic
alignment
3 T- transfor 3
Net m
params
N Transfor N
m
Transform
Dat
ed
a
Data
Input Alignment by Transformer
Network
Idea: Data dependent transformation for automatic
alignment
3 T- transfor 3
Net m
params
N Transfor N
m
Transform
Dat
ed
a
Data
Input Alignment by Transformer
Network
The transformation is just matrix
multiplication!
3 T- transfor 3
Net params:
m
3x3
Matrix
N
Mult.
Transform
Dat
ed
a
Data
Embedding Space
Alignment
transform
T- params:
Net 64x64
Matri
x
Mult
.
Input Transform
embedding ed
s: Nx64 embedding
s: Nx64
Embedding Space
Alignment
transform
T- params: Regularization:
Net 64x64
Matri Transform matrix A
x 64x64 close to
Mult orthogonal:
.
Input Transform
embedding ed
s: Nx64 embedding
s: Nx64
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
PointNet Classification
Network
Extension to
PointNet Segmentation
Network
Critical Point
Sets:
Visualizing Global Point
Cloud Features
3 MLP 102
4 maxpo
ol
n
share global feature
d
PointNet 87%
(1,2,3 (1,1,1 (2,3,2 (2,3,4 (vanilla)
) ) ) )
PointNet Classification
Network
ModelNet40 Accuracy
PointNet (vanilla) 87.1%
+ input 3x3 87.9%
+ feature 64x64 86.9%
+ feature 64x64 + 87.4%
reg
+ both 89.2%
VisualizingPoint
Functions
1x 1x102
Compact 3 FCs 4
View:
1x 1x102
Expanded 3 FC FC FC FC FC 4
View: 6 6 6 12
4 4 4 8