Lecture 6 - Convolutional Neural Networks (CNN)
Review of Fully Connected Neural Networks
• The CIFAR-10 dataset consists of 60,000 32x32 colour images in 10 classes, with 6,000 images per class.
• For example, an image of more respectable size, e.g. 200x200x3, would lead to
neurons that have 200*200*3 = 120,000 weights.
• This full connectivity is wasteful and the huge number of parameters would
quickly lead to overfitting.
• INPUT [32x32x3] will hold the raw pixel values of the image.
Width 32, Height 32, and 3 channels R,G,B.
• CONV layer will compute the output of neurons that are connected to local regions in the input, each computing a dot product between their weights and a small region they are connected to in the input volume. This may result in a volume such as [32x32x12] if we decide to use 12 filters.
• RELU layer will apply an elementwise activation function, such as the max(0,x)
thresholding at zero. This leaves the size of the volume unchanged
([32x32x12]).
• POOL layer will perform a downsampling operation along the spatial dimensions (width, height), resulting in a volume such as [16x16x12].
• FC (i.e. fully-connected) layer will compute the class scores, resulting in a volume of size [1x1x10], where each of the 10 numbers corresponds to a class score, such as among the 10 categories of CIFAR-10.
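To make this pipeline concrete, here is a minimal sketch in PyTorch (an assumed framework choice, not part of the lecture); the layer sizes follow the CIFAR-10 walkthrough above:

import torch
import torch.nn as nn

# INPUT [32x32x3] -> CONV [32x32x12] -> RELU -> POOL [16x16x12] -> FC [1x1x10]
model = nn.Sequential(
    nn.Conv2d(in_channels=3, out_channels=12, kernel_size=3, padding=1),  # 12 filters; padding keeps 32x32
    nn.ReLU(),                        # elementwise max(0, x); volume size unchanged
    nn.MaxPool2d(kernel_size=2),      # downsample 32x32 -> 16x16
    nn.Flatten(),                     # 16 * 16 * 12 = 3072 values per image
    nn.Linear(12 * 16 * 16, 10),      # 10 class scores, one per CIFAR-10 category
)

x = torch.randn(1, 3, 32, 32)         # one dummy CIFAR-10-sized image (channels first)
print(model(x).shape)                 # torch.Size([1, 10])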
Convolution
Stride
Recall that we described a convolution operation as “sliding” a small matrix across a large matrix, stopping at each coordinate, computing an element-wise multiplication and sum, then storing the output. The stride S is the number of pixels the filter shifts between stops, so a larger stride yields a smaller output.
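A minimal NumPy sketch of this sliding-window operation (convolve2d is a hypothetical helper written for illustration, not a library routine):

import numpy as np

def convolve2d(image, kernel, stride=1):
    """Slide `kernel` across `image`; multiply element-wise and sum at each stop."""
    H, W = image.shape
    F = kernel.shape[0]                      # assume a square FxF kernel
    out_h = (H - F) // stride + 1            # output size with no padding (P = 0)
    out_w = (W - F) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i*stride:i*stride+F, j*stride:j*stride+F]
            out[i, j] = np.sum(patch * kernel)   # element-wise multiply, then sum
    return out

image = np.arange(25, dtype=float).reshape(5, 5)
kernel = np.ones((3, 3)) / 9.0               # a simple 3x3 averaging filter
print(convolve2d(image, kernel).shape)       # (3, 3): a 5x5 input shrinks to 3x3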
Convolution
Zero-padding
• We need to “pad” the borders of an image to retain the original image size when
applying a convolution.
• Using zero-padding, we can “pad” our input along the borders such that our
output volume size matches our input volume size.
• The amount of padding we apply is controlled by the parameter P.
• The output volume is smaller (3x3) than the input volume (5x5).
• If we instead set P = 1, we can pad our input volume with zeros to create a 7x7 volume.
• The output volume size then matches the original input volume size of 5x5.
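A quick NumPy illustration of the padding step (np.pad fills with zeros by default):

import numpy as np

image = np.arange(25, dtype=float).reshape(5, 5)
padded = np.pad(image, pad_width=1)   # P = 1: a ring of zeros, 5x5 -> 7x7
print(padded.shape)                   # (7, 7)
# Convolving the padded 7x7 volume with a 3x3 filter at stride 1 gives
# (7 - 3)/1 + 1 = 5, so the output matches the original 5x5 input.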
Convolution
Output volume size
The spatial size of the output volume depends on four quantities:
• The input volume size (W)
• The receptive field size of the Conv Layer neurons (F).
• The stride with which they are applied (S)
• The amount of zero padding used (P) on the border.
• The output volume size is calculated by the formula:
(W − F + 2P)/S + 1
For example, for a 7x7 input and a 3x3 filter with stride 1 and pad 0, we would get a 5x5 output.
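The same arithmetic as a small Python helper (a hypothetical function for illustration):

def conv_output_size(W, F, S=1, P=0):
    """Spatial output size of a convolution: (W - F + 2P)/S + 1."""
    return (W - F + 2 * P) // S + 1

print(conv_output_size(W=7, F=3, S=1, P=0))   # 5 -> the 7x7 input / 3x3 filter example
print(conv_output_size(W=5, F=3, S=1, P=1))   # 5 -> P = 1 preserves a 5x5 input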
[Figure: element-wise multiply-and-sum of a 3x3 filter sliding over a 5x5 input]
Pooling Layer
Pooling is a down-sampling operation that reduces the spatial dimensionality of the feature map. The rectified feature map (the output of the RELU layer) goes through a pooling layer to generate a pooled feature map.
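A minimal NumPy sketch of 2x2 max pooling (max_pool2d is a hypothetical helper, not a library routine):

import numpy as np

def max_pool2d(fmap, size=2, stride=2):
    """Keep the largest value in each (size x size) window of the feature map."""
    H, W = fmap.shape
    out_h = (H - size) // stride + 1
    out_w = (W - size) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = fmap[i*stride:i*stride+size, j*stride:j*stride+size].max()
    return out

fmap = np.random.rand(32, 32)
print(max_pool2d(fmap).shape)   # (16, 16): width and height halved, as in [32x32x12] -> [16x16x12]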
Average pooling
Average pooling works the same way, but replaces each window with its mean value rather than its maximum.
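The only change from the max-pooling sketch above is reducing each window to its mean instead of its maximum:

import numpy as np

def avg_pool2d(fmap, size=2, stride=2):
    """Replace each (size x size) window of the feature map with its mean."""
    H, W = fmap.shape
    out = np.zeros(((H - size) // stride + 1, (W - size) // stride + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = fmap[i*stride:i*stride+size, j*stride:j*stride+size].mean()
    return out

print(avg_pool2d(np.random.rand(32, 32)).shape)   # (16, 16)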
Activation Layer
The activation layer applies an elementwise non-linearity such as RELU, max(0, x), leaving the size of the volume unchanged.
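In NumPy, the RELU activation is a one-liner:

import numpy as np

fmap = np.random.randn(4, 4)   # a feature map with positive and negative values
print(np.maximum(0, fmap))     # RELU: negatives clamped to 0, shape unchanged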
Flattening
Flattening is used to convert all the resultant 2-dimensional arrays from pooled feature maps into a single long continuous linear vector.
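In NumPy, flattening is a single reshape:

import numpy as np

pooled = np.random.rand(16, 16, 12)   # a pooled feature volume, e.g. [16x16x12]
flat = pooled.reshape(-1)             # one long continuous vector
print(flat.shape)                     # (3072,) = 16 * 16 * 12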