
Introduction to Deep Learning

Nandita Bhaskhar
Content adapted from CS231n and past CS229 teams
April 29th, 2022
Overview
● Motivation for deep learning
● Areas of Deep Learning
● Convolutional neural networks
● Recurrent neural networks
● Deep learning tools

2
Classical Approaches Saturate!
● Computer vision is especially hard for conventional image processing techniques.
● Humans are just intrinsically better at perceiving the world!

https://xkcd.com/1425/
3
What about the MLPs we learnt in class?

Recall:
● Input Layer
● Hidden layer
● Activations
● Outputs

Pic Credit: Becoming Human: Artificial Intelligence Magazine 4


What about the MLPs we learnt in class?
● Expensive to learn, and will not generalize well.
● Does not exploit the order and local relations in the data!
● A 64x64x3 image flattened into a vector gives 12,288 inputs, so every neuron in the first hidden layer needs 12,288 weights.
● We also want many layers, which multiplies the cost (see the sketch below).
5
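For concreteness, here is the parameter arithmetic as a quick sketch; the 256-unit hidden layer and the 32-filter conv layer are assumed sizes for illustration, not from the slides:

# A 64x64 RGB image flattened into a vector has 64*64*3 = 12,288 inputs.
inputs = 64 * 64 * 3                      # 12288
hidden = 256                              # assumed hidden-layer width
mlp_params = inputs * hidden + hidden     # weights + biases of the first layer
print(mlp_params)                         # 3,145,984 parameters in one layer alone

# By contrast, a conv layer with 32 filters of size 3x3x3 needs only:
conv_params = 32 * (3 * 3 * 3) + 32       # 896 parameters
print(conv_params)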
Overview
● Motivation for deep learning
● Areas in Deep Learning
● Convolutional neural networks
● Recurrent neural networks
● Deep learning tools

6
What are the different pillars of deep learning?

● Convolutional NN: images
● Recurrent NN: time series
● Graph NN: networks / relational data
● Deep RL: control systems

7
Overview
● Motivation for deep learning
● Areas of Deep Learning
● Convolutional neural networks
● Recurrent neural networks
● Deep learning tools

8
Convolutional Neural Networks

[Pillar diagram with Convolutional Neural Network highlighted; Recurrent NN, Deep RL, and Graph NN come later.]

9
Let us look at images in detail

10
2D Convolution

11
Pic Credit: Apple, Chip Huyen
Convolving Filters
Sharpening
https://ai.stanford.edu/~syyeung/cvweb/tutorials.html

Edge Detection: Laplacian Filters

4-neighbour:        8-neighbour:
 0 -1  0            -1 -1 -1
-1  4 -1            -1  8 -1
 0 -1  0            -1 -1 -1

12
https://ai.stanford.edu/~syyeung/cvweb/tutorials.html
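As a sketch of how such a filter is applied, here is a from-scratch NumPy implementation (my own illustration, not code from the slides):

import numpy as np

# The 4-neighbour Laplacian kernel from the slide
K = np.array([[ 0, -1,  0],
              [-1,  4, -1],
              [ 0, -1,  0]], dtype=float)

def conv2d_valid(img, k):
    """Naive 'valid' 2D sliding-window filter (deep learning libraries
    actually compute cross-correlation and still call it convolution)."""
    kh, kw = k.shape
    oh, ow = img.shape[0] - kh + 1, img.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(img[i:i+kh, j:j+kw] * k)
    return out

img = np.random.rand(8, 8)        # toy grayscale image
edges = conv2d_valid(img, K)      # large responses where intensity changes sharply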
Convolving Filters
● Why not extract features using filters?
● Better yet, why not let the data dictate what filters to use?
● Learnable filters!!
13
Convolution on multiple channels
● Images are generally RGB!
● How would a filter work on an image with RGB channels?
● The filter should also have 3 channels.
● Now the output has a channel for every filter we have used.
14
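A quick shape-check of the multi-channel case, sketched here in PyTorch (the framework choice and the 8-filter layer are illustrative assumptions):

import torch
import torch.nn as nn

x = torch.randn(1, 3, 64, 64)    # (batch, RGB channels, height, width)
conv = nn.Conv2d(in_channels=3, out_channels=8, kernel_size=3, padding=1)
y = conv(x)
print(y.shape)                   # torch.Size([1, 8, 64, 64])
# Each of the 8 filters spans all 3 input channels, and the output has
# one channel per filter, exactly as described above.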
[Slides 15-29: step-by-step visual walkthrough of the convolution operation (sliding the filter over the input, stride, padding). Slide Credit: CS231n]
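Such walkthroughs build up to the standard output-size arithmetic for a convolution. The formula itself is standard; the slide contents above are images, so this sketch is my reconstruction:

def conv_output_size(w, f, p, s):
    """Spatial output size for input width w, filter size f, padding p, stride s."""
    return (w - f + 2 * p) // s + 1

conv_output_size(7, 3, 0, 1)   # 5  (7x7 input, 3x3 filter, stride 1)
conv_output_size(7, 3, 0, 2)   # 3  (stride 2 halves the output, roughly)
conv_output_size(7, 3, 1, 1)   # 7  ("same" padding preserves the size)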
Parameter Sharing

The fewer the parameters, the less computationally intensive the training. This is a win-win, since we reuse the same filter weights at every spatial position.
30
Translational invariance
Since we are training filters to detect cats and then moving these filters over the data, a differently positioned cat will also get detected by the same set of filters.
31
Filters? Layers of filters?

Left: images that maximize filter outputs at certain layers. We observe that the images get more complex for filters situated deeper in the network.
Right: deeper layers can learn richer embeddings. An eye is made up of multiple curves, and a face is made up of two eyes.
32
How do we use convolutions?

Let convolutions extract features!


33
Image credit: LeCun et al. (1998)
Fun Fact: Convolution really is just a linear operation
● In fact, convolution is a giant matrix multiplication.
● We can expand the 2-dimensional image into a vector and the convolution operation into a matrix.
34
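A small NumPy sketch verifying this claim via the classic im2col trick (my own illustration, not from the slides):

import numpy as np

def im2col(img, kh, kw):
    """Stack every kh x kw patch of img as a row of a matrix."""
    oh, ow = img.shape[0] - kh + 1, img.shape[1] - kw + 1
    rows = [img[i:i+kh, j:j+kw].ravel() for i in range(oh) for j in range(ow)]
    return np.array(rows), (oh, ow)

img = np.random.rand(6, 6)
k = np.random.rand(3, 3)

patches, (oh, ow) = im2col(img, 3, 3)              # shape (16, 9)
as_matmul = (patches @ k.ravel()).reshape(oh, ow)  # conv as one matrix product

# Compare against the naive sliding-window computation
naive = np.array([[np.sum(img[i:i+3, j:j+3] * k) for j in range(ow)]
                  for i in range(oh)])
assert np.allclose(as_matmul, naive)               # identical results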
How do we learn?

We now have a network with:
● a bunch of weights
● a loss function

To learn:
● Just do gradient descent and backpropagate the error derivatives
35
How do we learn?
Instead of plain gradient descent, there are "optimizers":

● Momentum: gradient + momentum
● Nesterov: look-ahead momentum + gradient
● Adagrad: normalize by the sum of squared gradients
● RMSprop: normalize by a moving average of squared gradients
● Adam: RMSprop + momentum
36
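In code, swapping optimizers is a one-line change. A PyTorch sketch (learning rates and the toy loss are illustrative assumptions):

import torch

w = torch.randn(10, requires_grad=True)

# Any of these can be swapped in; only the update rule changes.
opt = torch.optim.SGD([w], lr=0.01, momentum=0.9)                   # momentum
# opt = torch.optim.SGD([w], lr=0.01, momentum=0.9, nesterov=True) # Nesterov
# opt = torch.optim.Adagrad([w], lr=0.01)
# opt = torch.optim.RMSprop([w], lr=0.01)
# opt = torch.optim.Adam([w], lr=0.001)

loss = (w ** 2).sum()    # toy loss
opt.zero_grad()
loss.backward()          # backpropagate the error derivatives
opt.step()               # apply the optimizer's update rule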
Mini-batch Gradient Descent
Computing the gradient over a large dataset is expensive:

● Memory size
● Compute time

Mini-batch: take a sample of the training data per update.

How do we sample intelligently?
37
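A minimal mini-batch loop, sketched in PyTorch; the model, the random data, and the batch size of 64 are assumed for illustration:

import torch
from torch.utils.data import TensorDataset, DataLoader

X, y = torch.randn(10000, 20), torch.randn(10000, 1)   # toy dataset
loader = DataLoader(TensorDataset(X, y), batch_size=64, shuffle=True)

model = torch.nn.Linear(20, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.01)

for xb, yb in loader:                       # one pass over loader = one epoch
    loss = ((model(xb) - yb) ** 2).mean()   # gradient estimated from 64 samples
    opt.zero_grad()
    loss.backward()
    opt.step()

Random shuffling (shuffle=True) is the simplest answer to "sampling intelligently": it decorrelates consecutive batches.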
Is deeper better?
Deeper networks seem to be more powerful but harder to train:

● Loss of information during forward propagation
● Loss of gradient information during backpropagation

There are many ways to "keep the gradient going".
38
Solution
Connect the layers: create a gradient highway or information highway.

ResNet (2015)
39
Image credit: He et al. (2015)
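A simplified residual block in PyTorch; this is a sketch of the skip-connection idea, not the exact He et al. block (the real ones also include batch normalization):

import torch.nn as nn

class ResidualBlock(nn.Module):
    """Simplified residual block: output = F(x) + x (the 'gradient highway')."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        out = self.conv2(self.relu(self.conv1(x)))
        return self.relu(out + x)   # gradients flow unimpeded through '+ x'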
Initialization
● Can we initialize all neurons to zero? No: if all the weights are the same, we will not be able to break the symmetry of the network, and all filters will end up learning the same thing.
● Large initial values might knock ReLU units out: once a ReLU's output is zero, its gradient flow also becomes zero.
● We need small random numbers at initialization: mean 0, standard deviation on the order of 1/sqrt(n).

Popular initialization setups

(Xavier, He) x (Uniform, Normal) 40
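In PyTorch these setups are one call each (a sketch; the layer sizes are illustrative):

import torch.nn as nn

layer = nn.Linear(512, 256)

# He (Kaiming) normal init: std = sqrt(2 / fan_in), designed for ReLU networks
nn.init.kaiming_normal_(layer.weight, nonlinearity='relu')

# Xavier (Glorot) uniform is the other common choice:
# nn.init.xavier_uniform_(layer.weight)

nn.init.zeros_(layer.bias)   # biases can safely start at zero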


Dropout
● What does cutting off some network connections do?
● It trains multiple smaller networks in an ensemble.
● Can drop an entire layer too!
● Acts like a really good regularizer.
41
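A minimal dropout demonstration in PyTorch (p=0.5 is an illustrative choice):

import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)   # each unit is zeroed with probability 0.5
x = torch.ones(1, 8)

drop.train()
print(drop(x))   # roughly half the units zeroed; survivors scaled by 1/(1-p)

drop.eval()
print(drop(x))   # at test time dropout is a no-op: the full "ensemble" is used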
Tricks for training
● Data augmentation if your dataset is small. This helps the network generalize better.
● Early stopping when validation loss stops improving and starts rising above the (still-decreasing) training loss.
● Random hyperparameter search or grid search?
42
Overview
● Motivation for deep learning
● Areas of Deep Learning
● Convolutional neural networks
● Recurrent neural networks
● Deep learning tools

43
CNN sounds like fun!
What are some other deep learning pillars?

[Pillar diagram with Recurrent NN (time series) highlighted; Convolutional NN, Deep RL, and Graph NN alongside.]


44
We can also have 1D architectures (remember this)
● CNNs work on any data where there is a local pattern.
● We use 1D convolutions on DNA sequences, text sequences, and music notes.
● But what if a time series has a causal dependency, or any kind of sequential dependency?
45
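A 1D convolution sketch in PyTorch; the 4-channel one-hot DNA encoding is an assumed example:

import torch
import torch.nn as nn

# A sequence of length 100 with 4 channels (e.g., one-hot DNA bases A, C, G, T)
x = torch.randn(1, 4, 100)
conv = nn.Conv1d(in_channels=4, out_channels=16, kernel_size=5, padding=2)
print(conv(x).shape)   # torch.Size([1, 16, 100]): local motifs detected per position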
To address sequential dependency?
Use a recurrent neural network (RNN).

[Diagram: an RNN cell consumes one time step at a time, feeding its previous (latent) output back into itself; "unrolling" the RNN lays the time steps out in sequence.]

The unrolled copies are really the same cell, NOT many different cells like the kernels of a CNN.
46
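A minimal NumPy sketch of one recurrent step (sizes are illustrative); note that the same W_h and W_x are reused at every time step:

import numpy as np

def rnn_step(h_prev, x, W_h, W_x, b):
    """One time step: the SAME weights are reused at every position."""
    return np.tanh(W_h @ h_prev + W_x @ x + b)

hidden, inp = 8, 5
W_h = np.random.randn(hidden, hidden)
W_x = np.random.randn(hidden, inp)
b = np.zeros(hidden)

h = np.zeros(hidden)
for x_t in np.random.randn(10, inp):    # unroll over a 10-step sequence
    h = rnn_step(h, x_t, W_h, W_x, b)   # h is the evolving "embedding"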
How does an RNN produce a result?
The hidden state is an evolving "embedding": the network reads the sentence ("I love CS !") one token at a time, and the result is taken after reading the full sentence.
47
There are 2 types of RNN cells
● Long Short-Term Memory (LSTM): gates decide what to store in "long-term memory" (the cell state) versus the response to the current input.
● Gated Recurrent Unit (GRU): a reset gate and an update gate mix the previous state with the response to the current input.
48
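Both cells are available off the shelf; a PyTorch sketch (the sizes are illustrative):

import torch
import torch.nn as nn

x = torch.randn(1, 12, 32)   # (batch, time steps, features)
lstm = nn.LSTM(input_size=32, hidden_size=64, batch_first=True)
gru = nn.GRU(input_size=32, hidden_size=64, batch_first=True)

out, (h_n, c_n) = lstm(x)   # LSTM carries a separate cell state ("long-term memory")
out, h_n = gru(x)           # GRU folds everything into one hidden state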
Recurrent AND deep?
● Stacking: feed one RNN layer's outputs into the next, taking the last value at the top.
● Attention model: instead of taking only the last value, pay "attention" to everything (every time step).
49


"Recurrent" AND convolutional?

Temporal convolutional network (TCN)

● Temporal dependency is achieved through "one-sided" (causal) convolution.
● More efficient, because deep learning packages are optimized for matrix multiplication, and convolution is a matrix multiplication.
● No hard sequential dependency between time steps, so they can be processed in parallel.
50
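A sketch of a causal ("one-sided") 1D convolution in PyTorch; this is my own minimal illustration, not a full TCN:

import torch
import torch.nn as nn

# Causal convolution: pad only on the left, so the output at time t
# sees inputs up to time t and nothing from the future.
class CausalConv1d(nn.Module):
    def __init__(self, ch_in, ch_out, k):
        super().__init__()
        self.pad = k - 1
        self.conv = nn.Conv1d(ch_in, ch_out, k)

    def forward(self, x):
        x = nn.functional.pad(x, (self.pad, 0))   # left-pad the time axis
        return self.conv(x)

y = CausalConv1d(8, 8, k=3)(torch.randn(1, 8, 50))
print(y.shape)   # torch.Size([1, 8, 50]): sequence length is preserved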
More? Take CS230, CS236, CS231N, CS224N

● Convolutional NN: images
● Recurrent NN: time series
● Graph NN: networks / relational data
● Deep RL: control systems
51
Not today, but take CS234 and CS224W

● Convolutional NN: images
● Recurrent NN: time series
● Graph NN: networks / relational data
● Deep RL: control systems

52
Overview
● Motivation for deep learning
● Areas of Deep Learning
● Convolutional neural networks
● Recurrent neural networks
● Deep learning tools

53
Tools for deep learning

[Logo slide: popular tools, plus specialized groups of frameworks]
54
Where can I get free stuff?

Google Colab:
● Free (limited-ish) GPU access
● Works nicely with TensorFlow
● Links to Google Drive

Other options: Azure Notebooks, Kaggle kernels???, Amazon SageMaker?

Free credits:
● Register a new Google Cloud account => instant $300??
● AWS free tier (limited compute)
● Azure education account, $200?

To SAVE money: CLOSE your GPU instance (~$1 an hour)
55
Good luck!
Well, have fun too :D

56
