Backpropagation Math
Activation Functions:
1. Step (threshold) function: Mathematically, it can be defined as
f(z) = 0 if z < 0
       1 if z ≥ 0
It provides possible outputs = {0, 1}. It cannot provide multi-valued outputs – for example, it cannot be used for a multi-class classification problem.
2. Signum function: Mathematically, it can be defined as
f(z) = −1 if z < 0
        1 if z > 0
It provides possible outputs = {−1, 1}.
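As a quick illustration of the step and signum functions above, here is a minimal sketch in Python (the function names are my own):

import numpy as np

def step(z):
    # Binary step: 0 for z < 0, 1 for z >= 0
    return np.where(z < 0, 0, 1)

def signum(z):
    # Signum: -1 for z < 0, +1 for z > 0 (np.sign returns 0 at exactly z = 0)
    return np.sign(z)

z = np.array([-2.0, -0.5, 0.5, 2.0])
print(step(z))    # [0 0 1 1]
print(signum(z))  # [-1. -1.  1.  1.]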
3. Linear function: It simply returns its input, f(x) = x, so the output is unbounded and proportional to the input.
4. ReLU function: ReLU stands for Rectified Linear Unit.
Mathematically, it can be defined as,
f(x) = max(0, x)
Although it gives the impression of a linear function, ReLU has a derivative and allows backpropagation, while remaining computationally efficient.
The main advantage of the ReLU activation function is that it does not activate all the neurons at the same time, so it is far more computationally efficient than the sigmoid and tanh functions. ReLU also accelerates the convergence of gradient descent towards the global minimum of the loss function due to its linear, non-saturating property.
The drawback of the ReLU function is the dying ReLU problem. At any given time some neurons are active and some are inactive, and in certain situations a neuron that has become inactive will never become active again; this is known as the dying ReLU problem.
On the negative side of the graph the gradient is zero, so during backpropagation the weights and biases of those neurons are not updated. This can create dead neurons that never get activated.
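A minimal sketch of ReLU and its gradient in Python (names are my own), showing why negative pre-activations pass no gradient back:

import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def relu_grad(x):
    # Derivative is 1 for x > 0 and 0 for x <= 0, so negative
    # pre-activations block the gradient entirely (dying ReLU).
    return (x > 0).astype(float)

x = np.array([-3.0, -0.5, 0.0, 0.5, 3.0])
print(relu(x))       # approximately [0. 0. 0. 0.5 3.]
print(relu_grad(x))  # [0. 0. 0. 1. 1.]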
5. Leaky ReLU function: Leaky ReLU is an improved version of the ReLU function that solves the dying ReLU problem by giving the negative region a small positive slope.
Mathematically, it can be defined as,
f(x) = max(0.1x, x)
The leaky ReLU function enables backpropagation even for negative input values.
However, predictions may not be consistent for negative input values, and the gradient there is small, which makes learning the model parameters time-consuming.
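A corresponding sketch of leaky ReLU, using the 0.1 slope given above (names are my own):

import numpy as np

def leaky_relu(x, slope=0.1):
    # Unlike plain ReLU, negative inputs keep a small slope,
    # so their gradient is small but non-zero.
    return np.maximum(slope * x, x)

x = np.array([-3.0, -0.5, 0.5, 3.0])
print(leaky_relu(x))  # approximately [-0.3 -0.05 0.5 3.]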
6. Tanh function: It is a non-linear activation function. Its output ranges from -1 to 1.
Mathematically, it can be defined as
f(z) = (e^z − e^(−z)) / (e^z + e^(−z))
Where z is the weighted sum of the neuron.
The output of the tanh activation function is zero-centered, so we can easily map the output values as strongly negative, neutral or strongly positive. The drawback of this function is the vanishing gradient problem.
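A small sketch of tanh applied to a weighted sum (names are my own):

import numpy as np

def tanh(z):
    # (e^z - e^-z) / (e^z + e^-z); output range is (-1, 1) and zero-centered
    return np.tanh(z)

z = np.array([-2.0, 0.0, 2.0])
print(tanh(z))  # approximately [-0.964  0.  0.964]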
7. Sigmoid function: This function takes any real value as input and outputs values in the
range of 0 to 1. Mathematically it can be defined as
f(z) = e^z / (1 + e^z) = 1 / (1 + e^(−z))
Where z is the weighted sum.
It is commonly used in models where we have to predict a probability as the output. Since a probability exists only in the range 0 to 1, sigmoid is the right choice because of its range. The function is differentiable and provides a smooth gradient, preventing jumps in output values, and it can be used in the backpropagation algorithm. Because the output of the sigmoid function lies in the range 0 to 1, it can be thought of as a probability.
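A minimal sketch of the sigmoid applied to a weighted sum (names are my own):

import numpy as np

def sigmoid(z):
    # 1 / (1 + e^-z); maps any real z into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0.0))    # 0.5
print(sigmoid(0.755))  # ~0.680, the hidden-unit output used in Problem 01 below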
Suppose we have five output values of 0.8, 0.9, 0.7, 0.8 and 0.6 respectively. We cannot move forward with the sigmoid activation function here, because these values do not make sense as class probabilities – the probabilities of all the classes should sum to 1. This is where the softmax activation function comes in.
8. Softmax function: It is the combination of multiple sigmoid activation functions. Mathematically, it can be defined as
f(z_i) = e^(z_i) / Σ_j e^(z_j)
so that all outputs lie between 0 and 1 and sum to 1.
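A small sketch applying softmax to the five scores mentioned above (names are my own; note that the outputs now sum to 1):

import numpy as np

def softmax(z):
    # Subtracting the max is a standard trick for numerical stability
    e = np.exp(z - np.max(z))
    return e / e.sum()

scores = np.array([0.8, 0.9, 0.7, 0.8, 0.6])
probs = softmax(scores)
print(probs)        # roughly [0.207 0.229 0.187 0.207 0.170]
print(probs.sum())  # 1.0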
Learning Rule:
1. Perceptron Rule:
Problem – 01: Train a perceptron with w1 = 1.2, w2 = 0.6, threshold T = 1 and learning rate η = 0.5 on the AND function:
A   B   A AND B
0   0   0
0   1   0
1   0   0
1   1   1
Solution:
For input (A, B) = (0, 1): Σ wi xi = 1.2 × 0 + 0.6 × 1 = 0.6 < 1, so output O = 0 (actual output equals the target, no update needed).
For input (A, B) = (1, 0): Σ wi xi = 1.2 × 1 + 0.6 × 0 = 1.2 > 1, so output O = 1 (actual output differs from the target 0, so the weights are updated).
w1(new) = w1 + Δw1 = w1 + η(t − O) x1 = 1.2 + 0.5 × (0 − 1) × 1 = 0.7
w2 is unchanged because x2 = 0 for this input.
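A minimal sketch of the perceptron rule for this AND problem, using the values above (w1 = 1.2, w2 = 0.6, T = 1, η = 0.5); the training loop and names are my own:

# Perceptron learning rule for the AND gate (threshold activation).
w = [1.2, 0.6]     # initial weights
T = 1.0            # firing threshold
eta = 0.5          # learning rate
data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

for epoch in range(10):
    updated = False
    for (x1, x2), target in data:
        s = w[0] * x1 + w[1] * x2
        out = 1 if s >= T else 0        # fires when the weighted sum reaches T
        if out != target:
            # w_i <- w_i + eta * (target - out) * x_i
            w[0] += eta * (target - out) * x1
            w[1] += eta * (target - out) * x2
            updated = True
    if not updated:
        break

print(w)  # [0.7, 0.6] – matches the hand computation above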
The same rule can be applied to the OR function:
A   B   A OR B
0   0   0
0   1   1
1   0   1
1   1   1
Problem: For a network with inputs x1 = 0.35 and x2 = 0.9, hidden neurons H3 and H4, output neuron O5, sigmoid activations, initial weights w13 = 0.1, w23 = 0.8, w14 = 0.4, w24 = 0.6, w35 = 0.3, w45 = 0.9, target output 0.5 and learning rate η = 1, perform a forward pass and a backward pass, then another forward pass.
Solution:
Forward pass: compute outputs for y3, y4 and y5.
For H3: a1 = w13 x1 + w23 x2 = 0.1 × 0.35 + 0.8 × 0.9 = 0.755
y3 = 1 / (1 + e^(−0.755)) = 0.68
For H4: a2 = w14 x1 + w24 x2 = 0.4 × 0.35 + 0.6 × 0.9 = 0.68
y4 = 1 / (1 + e^(−0.68)) = 0.66
For O5: a3 = w35 y3 + w45 y4 = 0.3 × 0.68 + 0.9 × 0.66 ≈ 0.801 (keeping the unrounded values of y3 and y4)
y5 = 1 / (1 + e^(−0.801)) = 0.69 (network output)
Error = y_target − y5 = 0.5 − 0.69 = −0.19
Backward pass: compute the error terms (δ) and update the weights.
For the output neuron: δ5 = y5 (1 − y5)(y_target − y5) = 0.69 × 0.31 × (−0.19) ≈ −0.0406
For the hidden neurons: δ3 = y3 (1 − y3) w35 δ5 = 0.68 × 0.32 × 0.3 × (−0.0406) ≈ −0.00265
δ4 = y4 (1 − y4) w45 δ5 = 0.66 × 0.34 × 0.9 × (−0.0406) ≈ −0.0082
Weight updates (η = 1):
Δw23 = η δ3 x2 = 1 × (−0.00265) × 0.9 = −2.385 × 10^−3
w23(new) = w23(old) + Δw23 = 0.8 − 2.385 × 10^−3 = 0.7976
Δw13 = η δ3 x1 = 1 × (−0.00265) × 0.35 = −9.275 × 10^−4
w13(new) = w13(old) + Δw13 = 0.1 − 9.275 × 10^−4 = 0.0991
Δw24 = η δ4 x2 = 1 × (−0.0082) × 0.9 = −7.38 × 10^−3
w24(new) = w24(old) + Δw24 = 0.6 − 7.38 × 10^−3 = 0.5926
Δw14 = η δ4 x1 = 1 × (−0.0082) × 0.35 = −2.87 × 10^−3
w14(new) = w14(old) + Δw14 = 0.4 − 2.87 × 10^−3 = 0.3971
Δw35 = η δ5 y3 = 1 × (−0.0406) × 0.68 ≈ −0.0276
w35(new) = w35(old) + Δw35 = 0.3 − 0.0276 = 0.2724
Δw45 = η δ5 y4 = 1 × (−0.0406) × 0.66 ≈ −0.0268
w45(new) = w45(old) + Δw45 ≈ 0.9 − 0.0268 ≈ 0.8731
Perform another forward pass:
Forward pass: compute outputs for y3, y4 and y5 using the updated weights.
For H3: a1 = w13 x1 + w23 x2 = 0.0991 × 0.35 + 0.7976 × 0.9 = 0.7525
y3 = 1 / (1 + e^(−0.7525)) = 0.6797
For H4: a2 = w14 x1 + w24 x2 = 0.3971 × 0.35 + 0.5926 × 0.9 = 0.6723
y4 = 1 / (1 + e^(−0.6723)) = 0.6620
For O5: a3 = w35 y3 + w45 y4 = 0.2724 × 0.6797 + 0.8731 × 0.6620 = 0.7631
y5 = 1 / (1 + e^(−0.7631)) = 0.6820 (network output)
Error = y_target − y5 = 0.5 − 0.6820 = −0.1820
The error has decreased in magnitude from −0.19 to −0.182, so the weight update moved the output towards the target.
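A minimal sketch in Python that reproduces the numbers of Problem 01 (inputs x1 = 0.35, x2 = 0.9, target 0.5, η = 1, and the initial weights listed in the solution; the variable names are my own):

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Inputs, target and learning rate from Problem 01
x1, x2, target, eta = 0.35, 0.9, 0.5, 1.0
# Initial weights: input -> hidden (w13, w23, w14, w24), hidden -> output (w35, w45)
w13, w23, w14, w24, w35, w45 = 0.1, 0.8, 0.4, 0.6, 0.3, 0.9

def forward():
    y3 = sigmoid(w13 * x1 + w23 * x2)   # hidden unit H3
    y4 = sigmoid(w14 * x1 + w24 * x2)   # hidden unit H4
    y5 = sigmoid(w35 * y3 + w45 * y4)   # output unit O5
    return y3, y4, y5

y3, y4, y5 = forward()
print(y3, y4, y5)        # ~0.680, ~0.664, ~0.690
print(target - y5)       # ~ -0.19

# Backward pass: delta terms for sigmoid units
d5 = y5 * (1 - y5) * (target - y5)      # ~ -0.0406
d3 = y3 * (1 - y3) * w35 * d5           # ~ -0.00265
d4 = y4 * (1 - y4) * w45 * d5           # ~ -0.0082

# Weight updates: w <- w + eta * delta * input feeding that weight
w13 += eta * d3 * x1; w23 += eta * d3 * x2
w14 += eta * d4 * x1; w24 += eta * d4 * x2
w35 += eta * d5 * y3; w45 += eta * d5 * y4

y3, y4, y5 = forward()
print(y5, target - y5)   # ~0.682, ~ -0.182 (the error is smaller)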
Problem – 02:
For a network with three inputs x1, x2 and x3, hidden neurons 4 and 5, and output neuron 6 (each neuron with a bias θ), assume that the neurons have a sigmoid activation function and perform a forward pass and a backward pass on the network. Assume that the target output y is 1 and the learning rate is 0.9. Perform another forward pass.
Solution:
Forward pass: compute outputs for y4, y5 and y6.
a4 = w14 x1 + w24 x2 + w34 x3 + θ4
δ4 = y4 (1 − y4) w46 δ6
The weights w35 and w15 feed neuron 5, so their updates use δ5 = y5 (1 − y5) w56 δ6 ≈ −0.0065 (not δ4):
Δw35 = η δ5 x3 = 0.9 × (−0.0065) × 1 = −5.85 × 10^−3
w35(new) = w35(old) + Δw35 = 0.2 − 5.85 × 10^−3 ≈ 0.194
Δw15 = η δ5 x1 = 0.9 × (−0.0065) × 1 = −5.85 × 10^−3
w15(new) = w15(old) + Δw15 = −0.3 − 5.85 × 10^−3 ≈ −0.306
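Since the network diagram and initial values for Problem 02 are not reproduced above, here is a generic sketch of the same computation pattern (3 inputs, 2 hidden neurons, 1 output, each neuron with a bias θ). The numeric values below are assumptions taken from a common textbook example that is consistent with the δ4 ≈ −0.0087 shown above, not data confirmed by this document:

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Assumed placeholder values (not confirmed as the actual Problem 02 data)
x = [1.0, 0.0, 1.0]                     # inputs x1, x2, x3
w4 = [0.2, 0.4, -0.5]; theta4 = -0.4    # weights/bias into hidden neuron 4
w5 = [-0.3, 0.1, 0.2]; theta5 = 0.2     # weights/bias into hidden neuron 5
w46, w56, theta6 = -0.3, -0.2, 0.1      # weights/bias into output neuron 6
target, eta = 1.0, 0.9

# Forward pass: a = sum(w * x) + theta, y = sigmoid(a)
y4 = sigmoid(sum(wi * xi for wi, xi in zip(w4, x)) + theta4)
y5 = sigmoid(sum(wi * xi for wi, xi in zip(w5, x)) + theta5)
y6 = sigmoid(w46 * y4 + w56 * y5 + theta6)

# Backward pass: output delta first, then hidden deltas
d6 = y6 * (1 - y6) * (target - y6)
d4 = y4 * (1 - y4) * w46 * d6
d5 = y5 * (1 - y5) * w56 * d6

# Weights feeding neuron 5 are updated with d5 and the inputs x
w5 = [wi + eta * d5 * xi for wi, xi in zip(w5, x)]
theta5 += eta * d5
print(y6, d4, d5, w5)   # with these assumed values: y6 ~ 0.474, d4 ~ -0.0087, d5 ~ -0.0065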