
CS3244 Machine Learning Semester 1, 2012/13


Solution to Tutorial 4

1. What are the values of weights $w_0$, $w_1$, and $w_2$ for the perceptron whose decision surface is illustrated in Figure 4.3? Assume the surface crosses the $x_1$ axis at $-1$, and the $x_2$ axis at 2.
Answer:
The line for the decision surface corresponds to the equation $x_2 = 2x_1 + 2$, and since all points above the line should be classified as positive, we have $x_2 - 2x_1 - 2 > 0$. Hence $w_0 = -2$, $w_1 = -2$, and $w_2 = 1$.
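
As a quick numeric check (a small Python sketch that is not part of the original solution; the test points are made up), these weights classify points above the line $x_2 = 2x_1 + 2$ as positive and points below it as negative:

# Recovered perceptron: w0 = -2, w1 = -2, w2 = 1.
# Points above the line x2 = 2*x1 + 2 should give a positive activation.
def perceptron(x1, x2, w0=-2.0, w1=-2.0, w2=1.0):
    return 1 if w0 + w1 * x1 + w2 * x2 > 0 else -1

print(perceptron(0.0, 3.0))    # above the line      -> 1
print(perceptron(0.0, 1.0))    # below the line      -> -1
print(perceptron(-1.0, 0.5))   # just above (-1, 0)  -> 1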

2. Consider two perceptrons defined by the threshold expression $w_0 + w_1 x_1 + w_2 x_2 > 0$.
Perceptron A has weight values

$w_0 = 1$, $w_1 = 2$, $w_2 = 1$

and Perceptron B has weight values

$w_0 = 0$, $w_1 = 2$, $w_2 = 1$

True or false? Perceptron A is more-general-than perceptron B. (More-general-than is defined in Chapter 2.)
Answer:
True. Perceptron A is more general than B: the two perceptrons share $w_1$ and $w_2$, and A's threshold sum exceeds B's by the constant $w_0 = 1$, so any instance classified positive by B is also classified positive by A. In the notation of the more-general-than definition in Chapter 2:

$(\forall x \in X)\,[(B(x) = 1) \rightarrow (A(x) = 1)]$

[Figure: the two parallel decision lines A and B in the $(x_1, x_2)$ plane; A's line lies below B's, so the positive (+) region above B's line is contained in the positive region above A's line.]

3. Derive a gradient descent training rule for a single unit with output $o$, where

$o = w_0 + w_1 x_1 + w_1 x_1^2 + \ldots + w_n x_n + w_n x_n^2$
Answer:
First, the error function is defined as:

$E(\vec{w}) = \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2$

The update rule is the same, namely:

$w_i := w_i + \Delta w_i$, where $\Delta w_i = -\eta \frac{\partial E}{\partial w_i}$

For $w_0$:

$\frac{\partial E}{\partial w_0} = \frac{\partial}{\partial w_0} \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2 = \frac{1}{2} \sum_{d \in D} 2 (t_d - o_d) \frac{\partial}{\partial w_0} (t_d - o_d) = \sum_{d \in D} (t_d - o_d)(-1)$

Thus

$\Delta w_0 = \eta \sum_{d \in D} (t_d - o_d)$

For $w_1, w_2, \ldots, w_n$:

$\frac{\partial E}{\partial w_i} = \frac{\partial}{\partial w_i} \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2 = \frac{1}{2} \sum_{d \in D} 2 (t_d - o_d) \frac{\partial}{\partial w_i} (t_d - o_d) = \sum_{d \in D} (t_d - o_d)\,(-(x_{id} + x_{id}^2))$

Thus

$\Delta w_i = \eta \sum_{d \in D} (t_d - o_d)(x_{id} + x_{id}^2)$

4. Consider a two-layer feedforward ANN with two inputs $a$ and $b$, one hidden unit $c$, and one output unit $d$. This network has five weights ($w_{ca}$, $w_{cb}$, $w_{c0}$, $w_{dc}$, $w_{d0}$), where $w_{x0}$ represents the threshold weight for unit $x$. Initialize these weights to the values (0.1, 0.1, 0.1, 0.1, 0.1), then give their values after each of the first two training iterations of the BACKPROPAGATION algorithm. Assume learning rate $\eta = 0.3$, momentum $\alpha = 0.9$, incremental weight updates, and the following training examples:

a b d
1 0 1
0 1 0
Answer:
The network and the sigmoid activation function are as follows:

[Figure: inputs a and b feed hidden unit c through weights $w_{ca}$ and $w_{cb}$; unit c feeds output unit d through $w_{dc}$; $w_{c0}$ and $w_{d0}$ are the threshold weights of units c and d.]

$\sigma(y) = \frac{1}{1 + e^{-y}}$

Training example 1:
The outputs of the two neurons, noting that a = 1 and b = 0:

$o_c = \sigma(0.1 \times 1 + 0.1 \times 0 + 0.1 \times 1) = \sigma(0.2) = 0.5498$
$o_d = \sigma(0.1 \times 0.5498 + 0.1 \times 1) = \sigma(0.15498) = 0.53867$

The error terms for the two neurons, noting that d = 1:

$\delta_d = o_d (1 - o_d)(t - o_d) = 0.53867 (1 - 0.53867)(1 - 0.53867) = 0.1146$
$\delta_c = o_c (1 - o_c)\, w_{dc}\, \delta_d = 0.5498 (1 - 0.5498) \times 0.1 \times 0.1146 = 0.002836$

Compute the correction terms as follows, noting that a = 1, b = 0 and $\eta = 0.3$:

$\Delta w_{d0} = 0.3 \times 0.1146 \times 1 = 0.0342$
$\Delta w_{dc} = 0.3 \times 0.1146 \times 0.5498 = 0.0189$
$\Delta w_{c0} = 0.3 \times 0.002836 \times 1 = 0.000849$
$\Delta w_{ca} = 0.3 \times 0.002836 \times 1 = 0.000849$
$\Delta w_{cb} = 0.3 \times 0.002836 \times 0 = 0$

and the new weights become:

$w_{d0} = 0.1 + 0.0342 = 0.1342$
$w_{dc} = 0.1 + 0.0189 = 0.1189$
$w_{c0} = 0.1 + 0.000849 = 0.100849$
$w_{ca} = 0.1 + 0.000849 = 0.100849$
$w_{cb} = 0.1 + 0 = 0.1$

Training example 2:
The outputs of the two neurons, noting that a = 0 and b = 1:

$o_c = \sigma(0.100849 \times 0 + 0.1 \times 1 + 0.100849 \times 1) = \sigma(0.200849) = 0.55$
$o_d = \sigma(0.1189 \times 0.55 + 0.1342 \times 1) = \sigma(0.1996) = 0.5497$

The error terms for the two neurons, noting that d = 0:

$\delta_d = 0.5497 (1 - 0.5497)(0 - 0.5497) = -0.1361$
$\delta_c = 0.55 (1 - 0.55) \times 0.1189 \times (-0.1361) = -0.004$

Compute the correction terms as follows, noting that a = 0, b = 1, $\eta = 0.3$ and $\alpha = 0.9$:

$\Delta w_{d0} = 0.3 \times (-0.1361) \times 1 + 0.9 \times 0.0342 = -0.01$
$\Delta w_{dc} = 0.3 \times (-0.1361) \times 0.55 + 0.9 \times 0.0189 = -0.0055$
$\Delta w_{c0} = 0.3 \times (-0.004) \times 1 + 0.9 \times 0.000849 = -0.0004$
$\Delta w_{ca} = 0.3 \times (-0.004) \times 0 + 0.9 \times 0.000849 = 0.00076$
$\Delta w_{cb} = 0.3 \times (-0.004) \times 1 + 0.9 \times 0 = -0.0012$

and the new weights become:

$w_{d0} = 0.1342 - 0.01 = 0.1242$
$w_{dc} = 0.1189 - 0.0055 = 0.1134$
$w_{c0} = 0.100849 - 0.0004 = 0.100449$
$w_{ca} = 0.100849 + 0.00076 = 0.1016$
$w_{cb} = 0.1 - 0.0012 = 0.0988$
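
The two iterations above can be reproduced with a short script (a Python sketch of incremental backpropagation with momentum for this particular 2-1-1 network; the variable names are mine, not from Table 4.2). The printed weights should match the hand-computed values up to small rounding differences:

import math

def sigmoid(y):
    return 1.0 / (1.0 + math.exp(-y))

# Weights w_ca, w_cb, w_c0, w_dc, w_d0, all initialised to 0.1.
w = {"ca": 0.1, "cb": 0.1, "c0": 0.1, "dc": 0.1, "d0": 0.1}
prev = {k: 0.0 for k in w}               # previous updates, for the momentum term
eta, alpha = 0.3, 0.9

for a, b, t in [(1, 0, 1), (0, 1, 0)]:   # the two training examples (a, b, target)
    # Forward pass.
    o_c = sigmoid(w["ca"] * a + w["cb"] * b + w["c0"])
    o_d = sigmoid(w["dc"] * o_c + w["d0"])
    # Error terms for sigmoid units.
    delta_d = o_d * (1 - o_d) * (t - o_d)
    delta_c = o_c * (1 - o_c) * w["dc"] * delta_d
    # Incremental updates with momentum: dw = eta*delta*input + alpha*previous dw.
    grads = {"d0": delta_d, "dc": delta_d * o_c,
             "c0": delta_c, "ca": delta_c * a, "cb": delta_c * b}
    for k, g in grads.items():
        dw = eta * g + alpha * prev[k]
        w[k] += dw
        prev[k] = dw
    print({k: round(v, 4) for k, v in w.items()})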



5. Revise the BACKPROPAGATION algorithm in Table 4.2 so that it operates on units using the squashing function tanh in place of the sigmoid function. That is, assume the output of a single unit is $o = \tanh(\vec{w} \cdot \vec{x})$. Give the weight update rule for output layer weights and hidden layer weights. Hint: $\tanh'(x) = 1 - \tanh^2(x)$.
Answer:
Steps T4.3 and T4.4 in Table 4.2 will become as follows, respectively:

$\delta_k \leftarrow (1 - o_k^2)(t_k - o_k)$

$\delta_h \leftarrow (1 - o_h^2) \sum_{k \in outputs} w_{kh}\, \delta_k$

The weight update step itself, $w_{ji} \leftarrow w_{ji} + \eta\, \delta_j\, x_{ji}$, is unchanged.
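
In code, the only change from the sigmoid case is the derivative factor: $o(1 - o)$ becomes $(1 - o^2)$. A minimal sketch of the two revised error terms (the function names are mine, not from Table 4.2):

def output_delta_tanh(o_k, t_k):
    # Revised step T4.3: tanh'(net) expressed through the output is (1 - o_k^2).
    return (1.0 - o_k**2) * (t_k - o_k)

def hidden_delta_tanh(o_h, w_kh, deltas_k):
    # Revised step T4.4: w_kh are the weights from hidden unit h to each
    # downstream output k, deltas_k the corresponding error terms.
    return (1.0 - o_h**2) * sum(w * d for w, d in zip(w_kh, deltas_k))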



6. Consider the alternative error function described in Section 4.8.1:

$E(\vec{w}) = \frac{1}{2} \sum_{d \in D} \sum_{k \in outputs} (t_{kd} - o_{kd})^2 + \gamma \sum_{i,j} w_{ji}^2$

Derive the gradient descent update rule for this definition of E. Show that it can be implemented by multiplying each weight by some constant before performing the standard gradient descent update given in Table 4.2.
Answer:

$w_{ji} \leftarrow w_{ji} + \Delta w_{ji}$, where $\Delta w_{ji} = -\eta \frac{\partial E(\vec{w})}{\partial w_{ji}}$

$\frac{\partial E(\vec{w})}{\partial w_{ji}} = \frac{\partial}{\partial w_{ji}} \left[ \frac{1}{2} \sum_{d \in D} \sum_{k \in outputs} (t_{kd} - o_{kd})^2 \right] + \frac{\partial}{\partial w_{ji}} \left[ \gamma \sum_{i,j} w_{ji}^2 \right]$

The first term on the R.H.S. of the above equation can be derived in the same manner as in equation (4.27), while the second term contributes $2\gamma w_{ji}$. For output nodes, this leads to:

$\frac{\partial E(\vec{w})}{\partial w_{ji}} = -(t_j - o_j)\, o_j (1 - o_j)\, x_{ji} + 2\gamma w_{ji}$

$w_{ji} \leftarrow w_{ji} + \eta (t_j - o_j)\, o_j (1 - o_j)\, x_{ji} - 2\eta\gamma w_{ji} = \beta w_{ji} + \eta\, \delta_j\, x_{ji}$

where $\beta = 1 - 2\eta\gamma$ and $\delta_j = (t_j - o_j)\, o_j (1 - o_j)$.

Similarly, for hidden units, we can derive:

$w_{ji} \leftarrow \beta w_{ji} + \eta\, \delta_j\, x_{ji}$

where $\beta = 1 - 2\eta\gamma$ and $\delta_j = o_j (1 - o_j) \sum_{k \in Downstream(j)} \delta_k w_{kj}$.

The above shows the update rule can be implemented by multiplying each weight by the constant $\beta = 1 - 2\eta\gamma$ before performing the standard gradient descent update given in Table 4.2.
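
This is the familiar weight-decay update. A minimal sketch of the per-example output-layer step under the assumptions above (sigmoid output unit; the function name and the default $\eta$, $\gamma$ values are made up):

def weight_decay_update(w_ji, x_ji, o_j, t_j, eta=0.1, gamma=0.01):
    # Gradient descent step for the regularised error function of Question 6:
    # shrink the weight by beta = 1 - 2*eta*gamma, then apply the standard
    # Table 4.2 update eta * delta_j * x_ji.
    delta_j = (t_j - o_j) * o_j * (1 - o_j)
    beta = 1 - 2 * eta * gamma
    return beta * w_ji + eta * delta_j * x_ji
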
7. Assume the following error function:

$E(w) = \frac{1}{2} \alpha w^2 - 2\beta w + \gamma$

where $\alpha$, $\beta$ and $\gamma$ are constants. The weight $w$ is updated according to gradient descent with a positive learning rate $\eta$. Write down the update equation for $w(k+1)$ given $w(k)$. Find the optimum weight $w$ that gives the minimal error $E(w)$. What is the value of the minimal $E(w)$? (8 marks)

Answer:

$\frac{\partial E}{\partial w} = \alpha w - 2\beta$

$\Delta w = -\eta \frac{\partial E}{\partial w} = -\eta(\alpha w - 2\beta)$

$w(k+1) = w(k) + \Delta w = w(k) - \eta(\alpha\, w(k) - 2\beta) = (1 - \eta\alpha)\, w(k) + 2\eta\beta$

When $E(w)$ becomes the smallest, $\frac{\partial E}{\partial w} = 0$.

Thus, $w_{optimal} = \frac{2\beta}{\alpha}$

Minimal error:

$E(w_{optimal}) = \frac{1}{2}\alpha \left(\frac{2\beta}{\alpha}\right)^2 - 2\beta \cdot \frac{2\beta}{\alpha} + \gamma = \frac{2\beta^2}{\alpha} - \frac{4\beta^2}{\alpha} + \gamma = \gamma - \frac{2\beta^2}{\alpha}$
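
A small numeric check of the update equation and the optimum, using the error function as written above (made-up constants $\alpha = 2$, $\beta = 0.5$, $\gamma = 1$, and a learning rate chosen so that $|1 - \eta\alpha| < 1$):

def E(w, a, b, c):
    # E(w) = (1/2)*a*w**2 - 2*b*w + c
    return 0.5 * a * w**2 - 2 * b * w + c

def gradient_step(w, a, b, eta):
    # w(k+1) = w(k) - eta * dE/dw = (1 - eta*a)*w(k) + 2*eta*b
    return w - eta * (a * w - 2 * b)

a, b, c, eta = 2.0, 0.5, 1.0, 0.1
w = 0.0
for _ in range(200):
    w = gradient_step(w, a, b, eta)

print(w, 2 * b / a)                       # converges to w_optimal = 2*beta/alpha = 0.5
print(E(w, a, b, c), c - 2 * b**2 / a)    # minimal error gamma - 2*beta^2/alpha = 0.75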

8. WEKA outputs the following confusion matrix after training a J48 decision tree classifier with the contact-lenses dataset. (a) Count the number of True Positives, True Negatives, False Positives and False Negatives for each of the three classes, i.e. soft, hard and none. (b) Calculate the TP rate (Recall), FP rate, Precision and F-measure for each class.

  a  b  c   <-- classified as
  4  0  1 |  a = soft
  0  1  3 |  b = hard
  1  2 12 |  c = none
Answer:

soft:
(a) TP = 4
    TN = 18
    FP = 1
    FN = 1

(b) TP rate = Recall = TP / (TP + FN) = 4/5 = 0.8
    FP rate = FP / (FP + TN) = 1/19 = 0.053
    Precision = TP / (TP + FP) = 4/5 = 0.8
    F-Measure = 2 × 0.8 × 0.8 / (0.8 + 0.8) = 0.8

hard:
(a) TP = 1
    TN = 18
    FP = 2
    FN = 3

(b) TP rate = Recall = TP / (TP + FN) = 1/4 = 0.25
    FP rate = FP / (FP + TN) = 2/20 = 0.1
    Precision = TP / (TP + FP) = 1/3 = 0.333
    F-Measure = 2 × 0.25 × 0.333 / (0.25 + 0.333) = 0.286

none:
(a) TP = 12
    TN = 5
    FP = 4
    FN = 3

(b) TP rate = Recall = TP / (TP + FN) = 12/15 = 0.8
    FP rate = FP / (FP + TN) = 4/9 = 0.444
    Precision = TP / (TP + FP) = 12/16 = 0.75
    F-Measure = 2 × 0.8 × 0.75 / (0.8 + 0.75) = 0.774
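
The per-class counts and metrics can also be computed directly from the 3×3 matrix (a small Python sketch, not WEKA output; the function name is mine):

def per_class_metrics(cm, labels):
    # cm[i][j] = number of instances of class i classified as class j.
    total = sum(sum(row) for row in cm)
    results = {}
    for i, label in enumerate(labels):
        tp = cm[i][i]
        fn = sum(cm[i]) - tp                    # actual i, predicted something else
        fp = sum(row[i] for row in cm) - tp     # predicted i, actually something else
        tn = total - tp - fn - fp
        recall = tp / (tp + fn)
        fp_rate = fp / (fp + tn)
        precision = tp / (tp + fp)
        f_measure = 2 * precision * recall / (precision + recall)
        results[label] = {"TP": tp, "TN": tn, "FP": fp, "FN": fn,
                          "Recall": round(recall, 3), "FP rate": round(fp_rate, 3),
                          "Precision": round(precision, 3), "F": round(f_measure, 3)}
    return results

cm = [[4, 0, 1], [0, 1, 3], [1, 2, 12]]         # rows/columns: soft, hard, none
print(per_class_metrics(cm, ["soft", "hard", "none"]))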
