Perceptron
Simple Perceptrons
- Perform supervised learning: correct I/O associations are provided
- Feed-forward networks: connections are one-directional
- One layer: input layer + output layer
Notations
- N: dimension of the input vector
- M: dimension of the output vector
- inputs $x_j$, $j = 1, \ldots, N$
- real outputs $y_i$, $i = 1, \ldots, M$
- weights $w_{ij}$, $i = 1, \ldots, M$, $j = 1, \ldots, N$
- activation function $g$

[Figure: one-layer feed-forward network with inputs $x_1, \ldots, x_4$, outputs $y_1, y_2, y_3$, and $w_{34}$ labeling one connection]

$y_i = g(\mathrm{net}_i) = g\Big(\sum_{j=1}^{N} w_{ij} x_j\Big)$
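As a concrete illustration of this notation, here is a minimal sketch (Python with NumPy; the names `W`, `x`, and `g` mirror the symbols above but the example values are assumptions, not from the slides):

```python
import numpy as np

def forward(W, x, g=np.sign):
    """One-layer perceptron: y_i = g(sum_j W[i, j] * x[j]).

    W : (M, N) weight matrix, one row per output unit
    x : (N,) input vector
    g : activation function (sign by default)
    """
    net = W @ x          # net_i = sum_j w_ij x_j
    return g(net)        # y_i = g(net_i)

# Example: M = 3 outputs, N = 4 inputs
W = np.random.randn(3, 4)
x = np.array([1.0, 0.0, -1.0, 0.5])
print(forward(W, x))
```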
Perceptron Training
- Given:
  - input patterns $\mathbf{x}^u$
  - desired output patterns $O^u$
- How do we adapt the connection weights so that the actual outputs conform to the desired outputs?

$O_i^u = y_i^u, \quad i = 1, \ldots, M$
Simplest case
- inputs from two classes (+ and −)
- binary outputs (−1, 1)
- thresholding: $\mathrm{sgn}(\mathbf{w} \cdot \mathbf{x}^u) = \mathrm{sgn}(w_1 x_1^u + w_2 x_2^u + w_0) = y^u$

[Figure: decision line in the $(x_1, x_2)$ plane with normal vector $(w_1, w_2)$]
Examples
$g(\mathrm{net}) = \mathrm{sgn}(w_1 x_1^u + w_2 x_2^u - 1.5)$, with $w_0 = 1.5$, $w_1 = 1$, $w_2 = 1$ (AND):

x1  x2 |  O
 0   0 | -1
 0   1 | -1
 1   0 | -1
 1   1 |  1

XOR:

x1  x2 |  O
 0   0 | -1
 0   1 |  1
 1   0 |  1
 1   1 | -1

[Figure: in the $(x_1, x_2)$ plane, a single line separates the AND patterns, but no single line separates the XOR patterns]
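A quick sketch (assumed setup, not from the slides) verifying that the weights above realize AND on ±1 outputs, and illustrating via a coarse grid search that no weights of the same form reproduce XOR (an illustration of non-separability, not a proof):

```python
import numpy as np

def perceptron(x1, x2, w1, w2, w0):
    return np.sign(w1 * x1 + w2 * x2 - w0)

inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
XOR = [-1, 1, 1, -1]

# The slide's weights reproduce AND exactly.
print([int(perceptron(x1, x2, 1, 1, 1.5)) for x1, x2 in inputs])  # [-1, -1, -1, 1]

# A coarse brute-force search finds no (w1, w2, w0) realizing XOR.
grid = np.linspace(-2, 2, 41)
found = any(
    all(int(perceptron(x1, x2, w1, w2, w0)) == t
        for (x1, x2), t in zip(inputs, XOR))
    for w1 in grid for w2 in grid for w0 in grid
)
print(found)  # False: XOR is not linearly separable
```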
Linear separability
Is it at all possible to learn the desired I/O associations?
- Yes, if weights $w_{ij}$ can be found such that

  $O_i^u = \mathrm{sgn}\Big(\sum_{j=1}^{N} w_{ij} x_j^u - w_{i0}\Big) = y_i^u$ for all $i$ and $u$

- No, otherwise
Perceptron Learning
Linearly separable or not, how do we find the set of weights?
- Using labeled samples:
  - closed-form solution
  - iterative solutions
Closed Form Solution
Stack the training patterns (with a constant 1 for the bias) into a matrix and solve the least-squares system:

$\begin{pmatrix} x_1^1 & \cdots & x_n^1 & 1 \\ x_1^2 & \cdots & x_n^2 & 1 \\ \vdots & & \vdots & \vdots \\ x_1^u & \cdots & x_n^u & 1 \end{pmatrix} \begin{pmatrix} w_1 \\ w_2 \\ \vdots \\ w_0 \end{pmatrix} = \begin{pmatrix} O^1 \\ O^2 \\ \vdots \\ O^u \end{pmatrix}$

$A W = B, \qquad W = (A^T A)^{-1} A^T B$
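A sketch of this closed-form solution in NumPy (the AND data below is assumed for illustration); in practice the pseudo-inverse or `np.linalg.lstsq` is preferred over forming $(A^T A)^{-1}$ explicitly:

```python
import numpy as np

# Training patterns (rows) for AND on {0,1} inputs, targets in {-1,+1}
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
B = np.array([-1, -1, -1, 1], dtype=float)

# Append the constant 1 column for the bias term: A = [X | 1]
A = np.hstack([X, np.ones((X.shape[0], 1))])

# W = (A^T A)^{-1} A^T B, computed stably via the pseudo-inverse
W = np.linalg.pinv(A) @ B

print(W)               # [1.0, 1.0, -1.5] = (w1, w2, w0): the AND weights above
print(np.sign(A @ W))  # [-1, -1, -1, 1] after thresholding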
Perceptron Learning Rule (cont.)
If a positive pattern is misclassified as negative ($\mathbf{w} \cdot \mathbf{x} < 0$), move the weight vector toward the pattern.

[Figure: weight vector $(w_1, w_2)$ rotating toward the misclassified positive pattern in the $(x_1, x_2)$ plane]
Perceptron Learning Rule (cont.)
If a negative pattern is misclassified as positive ($\mathbf{w} \cdot \mathbf{x} > 0$), move the weight vector away from the pattern.

[Figure: weight vector $(w_1, w_2)$ rotating away from the misclassified negative pattern in the $(x_1, x_2)$ plane]
$w^{(k+1)} = \begin{cases} w^{(k)} + c\,\mathbf{x} & \text{if } w^{(k)} \cdot \mathbf{x} < 0 \text{ and } \mathbf{x} \in + \\ w^{(k)} - c\,\mathbf{x} & \text{if } w^{(k)} \cdot \mathbf{x} > 0 \text{ and } \mathbf{x} \in - \\ w^{(k)} & \text{otherwise} \end{cases}$
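A minimal sketch of this update rule (assumed setup): with labels in {−1, +1} the two cases fold into one line, since the update is $c \cdot \text{label} \cdot \mathbf{x}$ whenever $\text{label} \cdot (w \cdot \mathbf{x})$ is negative. The boundary case $w \cdot \mathbf{x} = 0$ is also treated as a mistake here so training can leave the all-zero start:

```python
import numpy as np

def perceptron_train(X, labels, c=1.0, epochs=100):
    """Iterative perceptron rule: on a mistake, move w by c * label * x.

    X      : (P, N) patterns, already including a bias column of 1s
    labels : (P,) targets in {-1, +1}
    """
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        mistakes = 0
        for x, label in zip(X, labels):
            if label * (w @ x) <= 0:     # misclassified (or on the boundary)
                w += c * label * x       # w + cx for x in +, w - cx for x in -
                mistakes += 1
        if mistakes == 0:                # converged: all patterns correct
            break
    return w

# AND with a bias column
X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]], dtype=float)
labels = np.array([-1, -1, -1, 1])
print(perceptron_train(X, labels))
```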
Perceptron Learning Rule (cont.)
- The weight is a signed linear combination of training points.
- Only the informative points are used (those on which the classifier made a mistake; the rule is mistake-driven).
- This is VERY important: it leads later to the generalization to Support Vector Machines.
Comparison
- Version space: the $(w_1, w_2)$ space of all feasible solutions
- Perceptron learning: greedy gradient descent that often ends up at the boundary of the version space, with little margin for error
- SVM learning: the center of the largest sphere embedded in the version space (maximum margin)
- Bayes point machine: the centroid of the version space

[Figure: version space in the $(w_1, w_2)$ plane, comparing the perceptron solution (near the boundary), the SVM solution (center of the largest embedded sphere), and the Bayes point (centroid)]
Perceptron Usage Rules
After the weights have been determined, the output is computed from inner products with the training points alone:

$y = \mathbf{w} \cdot \mathbf{x} = \Big(\sum_i \alpha_i y_i \mathbf{x}_i\Big) \cdot \mathbf{x} = \sum_i \alpha_i y_i\, (\mathbf{x}_i \cdot \mathbf{x})$

where $\alpha_i$ counts the updates made on training point $\mathbf{x}_i$ and $y_i$ is its label.
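A sketch of this dual form (assumed setup): training tracks only the mistake counts `alpha`, the weight vector stays implicit as the mistake-weighted sum of labeled points, and prediction needs only inner products $\mathbf{x}_i \cdot \mathbf{x}$, which is the property kernel methods such as SVMs later exploit:

```python
import numpy as np

def dual_perceptron_train(X, labels, epochs=100):
    """Dual perceptron: track per-point mistake counts alpha instead of w."""
    alpha = np.zeros(len(X))
    for _ in range(epochs):
        for i, (x, label) in enumerate(zip(X, labels)):
            # Implicit w = sum_k alpha_k * labels_k * X_k
            score = np.sum(alpha * labels * (X @ x))
            if label * score <= 0:
                alpha[i] += 1            # mistake-driven update
    return alpha

def dual_predict(alpha, X, labels, x):
    # y = sum_i alpha_i y_i (x_i . x), then threshold
    return np.sign(np.sum(alpha * labels * (X @ x)))

X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]], dtype=float)
labels = np.array([-1, -1, -1, 1])
alpha = dual_perceptron_train(X, labels)
print([int(dual_predict(alpha, X, labels, x)) for x in X])
```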
Hebb's Learning Rule
Synapse strength should be increased when both the pre- and post-synaptic neurons fire vigorously. For binary ($\pm 1$) outputs:

$w_{ij}^{new} = w_{ij}^{old} + \Delta w_{ij}$

$\Delta w_{ij}^u = \begin{cases} 2\eta\, O_i^u x_j^u & \text{if } y_i^u \neq O_i^u \\ 0 & \text{otherwise} \end{cases}$

$\Delta w_{ij}^u = \eta\, (1 - y_i^u O_i^u)\, O_i^u x_j^u = \eta\, \big(O_i^u - (O_i^u)^2\, y_i^u\big)\, x_j^u = \eta\, \underbrace{(O_i^u - y_i^u)}_{\delta}\, x_j^u$

(using $(O_i^u)^2 = 1$; the factor $1 - y_i^u O_i^u$ equals 2 on a mistake and 0 otherwise)
Case 1: $O = -1$, $y = 1$: $\Delta w = -2\eta\,\mathbf{x}$, so $w^{new} = w^{old} - 2\eta\,\mathbf{x}$.
Case 2: $O = 1$, $y = -1$: $\Delta w = +2\eta\,\mathbf{x}$, so $w^{new} = w^{old} + 2\eta\,\mathbf{x}$.

[Figure: $w^{old}$, $w^{new}$, and $\pm 2\eta\,\mathbf{x}$ in the $(x_1, x_2)$ plane for both cases]
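Both cases reduce to one line of code (a sketch; $\eta$ and the example numbers are assumed), since $\Delta w = \eta\,(O - y)\,\mathbf{x}$ is zero when the output is correct and $\pm 2\eta\,\mathbf{x}$ on a mistake:

```python
import numpy as np

def hebbian_delta_update(w, x, O, eta=0.1):
    """Mistake-driven Hebbian update for +-1 outputs: dw = eta * (O - y) * x."""
    y = np.sign(w @ x)
    return w + eta * (O - y) * x   # 0 if correct, +-2*eta*x on a mistake

w = np.array([1.0, 0.5])
x = np.array([0.8, -0.2])
print(hebbian_delta_update(w, x, O=-1))  # Case 1: y=+1, O=-1 -> w - 2*eta*x
```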
LMS (Widrow-Hoff, Delta) Rule
- Not restricted to binary outputs
- Gradient search on the squared output error:

$E(\mathbf{w}) = \frac{1}{2} \sum_u \sum_i (O_i^u - y_i^u)^2 = \frac{1}{2} \sum_u \sum_i \Big( O_i^u - g\big(\textstyle\sum_{j=1}^{N} w_{ij} x_j^u\big) \Big)^2$

$\frac{\partial E(\mathbf{w})}{\partial w_{ij}} = -\sum_u \big(O_i^u - g(\mathrm{net}_i^u)\big)\, g'(\mathrm{net}_i^u)\, x_j^u$

$w_{ij}^{new} = w_{ij}^{old} + \Delta w_{ij}, \qquad \Delta w_{ij} = -\eta\, \frac{\partial E(\mathbf{w})}{\partial w_{ij}}$
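A sketch of one LMS gradient step in matrix form (tanh is an assumed choice of differentiable $g$; the toy data below is likewise an assumption):

```python
import numpy as np

def lms_epoch(W, X, O, eta=0.1):
    """One gradient step on E(w) = 1/2 sum_u sum_i (O_i^u - y_i^u)^2.

    W : (M, N) weights   X : (P, N) patterns   O : (P, M) real-valued targets
    """
    net = X @ W.T                      # net_i^u
    y = np.tanh(net)                   # y_i^u = g(net_i^u)
    g_prime = 1.0 - y ** 2             # g'(net) for g = tanh
    delta = (O - y) * g_prime          # (O - g(net)) g'(net)
    grad = -delta.T @ X                # dE/dw_ij, summed over patterns u
    return W - eta * grad              # w_new = w_old - eta * dE/dw

# Toy usage: 1 output, 2 inputs + bias column, soft targets in (-1, 1)
X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]], dtype=float)
O = np.array([[-0.9], [-0.9], [-0.9], [0.9]])
W = np.zeros((1, 3))
for _ in range(200):
    W = lms_epoch(W, X, O)
print(W, np.tanh(X @ W.T).round(2))
```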
Nothing but the Chain Rule

$\frac{\partial E(\mathbf{w})}{\partial w_{ij}} = \sum_u \frac{1}{2} \cdot \frac{\partial (O_i^u - y_i^u)^2}{\partial (O_i^u - y_i^u)} \cdot \frac{\partial (O_i^u - y_i^u)}{\partial y_i^u} \cdot \frac{\partial y_i^u}{\partial \mathrm{net}_i^u} \cdot \frac{\partial \mathrm{net}_i^u}{\partial w_{ij}}$

With the factor $\frac{1}{2}$ from $E$, the product evaluates to $-(O_i^u - y_i^u)\, g'(\mathrm{net}_i^u)\, x_j^u$, matching the gradient on the previous slide.
$O = g(\mathrm{net}) = \mathrm{sgn}(w_1 x_1 + w_2 x_2 + b)$, with inputs $x_1 = x$, $x_2 = y$ and learned weights $w_1 = 0.4299$, $w_2 = -0.2793$, $b = -0.1312$.

[Figure: two-input, one-output perceptron diagram]
[Figures: final training results; error vs. training epoch]
$y = g(\mathrm{net}) = \mathrm{sgn}(w_1 x_1 + w_2 x_2 + w_3 x_3 + b)$, with inputs $x_1 = x$, $x_2 = y$, $x_3 = z$ and learned weights $w_1 = 0.4232$, $w_2 = -0.7411$, $w_3 = -0.3196$, $b = 0.7550$.

[Figure: three-input, one-output perceptron diagram]

[Figures: final training results; error vs. training epoch]
A two-output perceptron:

$y_1 = g(w_{11} x_1 + w_{12} x_2 + b_1)$
$y_2 = g(w_{21} x_1 + w_{22} x_2 + b_2)$

[Figure: two-input, two-output network diagram with outputs $O_1$, $O_2$ and weights $w_{11}$, $w_{12}$, $w_{21}$, $w_{22}$]

[Figures: final training results; error vs. training epoch]
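In matrix form this is just the $M = 2$ case of the earlier forward pass; a sketch (the weight values here are placeholders, not the slides' trained weights):

```python
import numpy as np

# Rows of W are (w_i1, w_i2); b holds (b_1, b_2)
W = np.array([[0.5, -0.3],
              [-0.2, 0.8]])
b = np.array([0.1, -0.4])

def two_output_perceptron(x, g=np.sign):
    # y_i = g(w_i1 x_1 + w_i2 x_2 + b_i)
    return g(W @ x + b)

print(two_output_perceptron(np.array([1.0, 0.0])))  # [ 1. -1.]
```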
[Figures: final training results; error vs. training epoch]