
Chapter 11 Neural Nets (Python)

Chapter 11 discusses neural networks and their application in data mining for business analytics using Python. It explains the structure of neural networks, including input, hidden, and output layers, and details the training process involving weight adjustments and backpropagation. The chapter also highlights the advantages and disadvantages of neural networks, particularly in relation to deep learning and its effectiveness in image and voice recognition.


Chapter 11 – Neural Nets

Data Mining for Business Analytics in Python
Import Functionality Needed

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from dmba import classificationSummary
Basic Idea
⚫ Combine input information in a complex & flexible neural net “model”

⚫ Model “coefficients” are continually tweaked in an iterative process

⚫ The network’s interim performance in classification and prediction informs successive tweaks
Network Structure
⚫ Multiple layers
  ⚫ Input layer (raw observations)
  ⚫ Hidden layers
  ⚫ Output layer
⚫ Nodes
⚫ Weights (like coefficients, subject to iterative adjustment)
⚫ Bias values (also subject to iterative adjustment)
Schematic Diagram
Tiny Example
Predict consumer opinion of a cheese product based on fat and salt content

Obs.  Fat Score  Salt Score  Opinion
1     0.2        0.9         like
2     0.1        0.1         dislike
3     0.2        0.4         dislike
4     0.2        0.5         dislike
5     0.4        0.5         like
6     0.3        0.8         like
Example – Using fat & salt content to predict consumer acceptance of cheese

Rectangles are nodes, the w_ij on the arrows are weights, and the θ_j are node bias values
Moving Through the Network
The Input Layer

⚫ For the input layer, input = output

⚫ E.g., for record #1:
  Fat input = output = 0.2
  Salt input = output = 0.9

⚫ Output of the input layer = input into the hidden layer
The Hidden Layer
⚫ In this example, it has 3 nodes
⚫ Each node receives as input the output of all input nodes
⚫ The output of each hidden node is some function of the weighted sum of its inputs
The Weights
⚫ The weights θ (theta) and w are typically initialized to random values in the range -0.05 to +0.05

⚫ This is equivalent to a model with random prediction (in other words, no predictive value)

⚫ These initial weights are used in the first round of training
Output of Node 3 if g is a Logistic Function

Initial Pass of the Network
Node outputs (shown at right within each node), using the first record in the tiny example and a logistic function

Calculations at hidden node 3:
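The slide’s formula and worked numbers appear only as an image; as a minimal sketch of the calculation (the bias and weight values below are illustrative assumptions, not taken from the slide), the output of hidden node 3 for record #1 is:

import numpy as np

# Illustrative (assumed) bias and weights feeding hidden node 3
theta_3 = -0.3
w_13, w_23 = 0.05, 0.01

# Record #1 of the tiny example
fat, salt = 0.2, 0.9

# Weighted sum of inputs plus bias, passed through the logistic function g
z = theta_3 + w_13 * fat + w_23 * salt
output_3 = 1 / (1 + np.exp(-z))   # logistic (sigmoid) activation
print(output_3)                   # about 0.43 with these illustrative values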
Output Layer
The output of the last hidden layer becomes the input for the output layer

Mapping the output to a classification

Output = 0.506, just slightly in excess of 0.5, so the classification, at this early stage, is “like”
Relation to Linear Regression
A net with a single output node and no hidden layers, where g is the identity function, takes the same form as a linear regression model
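Written out (the general form implied by the slide), with identity activation the single output node computes

predicted y = θ + w_1·x_1 + w_2·x_2 + … + w_p·x_p

so θ plays the role of the intercept and the w_i play the role of the regression coefficients.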
Training the Model
Preprocessing Steps

⚫ Scale variables to a 0-1 range (a pandas/scikit-learn sketch follows below)

⚫ For categorical variables, create dummy variables

⚫ Transform (e.g., log) skewed variables
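A minimal sketch of these preprocessing steps, assuming a pandas DataFrame with a numeric column, a categorical column, and a skewed column (the column names here are hypothetical):

import numpy as np
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Hypothetical example frame; the column names are for illustration only
df = pd.DataFrame({'Income': [30, 80, 120],
                   'Region': ['East', 'West', 'East'],
                   'Spending': [1, 10, 1000]})

# Scale a numeric variable to the 0-1 range
df[['Income']] = MinMaxScaler().fit_transform(df[['Income']])

# Create dummy variables for a categorical predictor
df = pd.get_dummies(df, columns=['Region'])

# Log-transform a skewed variable
df['Spending'] = np.log(df['Spending'])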
Initial Pass Through Network
⚫ Goal: find weights that yield the best predictions
⚫ The process described above is repeated for all records
⚫ At each record, compare the prediction to the actual value
⚫ The difference is the error for the output node
⚫ The error is propagated back and distributed to all the hidden nodes and used to update their weights
Back Propagation (“back-prop”)

⚫ Output from output node k:
⚫ Error associated with that node:
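The two formulas are shown in the slides only as images; in the standard form used in this setting (a sketch, stated as an assumption rather than copied from the slide), they are

output_k = g( θ_k + Σ_j w_jk · output_j )

err_k = output_k · (1 − output_k) · (actual_k − output_k)

where the output_j are the outputs of the nodes feeding node k and g is the logistic function.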
Error is Used to Update Weights

l = a constant between 0 and 1 that reflects the “learning rate” or “weight decay parameter”
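The update rule itself appears only as an image; in the simplified form consistent with the error term above (a sketch, with the caveat that many formulations also scale the weight update by the sending node’s output), it is

w_jk^new = w_jk^old + l · err_k
θ_k^new = θ_k^old + l · err_k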
Why It Works
⚫ Big errors lead to big changes in weights
⚫ Small errors leave weights relatively unchanged
⚫ Over thousands of updates, a given weight keeps changing until the error associated with it is negligible, at which point weights change little
Python Packages for Neural Nets

Most common for basic neural nets:
● scikit-learn

For deep learning:
● tensorflow
● keras
● pytorch
Prep for Tiny Example

example_df = pd.read_csv('TinyData.csv')   # the six-record tiny example
predictors = ['Fat', 'Salt']
outcome = 'Acceptance'
X = example_df[predictors]
y = example_df[outcome]
classes = sorted(y.unique())               # ['dislike', 'like']
Code for Tiny Example
Using MLPClassifier in scikit-learn

clf = MLPClassifier(hidden_layer_sizes=(3), activation='logistic',
                    solver='lbfgs', random_state=1)
clf.fit(X, y)
clf.predict(X)
# Look at network structure
print('Intercepts')
print(clf.intercepts_)
print('Weights')
print(clf.coefs_)

Intercepts
[array([0.13368045, 4.07247552, 7.00768104]),
array([14.30748676])]
Weights
[array([
[ -1.30656481, -4.20427792, -13.29587332],
[ -0.04399727, -4.91606924, -6.03356987]
]),
array([
[ -0.27348313],
[ -9.01211573],
[-17.63504694]
])]
Predictions
# Prediction
print(pd.concat([
example_df,
pd.DataFrame(clf.predict_proba(X), columns=classes)
], axis=1))

Fat Salt Acceptance dislike like


0 0.2 0.9 like 0.000490 0.999510
1 0.1 0.1 dislike 0.999994 0.000006
2 0.2 0.4 dislike 0.999741 0.000259
3 0.2 0.5 dislike 0.997368 0.002632
4 0.4 0.5 like 0.002133 0.997867
5 0.3 0.8 like 0.000075 0.999925
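The classificationSummary function imported from the dmba package earlier can summarize accuracy on the same data; a sketch of its use (the class_names keyword is assumed to match the book’s dmba utility):

# Confusion matrix and accuracy on the training data
classificationSummary(y, clf.predict(X), class_names=classes)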
Tiny Example - Final Weights

(the final bias values and weights are the intercepts_ and coefs_ printed above)
Common Criteria to Stop the Updating

⚫ When weights change very little from one iteration to the next

⚫ When the misclassification rate reaches a required threshold

⚫ When a limit on the number of runs is reached
Avoiding Overfitting
With sufficient iterations, a neural net can easily overfit the data

To avoid overfitting (a scikit-learn sketch follows this list):
⚫ Track error on validation data or via cross-validation
⚫ Limit the number of iterations
⚫ Limit the complexity of the network
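A minimal sketch of these controls using scikit-learn’s MLPClassifier (the data and parameter values are illustrative, not the book’s):

from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

# Synthetic data just to make the sketch self-contained
X_demo, y_demo = make_classification(n_samples=500, n_features=10, random_state=1)

clf = MLPClassifier(hidden_layer_sizes=(3,),   # limit network complexity
                    max_iter=200,              # limit the number of iterations
                    early_stopping=True,       # hold out a validation set and stop
                    validation_fraction=0.2,   #   when its score stops improving
                    solver='adam', random_state=1)
clf.fit(X_demo, y_demo)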
User Inputs
Specify Network Architecture
Number of hidden layers
⚫ Most popular – one hidden layer (use argument hidden_layer_sizes)

Number of nodes in hidden layer(s)
⚫ More nodes capture complexity, but increase the chances of overfitting (use argument hidden_layer_sizes; see the sketch after this list)

Number of output nodes
⚫ For classification with m classes, use m or m-1 nodes
⚫ For numerical prediction, use one
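For example (a sketch; the layer sizes here are arbitrary), a network with two hidden layers of 10 and 5 nodes is specified as:

from sklearn.neural_network import MLPClassifier

# Two hidden layers: 10 nodes in the first, 5 in the second (illustrative sizes)
clf = MLPClassifier(hidden_layer_sizes=(10, 5), solver='lbfgs', random_state=1)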
Network Architecture, cont.

“Learning Rate” (argument learning_rate; in scikit-learn the numeric rate itself is set with learning_rate_init, which is used by the sgd and adam solvers)
⚫ Low values “downweight” the new information from errors at each iteration
⚫ This slows learning, but reduces the tendency to overfit to local structure
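A sketch of setting a low learning rate in scikit-learn (the value is illustrative):

from sklearn.neural_network import MLPClassifier

# A small initial learning rate slows learning; applies to the sgd and adam solvers
clf = MLPClassifier(hidden_layer_sizes=(3,), solver='sgd',
                    learning_rate_init=0.001, random_state=1)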
Advantages

⚫Good predictive ability


⚫Can capture complex relationships
⚫No need to specify a model
Disadvantages
⚫ Considered a “black box” prediction machine, with no insight into the relationships between predictors and outcome
⚫ No variable-selection mechanism, so you have to exercise care in selecting variables
⚫ Heavy computational requirements if there are many variables (additional variables dramatically increase the number of weights to calculate)
Deep Learning
⚫ The statistical and machine learning models in this book - including standard neural nets - work where you have informative predictors (purchase information, bank account information, # of rooms in a house, etc.)
⚫ In rapidly-growing applications of voice and image recognition, you have high numbers of “low-level” granular predictors - pixel values, wave amplitudes - that are uninformative at this low level
Deep Learning
The most active application area for neural nets

• In image recognition, pixel values are the predictors, and there might be 100,000+ predictors – big data! (voice recognition is similar)
• Deep neural nets with many layers (“neural nets on steroids”) have facilitated revolutionary breakthroughs in image/voice recognition and in artificial intelligence (AI)
• The key is the ability to self-learn features (“unsupervised”)
• For example, clustering could separate the pixels in a 1” by 1” football field image into the “green field” and “yard marker” areas without knowing that those concepts exist
• From there, the concept of a boundary, or “edge”, emerges
• Successive stages move from identification of local, simple features to more global & complex features
Convolutional Neural Net example in image recognition

● A popular deep learning implementation is a convolutional neural net (CNN)
● Need to aggregate predictors (pixels)
● Rather than have weights for each pixel, group pixels together and apply the same operation: “convolution”
● A common aggregation is a 3 x 3 pixel area, for example the small area around this man’s lower chin

(figure: enlargement of the area and its pixel values; higher number = darker)
Apply the convolution

The convolution operation is “multiply the pixel matrix by the filter matrix, element by element, then sum”:

0*25 + 1*200 + 0*25 +
0*25 + 1*225 + 0*25 +
0*25 + 1*225 + 0*25
= 650

The filter matrix used here is good at identifying central vertical lines (we will see why shortly). The sum, 650, is higher than for any other arrangement of the filter matrix, because the pixel values are highest in the central column.
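A minimal sketch of this single convolution step in Python (the pixel values are the ones shown above; numpy is assumed to be available):

import numpy as np

# 3 x 3 patch of pixel values from the example (higher number = darker)
pixels = np.array([[25, 200, 25],
                   [25, 225, 25],
                   [25, 225, 25]])

# Filter that responds to a central vertical line
vertical_filter = np.array([[0, 1, 0],
                            [0, 1, 0],
                            [0, 1, 0]])

# Convolution at this position: elementwise multiply, then sum
result = np.sum(pixels * vertical_filter)
print(result)   # 650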
Continue the Convolution

⚫ The filter matrix moves across the image, storing its result, yielding a smaller matrix whose values indicate the presence or absence of a vertical line
⚫ Similar filters can detect horizontal lines, curves, borders - hyper-local features
⚫ Further convolutions can be applied to these local features
⚫ Result: a multi-dimensional matrix, or tensor, of higher-level features
The Learning Process
How does the net learn which convolutions to do?
⚫ In supervised learning (where the training data has known labels), the net retains those convolutions and features which are successful in labeling (tagging) images
⚫ Note that the feature-learning process yields a reduced (simpler) set of features than the original set of pixel values
Unsupervised Learning
Autoencoding
⚫ Deep learning nets can learn higher-level features even when there are no labels to guide the process
⚫ The net adds a process to take the high-level features and generate an image
⚫ The generated image is compared to the original image, and the net retains the architecture that produces the best matches
Summary
⚫ Neural nets can capture flexible/complicated relationships between outcome and predictors
⚫ The network “learns” and updates its model iteratively as more data are fed into it
⚫ Major danger: overfitting
⚫ Requires large amounts of data
⚫ Good predictive performance, yet it’s a “black box”
⚫ Deep learning, using very complex neural nets, is effective in learning higher-level features from a multitude of lower-level ones
⚫ Deep learning is the key to image recognition and many AI applications
