0% found this document useful (0 votes)

87 views

Early Stopping in Practice

The document discusses how to add and customize early stopping when training machine learning models using Keras and TensorFlow. It provides an example of implementing early stopping on an iris flower dataset, including preparing the data, building a neural network model, compiling and training the model with early stopping.

Uploaded by

Alina Burdyuh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

87 views

Early Stopping in Practice

Uploaded by

Alina Burdyuh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B.

Chen | Towards Data Science

Open in app Sign up Sign In

Published in Towards Data Science

You have 2 free member-only stories left this month.

B. Chen Follow

Jul 29, 2020 · 8 min read · · Listen

Save

Early Stopping in Practice: an example with

Keras and TensorFlow 2.0
A step to step tutorial to add and customize Early Stopping

59 1
Photo by Samuel Bourke on Unsplash

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 1/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

In this article, we will focus on adding and customizing Early Stopping in our
machine learning model and look at an example of how we do this in practice with
Keras and TensorFlow 2.0.

Introduction to Early Stopping

In machine learning, early stopping is one of the most widely used regularization
techniques to combat the overfitting issue.

Early Stopping monitors the performance of the

model for every epoch on a held-out validation set
during the training, and terminate the training
conditional on the validation performance.

From Hands-on ML [1]

Early Stopping is a very different way to regularize the machine learning model.
The way it does is to stop training as soon as the validation error reaches a
minimum. The figure below shows a model being trained.

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 2/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

As the epochs go by, the algorithm leans and its error on the training set naturally
goes down, and so does its error on the validation set. However, after a while, the
validation error stops decreasing and actually starts to go back up. This indicates
that the model has started to overfit the training data. With Early Stopping, you just
stop training as soon as the validation error reaches the minimum.

It is such a simple and efficient regularization technique that Geoffrey Hinton

called it a “beautiful free lunch.” [1].

With Stochastic and Mini-batch Gradient Descent

With Stochastic and Mini-batch Gradient Descent, the curves are not so smooth, and
it may be hard to know whether you have reached the minimum or not. One
solution is to stop only after the validation error has been above the minimum for
some time (when you are confident that the model will not do any better), then roll
back the model parameters to the point where the validation error was at a
minimum.

In the following article, we are going to add and customize Early Stopping in our
machine learning model.

Environment setups and dataset preparation

We will be using the same dataset as we did in the model regularization and batch
normalization. You can skip this chapter if you are already familiar with it.

In order to run this tutorial, you need to install

TensorFlow 2, numpy, pandas, sklean, matplotlib

They can all be installed directly vis PyPI and I strongly recommend to create a new
Virtual Environment. For a tutorial on creating a Python virtual environment

Create Virtual Environment using “virtualenv” and add it to Jupyter Notebook

Create Virtual Environment using “conda” and add it to Jupyter Notebook

Source code
This is a step by step tutorial and all instructions are in this article. For source code,
please check out my Github machine learning repo.

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 3/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Dataset preparation
This tutorial uses the Anderson Iris flower (iris) dataset for demonstration. The
dataset contains a set of 150 records under five attributes: sepal length, sepal width,
petal length, petal width, and class (known as target from sklearn datasets).

First, let’s import the libraries and obtain iris dataset from scikit-learn library. You
can also download it from the UCI Iris dataset.

import tensorflow as tf
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

iris = load_iris()

For the purpose of exploring data, let’s load data into a DataFrame

# Load data into a DataFrame

df = pd.DataFrame(iris.data, columns=iris.feature_names)
# Convert datatype to float
df = df.astype(float)
# append "target" and name it "label"
df['label'] = iris.target
# Use string label instead
df['label'] = df.label.replace(dict(enumerate(iris.target_names)))

And the df should look like below:

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 4/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

We notice the label column is a categorical feature and will need to convert it to one-
hot encoding. Otherwise, our machine learning algorithm won’t be able to directly
take in that as input.

# label -> one-hot encoding

label = pd.get_dummies(df['label'], prefix='label')
df = pd.concat([df, label], axis=1)
# drop old label
df.drop(['label'], axis=1, inplace=True)

Now, the df should look like:

Next, let’s create X and y. Keras and TensorFlow 2.0 only take in Numpy array as
inputs, so we will have to convert DataFrame back to Numpy array.

# Creating X and yX = df[['sepal length (cm)', 'sepal width (cm)',

'petal length (cm)', 'petal width (cm)']]
# Convert DataFrame into np array
X = np.asarray(X)y = df[['label_setosa', 'label_versicolor',
'label_virginica']]
# Convert DataFrame into np array
y = np.asarray(y)

Finally, let’s split the dataset into a training set (80%)and a test set (20%) using
train_test_split() from sklearn library.

X_train, X_test, y_train, y_test = train_test_split(

X,
y,
test_size=0.20
)

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 5/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Great! our data is ready for building a Machine Learning model.

Build a neural network

There are 3 ways to create a machine learning model with Keras and TensorFlow
2.0. Since we are building a simple fully connected neural network and for
simplicity, let’s use the easiest way: Sequential Model with Sequential() .

Let’s go ahead and create a function called create_model() to return a Sequential

model.

from tensorflow.keras.models import Sequential

from tensorflow.keras.layers import Dense
def create_model():
model = Sequential([
Dense(64, activation='relu', input_shape=(4,)),
Dense(128, activation='relu'),
Dense(128, activation='relu'),
Dense(128, activation='relu'),
Dense(64, activation='relu'),
Dense(64, activation='relu'),
Dense(64, activation='relu'),
Dense(3, activation='softmax')
])
return model

Our model has the following specifications:

The first layer (also known as the input layer) has the input_shape to set the
input size (4,)

The input layer has 64 units, followed by 3 dense layers, each with 128 units.
Then there are further 3 dense layers, each with 64 units. All these layers use the
ReLU activation function.

The output Dense layer has 3 units and the softmax activation function.

Compile and train the model

In order to train a model, we first have to configure our model using compile() and
pass the following arguments:

Use Adam ( adam ) optimization algorithm as the optimizer

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 6/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Use categorical cross-entropy loss function ( categorical_crossentropy ) for our

multiple-class classification problem

For simplicity, use accuracy as our evaluation metrics to evaluate the model
during training and testing.

model.compile(
optimizer='adam',
loss='categorical_crossentropy',
metrics=['accuracy']
)

After that, we can call model.fit() to fit our model to the training data.

history = model.fit(
X_train,
y_train,
epochs=200,
validation_split=0.25,
batch_size=40,
verbose=2
)

If all runs smoothly, we should get an output like below

Train on 84 samples, validate on 28 samples

Epoch 1/200
84/84 - 1s - loss: 1.0901 - accuracy: 0.3214 - val_loss: 1.0210 -
val_accuracy: 0.7143
Epoch 2/200
84/84 - 0s - loss: 1.0163 - accuracy: 0.6905 - val_loss: 0.9427 -
val_accuracy: 0.7143
......
Epoch 200/200
84/84 - 0s - loss: 0.5269 - accuracy: 0.8690 - val_loss: 0.4781 -
val_accuracy: 0.8929

Plot the learning curves

Finally, let’s plot the loss vs. epochs graph on the training and validation sets.

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 7/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

It is preferable to create a small function for plotting metrics. Let’s go ahead and
create a function plot_metric() .

%matplotlib inline
%config InlineBackend.figure_format = 'svg'def
plot_metric(history, metric):
train_metrics = history.history[metric]
val_metrics = history.history['val_'+metric]
epochs = range(1, len(train_metrics) + 1)
plt.plot(epochs, train_metrics)
plt.plot(epochs, val_metrics)
plt.title('Training and validation '+ metric)
plt.xlabel("Epochs")
plt.ylabel(metric)
plt.legend(["train_"+metric, 'val_'+metric])
plt.show()

By running plot_metric(history, 'loss') to get a picture of loss progress.

From the above graph, we can see that the model has overfitted the training data,
so it outperforms the validation set.
https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 8/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Adding Early Stopping

The Keras module contains a built-in callback designed for Early Stopping [2].

First, let’s import EarlyStopping callback and create an early stopping object
early_stopping .

from tensorflow.keras.callbacks import EarlyStopping

early_stopping = EarlyStopping()

EarlyStopping() has a few options and by default:

monitor='val_loss' : to use validation loss as performance measure to terminate

the training.

patience=0 : is the number of epochs with no improvement. The value 0 means

the training is terminated as soon as the performance measure gets worse from
one epoch to the next.

Next, we just need to pass the callback object to model.fit() method.

history = model.fit(
X_train,
y_train,
epochs=200,
validation_split=0.25,
batch_size=40,
verbose=2,
callbacks=[early_stopping]
)

You can see that early_stopping get passed in a list to the callbacks argument. It is
a list because in practice we might be passing a number of callbacks for performing
different tasks, for example debugging and learning rate scheduler.

By executing the statement, you should get an output like below:

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 9/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Note: your output can be different due to the different weight initialization.

The training gets terminated at Epoch 6 due to the increase of val_loss value and
that is exactly the conditions monitor='val_loss' and patience=0 .

It’s often more convenient to look at a plot, let’s run plot_metric(history, 'loss') to
get a clear picture. In the below graph, validation loss is shown in orange and it’s
clear that validation error increases at Epoch 6.

Customizing Early Stopping

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 10/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Apart from the options monitor and patience we mentioned early, the other 2
options min_delta and mode are likely to be used quite often.

monitor='val_loss' : to use validation loss as performance measure to terminate

the training.

patience=0 : is the number of epochs with no improvement. The value 0 means

the training is terminated as soon as the performance measure gets worse from
one epoch to the next.

min_delta : Minimum change in the monitored quantity to qualify as an

improvement, i.e. an absolute change of less than min_delta , will count as no
improvement.

mode='auto' : Should be one of auto , min or max . In 'min' mode, training will
stop when the quantity monitored has stopped decreasing; in 'max' mode it will
stop when the quantity monitored has stopped increasing; in 'auto' mode, the
direction is automatically inferred from the name of the monitored quantity.

And here is an example of a customized early stopping:

custom_early_stopping = EarlyStopping(
monitor='val_accuracy',
patience=8,
min_delta=0.001,
mode='max'
)

monitor='val_accuracy' to use validation accuracy as performance measure to

terminate the training. patience=8 means the training is terminated as soon as 8
epochs with no improvement. min_delta=0.001 means the validation accuracy has to
improve by at least 0.001 for it to count as an improvement. mode='max' means it
will stop when the quantity monitored has stopped increasing.

Let’s go ahead and run it with the customized early stopping.

history = model.fit(
X_train,
y_train,
epochs=200,
https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 11/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

validation_split=0.25,
batch_size=40,
verbose=2,
callbacks=[custom_early_stopping]
)

This time, the training gets terminated at Epoch 9 as there are 8 epochs with no
improvement on validation accuracy (It has to be ≥ 0.001 to count as an
improvement). For a clear picture, let’s look at a plot representation of accuracy by
running plot_metric(history, 'accuracy') . In the below graph, validation accuracy
is shown in orange and it’s clear that validation accuracy hasn’t got any
improvement.

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 12/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

That’s it
Thanks for reading.

Please checkout the notebook on my Github for the source code.

Stay tuned if you are interested in the practical aspect of machine learning.

References
[1] Hands-on Machine Learning with scikit-learn, keras, and tensorflow:
concepts, tools, and techniques to build intelligent system

[2] Keras Official Documentation for Early Stopping

Early Stopping Keras Tensor Flow Machine Learning Data Science

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 13/14
01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B. Chen | Towards Data Science

Sign up for The Variable

By Towards Data Science

Every Thursday, the Variable delivers the very best of Towards Data Science: from hands-on tutorials and cutting-
edge research to original features you don't want to miss. Take a look.

By signing up, you will create a Medium account if you don’t already have one. Review
our Privacy Policy for more information about our privacy practices.

Get this newsletter

About Help Terms Privacy

Get the Medium app

https://towardsdatascience.com/a-practical-introduction-to-early-stopping-in-machine-learning-550ac88bc8fd 14/14

(Ebook) Machine Learning Algorithms in Depth (MEAP V01) by Vadim Smolyakov ISBN 9781633439214, 1633439216 download pdf
100% (5)
(Ebook) Machine Learning Algorithms in Depth (MEAP V01) by Vadim Smolyakov ISBN 9781633439214, 1633439216 download pdf
81 pages
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Article Review
No ratings yet
Article Review
13 pages
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Word2Vec Tutorial - The Skip-Gram Model Chris McCormick PDF
No ratings yet
Word2Vec Tutorial - The Skip-Gram Model Chris McCormick PDF
39 pages
Quebec CEGEP Curriculum - Ontario Equivalents
No ratings yet
Quebec CEGEP Curriculum - Ontario Equivalents
21 pages
On Deep Machine Learning & Time Series Models: A Case Study With The Use of Keras
100% (1)
On Deep Machine Learning & Time Series Models: A Case Study With The Use of Keras
34 pages
AML 04 Backpropagation
100% (1)
AML 04 Backpropagation
26 pages
Eda PDF
100% (1)
Eda PDF
45 pages
Building Powerful Image Classification Models Using Very Little Data
No ratings yet
Building Powerful Image Classification Models Using Very Little Data
20 pages
3 Regression Diagnostics
100% (1)
3 Regression Diagnostics
53 pages
Chapter 5.3-Mulitple Linear Regression
No ratings yet
Chapter 5.3-Mulitple Linear Regression
26 pages
Binary Classification Tutorial With The Keras Deep Learning Library
No ratings yet
Binary Classification Tutorial With The Keras Deep Learning Library
33 pages
1694600777-Unit2.2 Logistic Regression CU 2.0
100% (1)
1694600777-Unit2.2 Logistic Regression CU 2.0
37 pages
Transformers in NLP 1
No ratings yet
Transformers in NLP 1
9 pages
Kaggle State of Machine Learning and Data Science 2020 PDF
No ratings yet
Kaggle State of Machine Learning and Data Science 2020 PDF
30 pages
Using Categorical Data With One Hot Encoding - Kaggle PDF
No ratings yet
Using Categorical Data With One Hot Encoding - Kaggle PDF
4 pages
Lec16 - Autoencoders
No ratings yet
Lec16 - Autoencoders
18 pages
Machine Learning
No ratings yet
Machine Learning
38 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Deep Learning Tutorial: Reference: Hung-Yi Lee
100% (1)
Deep Learning Tutorial: Reference: Hung-Yi Lee
179 pages
Gradient Descent Algorithms and Variations - PyImageSearch
No ratings yet
Gradient Descent Algorithms and Variations - PyImageSearch
21 pages
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
No ratings yet
Stanford University CS224d - Deep Learning For Natural Language Processing - Syllabus
3 pages
Soft Max
No ratings yet
Soft Max
6 pages
02 ML Supervised Learning
No ratings yet
02 ML Supervised Learning
32 pages
Logistic Regression
No ratings yet
Logistic Regression
24 pages
Predict 422 - Module 8
100% (1)
Predict 422 - Module 8
138 pages
Introduction To Data Visualization With Python
No ratings yet
Introduction To Data Visualization With Python
47 pages
Keras Succinctly
No ratings yet
Keras Succinctly
107 pages
Hyperparameter Tuning in XGBoost Using Genetic Algorithm
100% (1)
Hyperparameter Tuning in XGBoost Using Genetic Algorithm
11 pages
Gradient Descent
No ratings yet
Gradient Descent
15 pages
Lecture 2 Prompt Engineering
No ratings yet
Lecture 2 Prompt Engineering
60 pages
Vision Mamba: Rethinking Visual Representation With Bidirectional LSTMs
No ratings yet
Vision Mamba: Rethinking Visual Representation With Bidirectional LSTMs
7 pages
Module2.3 Hyperparameter Optimization
No ratings yet
Module2.3 Hyperparameter Optimization
29 pages
Machine Lpipearning Interview Questions: Algorithms/Tp: Q1-What's The Trade-Off Between Bias and Variance?
No ratings yet
Machine Lpipearning Interview Questions: Algorithms/Tp: Q1-What's The Trade-Off Between Bias and Variance?
46 pages
Lab I TENSOR FLOW AND KERAS
No ratings yet
Lab I TENSOR FLOW AND KERAS
3 pages
PyTorch Crash Course 1713016363
No ratings yet
PyTorch Crash Course 1713016363
15 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
15 pages
ML Project Shivani Pandey
100% (2)
ML Project Shivani Pandey
49 pages
ML First Unit
No ratings yet
ML First Unit
70 pages
Cluster Analysis: Concepts and Techniques - Chapter 7
100% (1)
Cluster Analysis: Concepts and Techniques - Chapter 7
60 pages
Notes On Backpropagation
No ratings yet
Notes On Backpropagation
14 pages
Introduction
100% (1)
Introduction
49 pages
DL Full Merged
No ratings yet
DL Full Merged
454 pages
ML UNIT-2 Notes
No ratings yet
ML UNIT-2 Notes
15 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
5 pages
Machine Learning: Lecture 13: Model Validation Techniques, Overfitting, Underfitting
100% (2)
Machine Learning: Lecture 13: Model Validation Techniques, Overfitting, Underfitting
26 pages
Bias and Variance
No ratings yet
Bias and Variance
6 pages
Hyperparameters
No ratings yet
Hyperparameters
15 pages
Vector Representation of Text: Vagelis Hristidis Prepared With The Help of Nhat Le Many Slides Are From Richard Socher
No ratings yet
Vector Representation of Text: Vagelis Hristidis Prepared With The Help of Nhat Le Many Slides Are From Richard Socher
20 pages
PPT_Btech CSE
No ratings yet
PPT_Btech CSE
17 pages
Quiz
No ratings yet
Quiz
6 pages
Guide To RAG System Evaluation Metrics
No ratings yet
Guide To RAG System Evaluation Metrics
21 pages
ML Unit-Iv
No ratings yet
ML Unit-Iv
19 pages
Unit 2
No ratings yet
Unit 2
112 pages
10 Evani Generative AI Champion
No ratings yet
10 Evani Generative AI Champion
39 pages
27 SVM Interview Questions (ANSWERED) To Master Before ML & Data Science Interview - MLStack - Cafe
No ratings yet
27 SVM Interview Questions (ANSWERED) To Master Before ML & Data Science Interview - MLStack - Cafe
25 pages
Machine Learning + Devops Using Azure ML Services
No ratings yet
Machine Learning + Devops Using Azure ML Services
17 pages
StatisticsMachineLearningPythonDraft PDF
100% (1)
StatisticsMachineLearningPythonDraft PDF
313 pages
Ridge and Lasso in Python PDF
No ratings yet
Ridge and Lasso in Python PDF
5 pages
Effective Amazon Machine Learning
From Everand
Effective Amazon Machine Learning
Alexis Perrier
No ratings yet
The Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data
From Everand
The Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data
Pierre-yves Bonnefoy
No ratings yet
Za HL 376 Degrees of Comparison Using Suffixes Er and Est - Ver - 2
No ratings yet
Za HL 376 Degrees of Comparison Using Suffixes Er and Est - Ver - 2
11 pages
! НА ПЕРЕКЛАД Consumer Electronics Producers. Matushita Denki+Виробники побутової електротехніки. Дженерал Електрик
No ratings yet
! НА ПЕРЕКЛАД Consumer Electronics Producers. Matushita Denki+Виробники побутової електротехніки. Дженерал Електрик
14 pages
Readership and Purpose in The Choice of Economics
No ratings yet
Readership and Purpose in The Choice of Economics
20 pages
410 - 1 - Speakout Elementary SB PDF
No ratings yet
410 - 1 - Speakout Elementary SB PDF
1 page
Attention U-Net, ResUnet, U-Net++, U - Net - AIGuys
No ratings yet
Attention U-Net, ResUnet, U-Net++, U - Net - AIGuys
16 pages
410 - 3 - Speakout Elementary WB (With Key) PDF
No ratings yet
410 - 3 - Speakout Elementary WB (With Key) PDF
1 page
An Introduction to Mathematical Modeling of Infectious Diseases Premium Download
100% (3)
An Introduction to Mathematical Modeling of Infectious Diseases Premium Download
14 pages
APPENDIX 4 Empirical Methods of Prediction of Maneuverability
No ratings yet
APPENDIX 4 Empirical Methods of Prediction of Maneuverability
4 pages
The Design and Conduct of Meaningful Experiments Involving Human Participants 25 Scientific Principles 1st Edition R. Barker Bausell download
100% (1)
The Design and Conduct of Meaningful Experiments Involving Human Participants 25 Scientific Principles 1st Edition R. Barker Bausell download
57 pages
2019 Class Test 1 MEMO
No ratings yet
2019 Class Test 1 MEMO
6 pages
(Edgar Chambers) Sensory Testing Methods (ASTM Man PDF
0% (1)
(Edgar Chambers) Sensory Testing Methods (ASTM Man PDF
120 pages
Quality Management Book + Article Review Review. For Robel
No ratings yet
Quality Management Book + Article Review Review. For Robel
15 pages
MFIN 5800 Chapter 11
No ratings yet
MFIN 5800 Chapter 11
32 pages
Probability and Statistical Course.: Instructor: DR - Ing. (C) Sergio A. Abreo C
No ratings yet
Probability and Statistical Course.: Instructor: DR - Ing. (C) Sergio A. Abreo C
25 pages
Lecture Problem Set 1-Chem203
No ratings yet
Lecture Problem Set 1-Chem203
4 pages
SPE 133428 Modeling Thermal Effects On Wellbore Stability
No ratings yet
SPE 133428 Modeling Thermal Effects On Wellbore Stability
23 pages
Research Made Easy
No ratings yet
Research Made Easy
74 pages
Ship's Lifeboats - Analysis of Accident Cause and Effect and Its Relationship To Seafarers' Hazard Perception
100% (2)
Ship's Lifeboats - Analysis of Accident Cause and Effect and Its Relationship To Seafarers' Hazard Perception
153 pages
Data Collection Statistics
No ratings yet
Data Collection Statistics
18 pages
Output SOFA LVEF
No ratings yet
Output SOFA LVEF
7 pages
Visvesvaraya Technological University, Belagavi: VTU-ETR Seat No.: A
No ratings yet
Visvesvaraya Technological University, Belagavi: VTU-ETR Seat No.: A
48 pages
(Assume α= 5% if not mentioned in the question) : Cfa, Frm, Ca, Cs, Fm, Caia, Cipm, Ccra, Ciib, Aim, Cira
No ratings yet
(Assume α= 5% if not mentioned in the question) : Cfa, Frm, Ca, Cs, Fm, Caia, Cipm, Ccra, Ciib, Aim, Cira
3 pages
Literature Review On Training and Development Process
100% (1)
Literature Review On Training and Development Process
4 pages
Group 2 Section A - Full Paper
No ratings yet
Group 2 Section A - Full Paper
29 pages
Empirical Methods For Finance: Sjoerd Van Den Hauwe
No ratings yet
Empirical Methods For Finance: Sjoerd Van Den Hauwe
27 pages
(AR) An Alternative Method For Quantitative Synthesis of Single-Subject Researches. Percentage of Data Points Exceeding The Median (2006)
No ratings yet
(AR) An Alternative Method For Quantitative Synthesis of Single-Subject Researches. Percentage of Data Points Exceeding The Median (2006)
20 pages
Normal Distribution PPT
No ratings yet
Normal Distribution PPT
20 pages
Cox Ingersoll Ross - Model
No ratings yet
Cox Ingersoll Ross - Model
6 pages
Statistical Comparison of The Slopes of Two Regression Lines A Tutorial
No ratings yet
Statistical Comparison of The Slopes of Two Regression Lines A Tutorial
12 pages
statitics by Mesfin
No ratings yet
statitics by Mesfin
150 pages
Lecture Slides For BER+Q-factor+EyeDiagram
No ratings yet
Lecture Slides For BER+Q-factor+EyeDiagram
5 pages
Teaching Science Technology Components of Scientific Literacy and Insight Into The Steps of Research
No ratings yet
Teaching Science Technology Components of Scientific Literacy and Insight Into The Steps of Research
17 pages
Understanding Survival Analysis Kaplan-Meier Estimate
No ratings yet
Understanding Survival Analysis Kaplan-Meier Estimate
5 pages
Pointers to Review
No ratings yet
Pointers to Review
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Early Stopping in Practice

Uploaded by

Early Stopping in Practice

Uploaded by

01.02.2023, 17:17 Early Stopping in Practice: an example with Keras and TensorFlow 2.0 | by B.

Chen | Towards Data Science

Open in app Sign up Sign In

Published in Towards Data Science

You have 2 free member-only stories left this month.

Jul 29, 2020 · 8 min read · · Listen

Early Stopping in Practice: an example with

Introduction to Early Stopping

Early Stopping monitors the performance of the

From Hands-on ML [1]

It is such a simple and efficient regularization technique that Geoffrey Hinton

With Stochastic and Mini-batch Gradient Descent

Environment setups and dataset preparation

In order to run this tutorial, you need to install

TensorFlow 2, numpy, pandas, sklean, matplotlib

Create Virtual Environment using “virtualenv” and add it to Jupyter Notebook

Create Virtual Environment using “conda” and add it to Jupyter Notebook

# Load data into a DataFrame

And the df should look like below:

# label -> one-hot encoding

Now, the df should look like:

# Creating X and yX = df[['sepal length (cm)', 'sepal width (cm)',

X_train, X_test, y_train, y_test = train_test_split(

Great! our data is ready for building a Machine Learning model.

Build a neural network

Let’s go ahead and create a function called create_model() to return a Sequential

from tensorflow.keras.models import Sequential

Our model has the following specifications:

Compile and train the model

Use Adam ( adam ) optimization algorithm as the optimizer

Use categorical cross-entropy loss function ( categorical_crossentropy ) for our

If all runs smoothly, we should get an output like below

Train on 84 samples, validate on 28 samples

Plot the learning curves

By running plot_metric(history, 'loss') to get a picture of loss progress.

Adding Early Stopping

from tensorflow.keras.callbacks import EarlyStopping

EarlyStopping() has a few options and by default:

monitor='val_loss' : to use validation loss as performance measure to terminate

patience=0 : is the number of epochs with no improvement. The value 0 means

Next, we just need to pass the callback object to model.fit() method.

By executing the statement, you should get an output like below:

Customizing Early Stopping

monitor='val_loss' : to use validation loss as performance measure to terminate

patience=0 : is the number of epochs with no improvement. The value 0 means

min_delta : Minimum change in the monitored quantity to qualify as an

And here is an example of a customized early stopping:

monitor='val_accuracy' to use validation accuracy as performance measure to

Let’s go ahead and run it with the customized early stopping.

Please checkout the notebook on my Github for the source code.

[2] Keras Official Documentation for Early Stopping

Early Stopping Keras Tensor Flow Machine Learning Data Science

Sign up for The Variable

Get this newsletter

About Help Terms Privacy

Get the Medium app

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.