ML Unit 3 Part 1
DECISION TREE LEARNING
The Decision Tree algorithm is a supervised machine
learning algorithm in which the data is repeatedly
split at each node based on certain rules until the
final outcome is generated.
Decision tree learning is a method for approximating
discrete-valued target functions, in which the learned
function is represented by a decision tree. Learned trees
can also be re-represented as sets of if-then rules to
improve human readability.
These learning methods are among the most popular of
inductive inference algorithms. The learning procedure
is iterative and inductive: it generates a set of
classification rules of the form IF-THEN from a set of
examples, producing rules at each iteration and
appending them to the rule set.
These have been successfully applied to a broad range
of tasks from learning to diagnose medical cases to
learning to assess credit risk of loan applicants.
DECISION TREE
REPRESENTATION
Decision trees classify instances by sorting them
down the tree from the root to
some leaf node, which provides the classification of the
instance. Each node in the tree specifies a test of some
attribute of the instance, and each branch descending
from that node corresponds to one of the possible
values for this attribute.
In general, decision trees represent a disjunction of
conjunctions of constraints on the attribute values
of instances. Each path from the tree root to a leaf
corresponds to a conjunction of attribute tests, and the
tree itself to a disjunction of these conjunctions. For
example, the decision tree shown in Figure 3.1
corresponds to the expression
(Outlook = Sunny ∧ Humidity = Normal) ∨ (Outlook = Overcast) ∨ (Outlook = Rain ∧ Wind = Weak)
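To make this representation concrete, the sketch below (a minimal Python rendering, assuming instances are given as dictionaries of attribute-value pairs) walks the Figure 3.1 tree as nested if-then tests; each root-to-leaf path is one conjunction, and the paths ending in Yes together form the disjunction above.

def play_tennis(instance):
    # Classify a PlayTennis instance by walking the Figure 3.1 tree.
    # Each branch below corresponds to one conjunction of attribute tests.
    if instance["Outlook"] == "Sunny":
        # Outlook = Sunny AND Humidity = Normal -> Yes
        return "Yes" if instance["Humidity"] == "Normal" else "No"
    elif instance["Outlook"] == "Overcast":
        # Outlook = Overcast -> Yes
        return "Yes"
    else:  # Outlook = Rain
        # Outlook = Rain AND Wind = Weak -> Yes
        return "Yes" if instance["Wind"] == "Weak" else "No"

print(play_tennis({"Outlook": "Sunny", "Humidity": "High", "Wind": "Weak"}))  # prints No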
APPROPRIATE PROBLEMS FOR
DECISION TREE LEARNING:
Decision tree learning is generally best suited to
problems with the following characteristics:
1. Instances are represented by attribute-value pairs:
Instances are described by a fixed set of attributes
(e.g., Temperature) and their values (e.g., Hot); a small
sketch of this representation is given after this list.
2. The target function has discrete output values: The decision
tree in Figure 3.1 assigns a boolean classification (e.g., yes or
no) to each example. Decision tree methods easily extend to
learning functions with more than two possible output values.
3. Disjunctive descriptions may be required: Decision trees
naturally represent disjunctive expressions.
4. The training data may contain errors: Decision tree
learning methods are robust to errors, both errors in
classifications of the training examples and errors in
the attribute values that describe these examples.
5. The training data may contain missing attribute
values: Decision tree methods can be used even when
some training examples have unknown values (e.g., if
the Humidity of the day is known for only some of
the training examples).
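As referenced in item 1, here is a minimal sketch of the attribute-value representation, assuming the PlayTennis attributes used later in this unit; a real dataset would supply its own attribute names and values.

# One training example: a fixed set of attribute-value pairs plus its
# discrete target label (PlayTennis).
example = {
    "Outlook": "Sunny",
    "Temperature": "Hot",
    "Humidity": "High",
    "Wind": "Weak",
    "PlayTennis": "No",   # target attribute with discrete output values
}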
ID3 Algorithm:
ID3 stands for Iterative Dichotomiser 3 and is
named such because the algorithm iteratively
(repeatedly) dichotomizes (divides) features into two
or more groups at each step.
Invented by Ross Quinlan, ID3 uses a top-down
greedy approach to build a decision tree. In simple
words, the top-down approach means that we start
building the tree from the top and the greedy
approach means that at each iteration we select the
best feature at the present moment to create a node.
The ID3 algorithm selects the best feature at each
step while building a Decision tree. It uses
Information Gain or just Gain to find the best
feature.
Entropy is the measure of disorder, and the entropy
of a dataset is the measure of disorder in the target
feature of the dataset.
In the case of binary classification (where the target
column has only two classes),
Entropy(S) = -(p+) log2(p+) - (p-) log2(p-)
where p+ and p- are the proportions of positive and
negative examples in S. Entropy is 0 if all values in the
target column are homogeneous (all the same class) and 1 if
the target column has an equal number of examples of each class.
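A minimal Python sketch of this entropy computation, assuming the target column is passed in as a list of class labels:

from collections import Counter
from math import log2

def entropy(target_values):
    # Entropy of a list of target labels: sum over classes of -p * log2(p).
    total = len(target_values)
    counts = Counter(target_values)
    return sum(-(n / total) * log2(n / total) for n in counts.values())

print(entropy(["Yes"] * 7))               # 0.0 -- homogeneous target column
print(entropy(["Yes"] * 7 + ["No"] * 7))  # 1.0 -- balanced binary target column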
Information Gain calculates the reduction in the
entropy and measures how well a given feature
separates or classifies the target classes. The feature
with the highest Information Gain is selected as the
best one.
Information Gain for a feature column A is calculated
as:
Gain(S, A) = Entropy(S) - Σ (|Sv| / |S|) * Entropy(Sv)
where the sum runs over each value v of A, and Sv is the
subset of S for which attribute A has value v.
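Building on the entropy sketch above, the same formula in Python; examples is assumed to be a list of attribute-value dictionaries like the one shown earlier, and the function name and default target are illustrative.

def information_gain(examples, attribute, target="PlayTennis"):
    # Gain(S, A) = Entropy(S) - sum over values v of A of (|Sv|/|S|) * Entropy(Sv)
    total = len(examples)
    base = entropy([ex[target] for ex in examples])
    remainder = 0.0
    for value in set(ex[attribute] for ex in examples):
        # Sv: target labels of the examples for which attribute A has value v
        subset = [ex[target] for ex in examples if ex[attribute] == value]
        remainder += (len(subset) / total) * entropy(subset)
    return base - remainder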
An Illustrative Example:
To illustrate the operation of ID3, consider the
learning task represented by the training examples of
Table 3.2. Here the target attribute PlayTennis,
which can have values yes or no for different
Saturday mornings, is to be predicted based on other
attributes of the morning in question.
ID3 determines the information gain for each
candidate attribute (i.e., Outlook, Temperature,
Humidity, and Wind), then selects the one with
highest information gain.
To illustrate, suppose S is a collection of 14 examples
of some Boolean concept, including 9 positive and 5
negative examples (we adopt the notation [9+, 5-] to
summarize such a sample of data).
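A quick check of this example with the entropy sketch given earlier confirms that the entropy of a [9+, 5-] collection is about 0.940:

S = ["+"] * 9 + ["-"] * 5      # the [9+, 5-] collection of 14 examples
print(round(entropy(S), 3))    # 0.94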
Information gain is precisely the measure used by
ID3 to select the best attribute at each step in
growing the tree. The use of information gain to
evaluate the relevance of attributes is summarized in
Figure 3.3.
The information gain values for all four attributes are
Gain(S, Outlook) = 0.246
Gain(S, Humidity) = 0.151
Gain(S, Wind) = 0.048
Gain(S, Temperature) = 0.029
According to the information gain measure, the
Outlook attribute provides the best prediction of the
target attribute, PlayTennis, over the training
examples. Therefore, Outlook is selected as the
decision attribute for the root node, and branches are
created below the root for each of its possible values
(i.e., Sunny, Overcast, and Rain).
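Picking the root attribute is then just a matter of taking the maximum over these gain values; a small illustrative sketch using the numbers listed above:

gains = {"Outlook": 0.246, "Humidity": 0.151, "Wind": 0.048, "Temperature": 0.029}
root = max(gains, key=gains.get)
print(root)   # Outlook -- the attribute chosen for the root node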
Every example for which Outlook = Overcast is also
a positive example of PlayTennis. Therefore, this
node of the tree becomes a leaf node with the
classification PlayTennis = Yes. In contrast, the
descendants corresponding to Outlook = Sunny and
Outlook = Rain still have nonzero entropy, and the
decision tree will be further elaborated below these
nodes.
The process of selecting a new attribute and
partitioning the training examples is now repeated for
each non-terminal descendant node, this time using
only the training examples associated with that node.
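This repeated select-and-partition step is the core of ID3. The sketch below puts it together as a recursion, reusing the entropy and information_gain helpers shown earlier; it assumes examples are attribute-value dictionaries with a discrete target column and omits refinements such as handling missing attribute values.

from collections import Counter

def id3(examples, attributes, target="PlayTennis"):
    # Build a decision tree represented as nested dicts: {attribute: {value: subtree}}.
    labels = [ex[target] for ex in examples]
    if len(set(labels)) == 1:
        return labels[0]                             # all examples agree: leaf node
    if not attributes:
        return Counter(labels).most_common(1)[0][0]  # no attributes left: majority leaf
    # Greedy step: split on the attribute with the highest information gain.
    best = max(attributes, key=lambda a: information_gain(examples, a, target))
    tree = {best: {}}
    remaining = [a for a in attributes if a != best]
    for value in set(ex[best] for ex in examples):
        subset = [ex for ex in examples if ex[best] == value]
        tree[best][value] = id3(subset, remaining, target)
    return tree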
HYPOTHESIS SPACE SEARCH IN
DECISION TREE LEARNING:
As with other inductive learning methods, ID3 can be
characterized as searching a space of hypotheses for one
that fits the training examples.
The hypothesis space searched by ID3 is the set of possible
decision trees.
ID3 performs a simple-to-complex, hill-climbing search
through this hypothesis space, beginning with the empty
tree, then considering progressively more elaborate
hypotheses in search of a decision tree that correctly
classifies the training data.
The evaluation function that guides this hill-climbing search
is the information gain measure.
Gain(S_Sunny, Humidity) = 0.970
Gain(S_Sunny, Temperature) = 0.570
Gain(S_Sunny, Wind) = 0.019
FIGURE 3.4:
The partially learned decision tree resulting from the first
step of ID3. The training examples are sorted to the
corresponding descendant nodes. The Overcast descendant
has only positive examples and therefore becomes a leaf
node with classification Yes. The other two nodes will be
further expanded, by selecting the attribute with highest
information gain relative to the new subsets of examples.
INDUCTIVE BIAS IN DECISION
TREE LEARNING:
Inductive bias is the set of assumptions that, together with
the training data, deductively justify the classifications
assigned by the learner to future instances.
Describing the inductive bias of ID3 consists of describing
the basis by which it chooses one of these consistent
hypotheses over the others.
ID3's search strategy (a) selects in favor of shorter trees
over longer ones, and (b) selects trees that place the
attributes with highest information gain closest to the root.
ID3 does not always find the shortest consistent tree, and it
is biased to favor trees that place attributes with high
information gain closest to the root.
Restriction Biases and Preference
Biases:
ID3 searches a complete hypothesis space (i.e., one
capable of expressing any finite discrete-valued
function). It searches incompletely through this
space, from simple to complex hypotheses, until its
termination condition is met (e.g., until it finds a
hypothesis consistent with the data).
The version space CANDIDATE-ELIMINATION
algorithm searches an incomplete hypothesis space
(i.e., one that can express only a subset of the
potentially teachable concepts), but it searches this
space completely, finding every hypothesis consistent
with the training data.
The inductive bias of ID3 follows from its search
strategy, whereas the inductive bias of the
CANDIDATE-ELIMINATION algorithm follows from
the definition of its search space.
The inductive bias of ID3 is thus a preference for
certain hypotheses over others (e.g., for shorter
hypotheses), with no hard restriction on the hypotheses
that can be eventually enumerated. This form of bias is
typically called a preference bias (or, alternatively, a
search bias). In contrast, the bias of the
CANDIDATE-ELIMINATION algorithm is in the form of a
categorical restriction on the set of hypotheses
considered. This form of bias is typically called a
restriction bias (or, alternatively, a language bias).
Typically, a preference bias is more desirable than a
restriction bias, because it allows the learner to work
within a complete hypothesis space that is assured to
contain the unknown target function. In contrast, a
restriction bias that strictly limits the set of potential
hypotheses is generally less desirable, because it
introduces the possibility of excluding the unknown
target function altogether.
Why Prefer Short Hypotheses?
Is ID3's inductive bias favoring shorter decision trees a
sound basis for generalizing beyond the training data?
William of Occam was one of the first to discuss this
question, around the year 1320, so this bias often goes by
the name of Occam's razor.
Occam's razor: Prefer the simplest hypothesis that fits the
data.
One argument is that because there are fewer short
hypotheses than long ones, it is less likely that one will find
a short hypothesis that coincidentally fits the training data.
In contrast, there are often many very complex hypotheses
that fit the current training data but fail to generalize
correctly to subsequent data.