
Hands on Machine Learning, 2nd Edition

Chapter 6 – Decision Trees

Prepared by Glenn Miller for the San Diego Machine Learning Meetup™ group
Decision Tree (DT) + / –

Advantages
• Simple to understand and interpret ('White Box' model)
• Little data prep (e.g. no scaling)
• Versatile (classification and regression)
• Cost of using the tree is logarithmic in the number of data points used to train the tree
• Can handle multi-output problems
• Can validate using statistical tests
• Performs well even if its assumptions are somewhat violated

Disadvantages
• Prone to overfitting (must restrict degrees of freedom)
• Can be unstable (small data variations produce big changes to the tree)
• Predictions are piecewise constant approximations (not smooth or continuous)
• DT learners create biased trees if some classes dominate
• Practical DT algorithms cannot guarantee to return the globally optimal DT, because learning an optimal DT is NP-complete

Source: Scikit-Learn documentation


Computational Complexity
• Predictions are fast – roughly O(log₂(m)) – even with large training sets

• The training algorithm compares all available* features on all samples at each node

• Training complexity: O(n × m log₂(m)), where n is the number of features and m the number of training instances

• Presorting the data (presort=True) can speed up training for small data sets (note: the presort parameter has been removed from recent scikit-learn versions)

*If max_features is set, the algorithm considers at most max_features features at each split, though it may inspect more to find at least one valid partition of the node samples (see the sketch below)
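
A minimal sketch (not from the slides), assuming scikit-learn and the built-in Iris dataset, showing how max_features limits the features considered at each split:

```python
# Minimal sketch: limiting the features considered at each split.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Consider at most 2 of the 4 Iris features at each split; training still
# compares those features on all node samples, while prediction only walks
# a single root-to-leaf path.
tree_clf = DecisionTreeClassifier(max_features=2, random_state=42)
tree_clf.fit(X, y)

print(tree_clf.predict(X[:3]))
```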
Regularization and Pruning
Regularize: prevent the tree from growing too large by limiting hyperparameters before growing
the tree (e.g., set max_depth or min_samples_leaf)

Prune: let the tree grow fully, then replace irrelevant nodes with leaves (see the sketch below)
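
A minimal sketch (not from the slides) of both ideas in scikit-learn: max_depth and min_samples_leaf regularize before growth, while ccp_alpha applies scikit-learn's cost-complexity pruning after growth. The specific values here are arbitrary:

```python
# Minimal sketch: regularizing before growth vs. pruning after growth.
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Regularize up front: cap the depth and require at least 5 samples per leaf
regularized = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5).fit(X, y)

# Prune after growth: cost-complexity pruning removes nodes whose improvement
# does not justify the added complexity (larger ccp_alpha -> smaller tree)
pruned = DecisionTreeClassifier(ccp_alpha=0.02).fit(X, y)

print(regularized.get_depth(), pruned.get_depth())
```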
Impurity

Gini impurity index
• Gi = 1 − Σk (Pi,k)²   (summing over the n classes)

Entropy / Information Gain
• Hi = − Σk Pi,k log₂(Pi,k)   (summing over the n classes with Pi,k ≠ 0)

Where Pi,k is the ratio of class-k instances among the training instances in the ith node (a sketch computing both follows)
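
A minimal sketch (not from the slides) computing both impurity measures for a single node from its class counts; the example counts are arbitrary:

```python
# Minimal sketch: Gini impurity and entropy of one node from its class counts.
import numpy as np

def gini(counts):
    """G_i = 1 - sum_k P_{i,k}^2."""
    p = np.asarray(counts, dtype=float)
    p = p / p.sum()
    return 1.0 - np.sum(p ** 2)

def entropy(counts):
    """H_i = -sum_k P_{i,k} * log2(P_{i,k}), skipping classes with P_{i,k} = 0."""
    p = np.asarray(counts, dtype=float)
    p = p / p.sum()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

# A node holding 0 / 49 / 5 instances of three classes
print(gini([0, 49, 5]), entropy([0, 49, 5]))
```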
CART Algorithm (Scikit-Learn)
Splits the training set into two subsets using a single feature k and a threshold tk

Classification (DecisionTreeClassifier)
• Predict a class in each node
• Minimize impurity
• Cost function: J(k, tk) = (mleft / m) · Gleft + (mright / m) · Gright

Regression (DecisionTreeRegressor)
• Predict a value in each node
• Minimize MSE
• Cost function: J(k, tk) = (mleft / m) · MSEleft + (mright / m) · MSEright

G is the impurity of the subset; m is the number of instances in the subset (a sketch of the split cost follows)
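
A minimal sketch (not from the slides) of the regression cost for one candidate split; CART evaluates this over every feature and threshold and keeps the pair (k, tk) with the lowest cost. The toy data below is made up:

```python
# Minimal sketch: J(k, t_k) = (m_left/m)*MSE_left + (m_right/m)*MSE_right
# for one candidate split of a single feature.
import numpy as np

def mse(values):
    return np.mean((values - values.mean()) ** 2) if len(values) else 0.0

def split_cost(x, y, threshold):
    left, right = y[x <= threshold], y[x > threshold]
    m = len(y)
    return len(left) / m * mse(left) + len(right) / m * mse(right)

# Toy 1-D regression data with two plateaus
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([1.1, 0.9, 1.0, 3.0, 3.1, 2.9])

print(split_cost(x, y, 3.5))   # low cost: the split separates the two plateaus
print(split_cost(x, y, 1.5))   # higher cost: the right side still mixes both levels
```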
(Some) Iris Species
[Figure: Iris flower photos – Setosa, Versicolor]

Classification
[Figure]

Regression
[Figure]
Regularization / Pruning

[Figure panels: Classification, Regression]

Source: HOML 2nd edition pp. 182, 184 / https://github.com/ageron/handson-ml2/blob/master/06_decision_trees.ipynb


Instability

• Sensitivity to training set rotation
• Sensitivity to training set details – removing just one data point can produce a very different tree (see the sketch below)

Source: HOML 2nd edition pp. 185, 186 / https://github.com/ageron/handson-ml2/blob/master/06_decision_trees.ipynb
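
A minimal sketch (not from the slides), in the spirit of the HOML example of removing the widest Iris versicolor: refit the same depth-2 tree with and without that one instance and compare the learned splits, which can change noticeably:

```python
# Minimal sketch: one removed instance can change the fitted tree.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
X, y = iris.data[:, 2:], iris.target  # petal length and petal width only

full = DecisionTreeClassifier(max_depth=2, random_state=42).fit(X, y)

# Drop the widest Iris versicolor (petals 4.8 cm long, 1.8 cm wide) and refit
keep = ~((X[:, 0] == 4.8) & (X[:, 1] == 1.8) & (y == 1))
reduced = DecisionTreeClassifier(max_depth=2, random_state=42).fit(X[keep], y[keep])

# The chosen thresholds (and even the chosen features) can differ between the trees
print(export_text(full, feature_names=["petal length", "petal width"]))
print(export_text(reduced, feature_names=["petal length", "petal width"]))
```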


Conclusion
• Decision trees are powerful, versatile, and easy to understand

• They have limitations, which can be addressed – e.g. averaging many trees reduces instability (random forests; see the sketch below)

• More on this in Chapter 7, 'Ensemble Learning and Random Forests'
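
A minimal sketch (not from the slides), previewing the Chapter 7 idea that averaging many randomized trees dampens single-tree instability:

```python
# Minimal sketch: averaging many randomized trees (a random forest).
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)

# 100 trees, each grown on a bootstrap sample with random feature subsets;
# their votes are aggregated, which smooths out the variance of any single tree
forest = RandomForestClassifier(n_estimators=100, random_state=42).fit(X, y)
print(forest.predict(X[:3]))
```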
