
Comprehensive Notes on Decision Trees (for Examination)

Introduction to Decision Trees

• A supervised learning algorithm used for both classification and regression.


• Splits data into branches based on feature values until a decision/leaf node is reached.

Interpretation of Decision Trees

• Root Node: Feature giving maximum purity gain


• Internal Nodes: Decision points
• Leaf Nodes: Final predictions

Building Decision Trees

• Choose best feature at each step using impurity measures (Entropy, Gini, etc.)
• Recursively split until stopping condition is met
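For example, a minimal sklearn sketch of this fit-and-predict loop (the toy data and parameter values are illustrative):

from sklearn.tree import DecisionTreeClassifier

# Toy data: two features (age, income), two classes; values are made up
X = [[25, 40000], [30, 60000], [45, 80000], [35, 30000], [50, 90000], [23, 20000]]
y = [0, 1, 1, 0, 1, 0]

# fit() performs the recursive splitting; max_depth is one stopping condition
tree = DecisionTreeClassifier(criterion="entropy", max_depth=3)
tree.fit(X, y)

print(tree.predict([[28, 55000]]))  # walks the tree down to a leaf prediction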

⚖ Tree Models vs Linear Models

• Trees: Handle non-linearity, no need for feature scaling


• Linear Models: Work best with linearly separable data

Memory Tip: "Trees branch smartly, lines draw plainly"

Decision Trees for Regression

• Predicts mean value of target in each region


• Uses Mean Squared Error (MSE) or Mean Absolute Error (MAE) to measure impurity

Formula: MSE = (1/n) ∑ᵢ₌₁ⁿ (yᵢ − ȳ)²

Example:

• Leaf 1: [200, 210, 190] → Predicted y = 200


• Leaf 2: [400, 390, 410] → Predicted y = 400
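A small sketch reproducing those leaf means with sklearn's DecisionTreeRegressor (the single feature and its values are illustrative):

from sklearn.tree import DecisionTreeRegressor

# One illustrative feature that cleanly separates the two regions
X = [[1], [2], [3], [10], [11], [12]]
y = [200, 210, 190, 400, 390, 410]

reg = DecisionTreeRegressor(max_depth=1)  # one split -> two leaves
reg.fit(X, y)

print(reg.predict([[2], [11]]))  # [200. 400.]: the mean of each leaf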

Regression Tree Building Process

1. Calculate the MSE of the target variable.
2. Split the data on each attribute and compute the MSE of each resulting node.
3. Subtract the weighted resulting MSE from the original MSE → MSE Reduction.
4. Choose the attribute with the highest MSE reduction.
5. Repeat recursively until the MSE is low and each node is homogeneous.
6. Final prediction at a leaf = average of its target values (see the sketch below).
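A NumPy sketch of steps 1–4, scoring the split into the two example leaves above:

import numpy as np

def mse(y):
    # MSE of a node: mean squared deviation from the node's mean
    y = np.asarray(y, dtype=float)
    return np.mean((y - y.mean()) ** 2)

parent = [200, 210, 190, 400, 390, 410]
left, right = [200, 210, 190], [400, 390, 410]

# Weighted MSE of the children after the split
n, n_l, n_r = len(parent), len(left), len(right)
post_mse = (n_l / n) * mse(left) + (n_r / n) * mse(right)

print(mse(parent) - post_mse)  # MSE reduction; pick the split that maximizes this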

⚖ Impurity Measures for Classification

1. Classification Error:

E = 1 − max(pᵢ)

2. Gini Index:

G = ∑ᵢ₌₁ᵏ pᵢ(1 − pᵢ)

3. Entropy:

D = −∑ᵢ₌₁ᵏ pᵢ log₂(pᵢ)

Memory Tip: "Entropy = Uncertainty, Gini = Diversity, Error = Simplicity"
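As a quick sketch, the three measures written as Python functions of a class-probability vector p:

import numpy as np

def classification_error(p):          # E = 1 - max(p_i)
    return 1 - np.max(p)

def gini(p):                          # G = sum p_i (1 - p_i)
    p = np.asarray(p)
    return np.sum(p * (1 - p))

def entropy(p):                       # D = -sum p_i log2(p_i); zero probabilities skipped
    p = np.asarray(p)
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

p = [0.5, 0.5]  # a perfectly mixed two-class node
print(classification_error(p), gini(p), entropy(p))  # 0.5 0.5 1.0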

Information Gain (Entropy-Based)

Gain = D − D_A

• D: Entropy before split


• D_A: Weighted avg. entropy after split

Example:

• Parent: 2 of class A, 2 of class B → D = 1.0


• Split: Left = [A, A], Right = [B, B] → D_A = 0
• Gain = 1 - 0 = 1.0 (Perfect Split)

Splitting Based on Feature Type

1. Nominal categorical feature (k categories): 2^(k−1) − 1 possible binary splits

2. Ordinal or continuous feature (n values): sort the values, then try the n − 1 split points between consecutive values

Goal: Maximize homogeneity (minimize impurity)

📊 Weighted Post-Split Impurity:

Post-Impurity = (n_L / n) · D_L + (n_R / n) · D_R

ΔImpurity = D − Post-Impurity

Choose split with maximum gain (i.e., largest ∆ Impurity).
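A short sketch computing ΔImpurity for the perfect split from the Information Gain example:

import numpy as np

def entropy(labels):
    # Entropy of a node computed from its raw class labels
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

parent = ["A", "A", "B", "B"]
left, right = ["A", "A"], ["B", "B"]

n, n_l, n_r = len(parent), len(left), len(right)
post_impurity = (n_l / n) * entropy(left) + (n_r / n) * entropy(right)
print(entropy(parent) - post_impurity)  # 1.0, the maximum possible gain here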

Disadvantages of Decision Trees

• Overfitting: Trees grow too deep and memorize the training data
• Instability: Small changes in the data → large changes in the tree
• Bias: Favors features with many levels
• Poor linear fit: Cannot model smooth linear trends

Memory Tip: "D.O.S.E.: Deep trees, Outliers, Sensitive, Easy to overfit"

✂ Tree Truncation vs Tree Pruning

• Truncation (pre-pruning): stops tree growth early; risk of underfitting
• Pruning (post-pruning): cuts weak branches after full growth; better generalization
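A sketch contrasting the two in sklearn (the iris dataset and the specific parameter values are stand-ins):

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Truncation (pre-pruning): stop early by capping depth during growth
pre = DecisionTreeClassifier(max_depth=3).fit(X_train, y_train)

# Pruning (post-pruning): grow fully, then cut weak branches via cost-complexity pruning
post = DecisionTreeClassifier(ccp_alpha=0.01).fit(X_train, y_train)

print(pre.score(X_test, y_test), post.score(X_test, y_test))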

Hyperparameter Tuning for Trees

• max_depth: Limits tree depth, prevents overfitting
• min_samples_split: Minimum samples required to split a node
• min_samples_leaf: Minimum samples required at a leaf
• max_features: Maximum features considered per split
• criterion: Impurity function (Gini, Entropy, MSE)
• ccp_alpha: Cost-complexity pruning strength

Tip: Use Grid Search or Randomized Search + Cross-Validation
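The grid route appears in the GridSearchCV section below; as a sketch of the randomized route (the parameter ranges here are illustrative):

from sklearn.model_selection import RandomizedSearchCV
from sklearn.tree import DecisionTreeClassifier

param_dist = {
    'max_depth': [3, 5, 10, None],
    'min_samples_leaf': [1, 2, 5, 10],
    'ccp_alpha': [0.0, 0.001, 0.01],
}

# Samples n_iter random combinations instead of trying every one
search = RandomizedSearchCV(DecisionTreeClassifier(), param_dist,
                            n_iter=10, cv=5, random_state=42)
# search.fit(X, y), then inspect search.best_params_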

🔍 Feature Importance

• Importance = Total impurity reduction caused by the feature across all splits

Example:

• Income: 0.40
• Age: 0.04
• City: 0.01

Memory Tip: "More impurity it kills, more important it feels."
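In sklearn this quantity is exposed directly as feature_importances_; a minimal sketch (the tiny dataset is made up, with Income deliberately the most predictive column):

import pandas as pd
from sklearn.tree import DecisionTreeClassifier

# Made-up data: only Income cleanly separates the classes
X = pd.DataFrame({
    'Income': [20, 80, 75, 30, 90, 25],
    'Age':    [25, 40, 28, 45, 35, 30],
    'City':   [0, 1, 0, 1, 1, 0],
})
y = [0, 1, 1, 0, 1, 0]

tree = DecisionTreeClassifier(random_state=0).fit(X, y)

# Normalized total impurity reduction contributed by each feature
importances = pd.Series(tree.feature_importances_, index=X.columns)
print(importances.sort_values(ascending=False))  # Income should dominate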

Log Base 2 Calculation (log₂)


Formula: log₂(x) = log₁₀(x) / log₁₀(2), where log₁₀(2) ≈ 0.3010
Or using natural log: log₂(x) = ln(x) / ln(2), where ln(2) ≈ 0.6931

Steps on calculator:

1. Use log(x) or ln(x)


2. Divide by log(2) or ln(2)

🔍 Entropy with Equal Classes (3 Classes Example)

• Class Distribution: [A, B, C] → each with 1/3

Entropy = −3 × (1/3 × log₂(1/3)) = log₂(3) ≈ 1.585

Memory Tip: Maximum entropy = log2 (k) where k = number of equally likely classes.
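A quick sketch verifying both the change-of-base formula and the three-class entropy:

import math

# Change of base: log2(x) = log10(x)/log10(2) = ln(x)/ln(2)
x = 3
print(math.log10(x) / math.log10(2))  # ≈ 1.585
print(math.log(x) / math.log(2))      # same value via natural log

# Entropy of three equally likely classes: -3 * (1/3) * log2(1/3) = log2(3)
p = 1 / 3
print(-3 * p * math.log2(p))          # ≈ 1.585 = log2(3)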

⭐ Cross-Validation & K-Fold Cross-Validation

Cross-Validation

• Technique to evaluate model performance more reliably than a single train-test split
• Helps avoid overfitting by testing on multiple data subsets

♻ K-Fold Cross-Validation

1. Split the dataset into K equal parts (folds)
2. For each fold: use it as the test set and the remaining K − 1 folds as the training set
3. Train and evaluate the model on that split
4. Compute the average score across the K runs

Example (K=5):

• Run model 5 times, each time a different fold is test set

Memory Tip: "K parts, K turns as test. Judge by average."
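A minimal sketch of 5-fold cross-validation in sklearn (the iris dataset is just a stand-in):

from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Five runs: each fold takes one turn as the test set
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=5)
print(scores)         # one score per fold
print(scores.mean())  # judge by the average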

GridSearchCV (Grid Search with Cross-Validation)

• Used to find the best hyperparameters for a model


• Tries all combinations of given parameters using cross-validation

Steps:

1. Define parameter grid:

param_grid = {
'max_depth': [3, 5, 10],
'min_samples_split': [2, 5, 10]
}

2. Apply GridSearchCV:

from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

model = DecisionTreeClassifier()
grid_search = GridSearchCV(model, param_grid, cv=5)
grid_search.fit(X, y)

3. Inspect the best parameters:

grid_search.best_params_
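After fitting, grid_search.best_score_ reports the best mean cross-validated score, and grid_search.best_estimator_ is the tree refitted with the winning parameters.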

Memory Tip: "Grid = Try All, CV = Test All"
