Decision Tree


What is Learning ?

• Learning is a branch of AI

• A criticism of classical AI systems:
  – They cannot adapt to new situations
  – They do only whatever they are told

• The remedy: make the system L-E-A-R-N
Types of Learning

• Supervised Learning

  – We know the target, because of previous history
  – Eg. chess playing; loan applications classified as safe or risky
  – Target is categorical
  – Learning by examples
Types of Learning
• Unsupervised Learning

  – No target is given
  – Output may be continuous
  – Eg. predict the economic growth of India in 2004
  – Clustering
  – Learning by observation
Supervised Learning – Classification

Training Data  ->  Classification Algorithm  ->  Classification rules

Name    Age     Income  Credit_rating
Saran   <=30    low     fair
Bill    <=30    low     excellent
Susan   >40     med     fair
Geetha  31..40  high    excellent
Clara   >40     med     fair
Babu    31..40  high    excellent

Example rule learned:
  If age = "31..40" and income = "high"
  then credit_rating = "excellent"
Supervised Learning – Classification

The learned classification rules are validated on test data and then applied to new data.

New Data: ( John Henri, 31..40, high )
Credit rating ?  ->  excellent
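
As a quick illustration, the learned rule can be applied in code. A minimal Python sketch follows; the function name and the "unknown" fallback are illustrative assumptions, not part of these notes:

def credit_rating(age: str, income: str) -> str:
    # Hypothetical encoding of the single rule learned above.
    if age == "31..40" and income == "high":
        return "excellent"
    return "unknown"  # no rule fired; a full rule set would cover every case

# New data: ( John Henri, 31..40, high )
print(credit_rating("31..40", "high"))  # -> excellent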
Learning by Decision Tree
Induction
• Tree

  – A graphical representation
  – Nodes (drawn as rounded rectangles)
  – Branches
  – Leaves (drawn as ovals)
What is a Decision Tree ?
• A flow-chart-like tree structure

• Internal node – a test on an attribute

• Branch – an outcome of the test

• Leaf – a class label (categorical)


Example – "buys_computer"

                        Age ?
         <=30          31..40           >40
           |              |               |
       Student ?         yes       Credit_rating ?
        no    yes                excellent     fair
        |      |                     |           |
        no    yes                    no         yes
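
A minimal Python sketch of this tree, assuming a nested-dict representation (the layout and the function name are illustrative choices, not a prescribed format):

TREE = {
    "attribute": "age",
    "branches": {
        "<=30":   {"attribute": "student",
                   "branches": {"no": "no", "yes": "yes"}},
        "31..40": "yes",
        ">40":    {"attribute": "credit_rating",
                   "branches": {"excellent": "no", "fair": "yes"}},
    },
}

def classify(tree, sample):
    # Walk down the tree until a leaf (a plain class label) is reached.
    while isinstance(tree, dict):
        tree = tree["branches"][sample[tree["attribute"]]]
    return tree

print(classify(TREE, {"age": "<=30", "student": "yes"}))  # -> yes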
How to choose Root Node ?
• Choose the attribute with the highest Information Gain

• Information Gain – measures the reduction in the expected information needed to classify a given sample
How to choose Root Node ?
• Let S be a set of s training samples

• Ci , i = 1..m – the distinct classes

• si – the number of samples of S in class Ci

• I(s1, s2, …, sm) = - Σ_{i=1..m} pi log2(pi)
How to choose Root Node ?
• pi – the probability that an arbitrary sample belongs to class Ci

• pi = si / s
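
A minimal Python sketch of this formula, assuming the class counts s1..sm are passed as a list (the `if si > 0` guard treats 0 · log2 0 as 0):

import math

def expected_information(counts):
    # I(s1, ..., sm) = -sum(pi * log2(pi)), where pi = si / s
    s = sum(counts)
    return -sum((si / s) * math.log2(si / s) for si in counts if si > 0)

print(expected_information([1, 1]))  # -> 1.0 (two equally likely classes)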
Entropy & Gain
• E(A), where A is an attribute with v distinct values that partition S into subsets S1, …, Sv

• sij – the number of samples of class Ci in subset Sj

• Expected information (entropy) based on the partitioning by A:

  E(A) = Σ_{j=1..v} [ (s1j + … + smj) / s ] · I(s1j, …, smj)
Entropy & Gain

• Gain(A) = I(s1, s2, …, sm) - E(A)

• The attribute with the largest gain is selected as the root (test) node
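
E(A) and Gain(A) fit together in a short Python sketch; the function names are illustrative, and the demo uses the Age partition counts worked out on the next slides:

import math

def info(counts):
    # I(s1, ..., sm) over one list of class counts
    s = sum(counts)
    return -sum(c / s * math.log2(c / s) for c in counts if c > 0)

def entropy(partitions):
    # E(A): one class-count list [s1j, ..., smj] per value of attribute A
    total = sum(sum(p) for p in partitions)
    return sum(sum(p) / total * info(p) for p in partitions)

def gain(class_counts, partitions):
    # Gain(A) = I(s1, ..., sm) - E(A)
    return info(class_counts) - entropy(partitions)

# Age splits the 14 samples into (<=30): 2 yes / 3 no, (31..40): 4 / 0, (>40): 3 / 2
print(round(gain([9, 5], [[2, 3], [4, 0], [3, 2]]), 3))
# -> 0.247 (the slides' 0.246 comes from rounding intermediate values)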
Example
RID  Age     Income  Student  Credit_rating  Class: buys_computer
1    <=30    high    no       fair           no
2    <=30    high    no       excel          no
3    31..40  high    no       fair           yes
4    >40     medium  no       fair           yes
5    >40     low     yes      fair           yes
6    >40     low     yes      excel          no
7    31..40  low     yes      excel          yes
Example
RID  Age     Income  Student  Credit_rating  Class: buys_computer
8    <=30    medium  no       fair           no
9    <=30    low     yes      fair           yes
10   >40     medium  yes      fair           yes
11   <=30    medium  yes      excel          yes
12   31..40  medium  no       excel          yes
13   31..40  high    yes      fair           yes
14   >40     medium  no       excel          no
Solution
• Class – buys_computer
• Categories
  – yes – 9 records – s1
  – no – 5 records – s2
  – Total records – 14

I(s1, s2) = - (9/14) log2(9/14) - (5/14) log2(5/14)
          = 0.940
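
This value is easy to verify in Python:

import math

I = -(9/14) * math.log2(9/14) - (5/14) * math.log2(5/14)
print(round(I, 3))  # -> 0.94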
Information Gain Calculation
• Attribute – Age

1. <=30    yes (2) – s11
           no  (3) – s21     5 records

2. 31..40  yes (4) – s12
           no  (0) – s22     4 records

3. >40     yes (3) – s13
           no  (2) – s23     5 records
Information Gain Calculation

• I(s11, s21) = - (2/5) log2(2/5) - (3/5) log2(3/5)
              = 0.971

• I(s12, s22) = 0

• I(s13, s23) = 0.971
Entropy & Gain Calculation
E(age) = (5/14) I(s11, s21) + (4/14) I(s12, s22) + (5/14) I(s13, s23)
       = 0.694

Gain(age) = I(s1, s2) - E(age) = 0.246

Gain(income)        = 0.029
Gain(student)       = 0.151
Gain(credit_rating) = 0.048
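
All four gains can be reproduced directly from the 14 training records. A sketch follows; the variable names and the tuple encoding are my choices, the data is from the example tables above:

import math
from collections import Counter, defaultdict

# (age, income, student, credit_rating, class) for RIDs 1..14
DATA = [
    ("<=30", "high", "no", "fair", "no"),      ("<=30", "high", "no", "excel", "no"),
    ("31..40", "high", "no", "fair", "yes"),   (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"),      (">40", "low", "yes", "excel", "no"),
    ("31..40", "low", "yes", "excel", "yes"),  ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"),     (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excel", "yes"), ("31..40", "medium", "no", "excel", "yes"),
    ("31..40", "high", "yes", "fair", "yes"),  (">40", "medium", "no", "excel", "no"),
]

def info(counts):
    s = sum(counts)
    return -sum(c / s * math.log2(c / s) for c in counts if c > 0)

def gain(col):
    # Class counts per distinct value of the attribute in column `col`.
    groups = defaultdict(Counter)
    for row in DATA:
        groups[row[col]][row[-1]] += 1
    e = sum(sum(g.values()) / len(DATA) * info(list(g.values()))
            for g in groups.values())
    return info(list(Counter(r[-1] for r in DATA).values())) - e

for col, name in enumerate(["age", "income", "student", "credit_rating"]):
    print(name, round(gain(col), 3))
# age 0.247, income 0.029, student 0.152, credit_rating 0.048
# (0.246 / 0.151 on the slides come from rounding intermediate values)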
Test Attribute

• Age has the highest information gain

• So it is selected as the test attribute


                        Age ?
         <=30          31..40           >40

age = "<=30" partition (5 records):

Income  Student  Credit_rating  Class
high    no       fair           no
high    no       excel          no
medium  no       fair           no
low     yes      fair           yes
medium  yes      excel          yes

age = "31..40" partition (4 records):

Income  Student  Credit_rating  Class
high    no       fair           yes
low     yes      excel          yes
medium  no       excel          yes
high    yes      fair           yes

age = ">40" partition (5 records):

Income  Student  Credit_rating  Class
medium  no       fair           yes
low     yes      fair           yes
low     yes      excel          no
medium  yes      fair           yes
medium  no       excel          no
Generating Classification rules
from a decision tree

• If age = "<=30" AND student = "no" THEN buys_computer = "no"

• If age = "<=30" AND student = "yes" THEN buys_computer = "yes"
Generating Classification rules
from a decision tree

• If age = "31..40" THEN buys_computer = "yes"

• If age = ">40" AND credit_rating = "excel" THEN buys_computer = "no"

• If age = ">40" AND credit_rating = "fair" THEN buys_computer = "yes"
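
The five rules can be folded into one function. A minimal sketch, with an illustrative name and argument defaults that are not part of these notes:

def buys_computer(age, student=None, credit_rating=None):
    if age == "<=30":
        return "yes" if student == "yes" else "no"
    if age == "31..40":
        return "yes"
    if age == ">40":
        return "no" if credit_rating == "excel" else "yes"
    raise ValueError("unknown age range: " + str(age))

print(buys_computer("<=30", student="yes"))         # -> yes
print(buys_computer(">40", credit_rating="excel"))  # -> no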
Problem
• X = ( age = "<=30",
        income = "medium",
        student = "yes",
        credit_rating = "fair" )

Cannot predict ?
Other Learning Schemes

1. Naïve Bayesian Learning

2. Neural Network Learning
