7. Simple Classification
Naïve Rule
Classify all records as the majority class. Not a real method; it is introduced so that it can serve as a benchmark against which to measure other results.
[Pivot of Charges (Y/N) by Size (S/L) for the fraud example: 60% of firms are truthful, so the naïve rule classifies every firm as truthful.]
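A minimal sketch of the benchmark, assuming the ten firms (six truthful, four fraudulent, as in the worked example later) are held in a pandas DataFrame; the column name is illustrative:

```python
import pandas as pd

# Ten-firm fraud example: 6 truthful, 4 fraudulent (column name is illustrative)
firms = pd.DataFrame({"outcome": ["truthful"] * 6 + ["fraud"] * 4})

# Naive rule: predict the majority class for every record
majority_class = firms["outcome"].mode()[0]            # 'truthful'
predictions = [majority_class] * len(firms)

# Benchmark error rate = share of records not in the majority class (here 40%)
error_rate = (firms["outcome"] != majority_class).mean()
print(majority_class, error_rate)
```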
Nave Bayes
For a given new record to be classified, find other records like it (i.e., with the same values for the predictors). What is the prevalent class among those records? Assign that class to your new record.
Usage
Requires categorical variables; numerical variables must be binned and converted to categorical. Can be used with very large data sets. Example: spell check, where the computer attempts to assign your misspelled word to an established class (i.e., a correctly spelled word).
Relies on finding other records that share the same predictor values as the record to be classified. We want the probability of belonging to class C, given specified values of the predictors: the conditional probability P(Y = C | X1 = x1, ..., Xp = xp).
Two predictors: prior pending legal charges (yes/no) and size of firm (small/large). Classify based on the majority in each cell.
[Pivot of Charges (Y/N) by Size (S/L): classifying by the majority in each cell gives an error rate of 20%.]
Goal: classify (as fraudulent or as truthful) a small firm with charges filed. There are 2 firms like that, one fraudulent and the other truthful, so P(fraud | charges = y, size = small) = 1/2 = 0.50. Note: the calculation is limited to the two firms matching those characteristics.
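A sketch of this "exact" calculation in pandas; the encoding of the ten firms is illustrative but reproduces the cell counts used in the example:

```python
import pandas as pd

# Ten-firm example: prior charges (y/n), size (small/large), outcome (fraud/truthful)
firms = pd.DataFrame({
    "charges": ["y", "y", "y", "y", "n", "n", "n", "n", "n", "n"],
    "size":    ["small", "small", "large", "large", "small", "small",
                "small", "large", "large", "large"],
    "outcome": ["fraud", "truthful", "fraud", "fraud", "truthful", "truthful",
                "truthful", "truthful", "truthful", "fraud"],
})

# Keep only the records that exactly match the new firm (charges = y, size = small)
matches = firms[(firms["charges"] == "y") & (firms["size"] == "small")]

# P(fraud | charges = y, size = small), estimated from the matching records only
p_fraud = (matches["outcome"] == "fraud").mean()
print(len(matches), p_fraud)   # 2 matching firms, probability 0.50
```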
Problem
Even with large data sets, it may be hard to find other records that exactly match your record in terms of predictor values.
Assume independence of the predictor variables (within each class) and use the multiplication rule. This lets us find the same probability that the record belongs to class C, given its predictor values, without limiting the calculation to records that share all those same values.
Main idea: instead of looking at combinations of predictors (a crossed pivot table), look at each predictor separately. How can this be done? A probability trick!
Based on Bayes rule. Then make a simplifying assumption, and get a powerful classifier!
Conditional Probability
A = the event X = a; B = the event Y = b. P(A | B) denotes the probability of A given B (the conditional probability that A occurs given that B occurred).
P(A | B) = P(A ∩ B) / P(B), provided P(B) > 0
P(B | A) = P(A | B) P(B) / P(A)
P(Fraud | Charge) P(Charge) = P(Charge | Fraud) P(Fraud), so P(Fraud | Charge) = P(Charge | Fraud) P(Fraud) / P(Charge)
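A quick numerical check, using the counts from the worked fraud example later in the section (4 of 10 firms have charges filed; 3 of the 4 fraudulent firms have charges):

```python
# Bayes rule: P(Fraud | Charge) = P(Charge | Fraud) * P(Fraud) / P(Charge)
p_charge_given_fraud = 3 / 4   # 3 of the 4 fraudulent firms have charges filed
p_fraud = 4 / 10               # 4 of 10 firms are fraudulent
p_charge = 4 / 10              # 4 of 10 firms have charges filed

p_fraud_given_charge = p_charge_given_fraud * p_fraud / p_charge
print(p_fraud_given_charge)    # 0.75, matching the direct count: 3 of 4 charged firms are fraudulent
```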
We want to estimate P(Y = 1 | X1, ..., Xp), but we don't have enough examples of each possible profile x1, ..., xp in the training set. If we had P(X1, ..., Xp | Y = 1) instead, we could decompose it into P(X1 | Y = 1) P(X2 | Y = 1) ... P(Xp | Y = 1).
This is true if we can assume independence between X1, ..., Xp within each class. That means we could use single pivot tables! If the dependence is not extreme, it will still work reasonably well.
Independence Assumption
With the independence assumption, P(A ∩ B) = P(A) P(B). We can thus calculate
P(X1, ..., Xp | Y = 1) = P(X1 | Y = 1) P(X2 | Y = 1) ... P(Xp | Y = 1)
P(X1, ..., Xp | Y = 0) = P(X1 | Y = 0) P(X2 | Y = 0) ... P(Xp | Y = 0)
P(X1, ..., Xp) = P(X1, ..., Xp | Y = 1) P(Y = 1) + P(X1, ..., Xp | Y = 0) P(Y = 0)
1. All predictors must be categorical. From the training set, create a pivot table of Y against each separate X. From these we obtain P(X), P(X | Y = 1), P(X | Y = 0).
2. For a to-be-predicted observation with predictors X1, X2, ..., Xp, the software computes the probability of belonging to Y = 1 using the formula
   P(Y = 1 | X1, ..., Xp) = P(X1 | Y = 1) P(X2 | Y = 1) ... P(Xp | Y = 1) P(Y = 1) / P(X1, ..., Xp)
   Each probability in the formula is estimated from a pivot table, and P(Y = 1) is estimated as the proportion of 1s in the training set.
3. Use a cutoff to determine the classification of the observation. Default: cutoff = 0.5 (classify to the group that is most likely).
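A sketch of these three steps in pandas; the DataFrame `train` and its column names are assumed for illustration, and any set of categorical predictors would work:

```python
import pandas as pd

def naive_bayes_prob(train, y_col, new_record, positive=1):
    """Estimate P(Y = positive | X1, ..., Xp) from per-predictor pivot tables."""
    # Step 1: from the training set, build one pivot table of Y against each predictor
    cond_tables = {
        x: pd.crosstab(train[x], train[y_col], normalize="columns")  # P(X = value | class)
        for x in new_record
    }
    priors = train[y_col].value_counts(normalize=True)               # P(Y = class)

    # Step 2: multiply P(Xj | Y = c) over the predictors, times the prior P(Y = c)
    numerators = {}
    for c in priors.index:
        p = priors[c]
        for x, value in new_record.items():
            p *= cond_tables[x].loc[value, c]
        numerators[c] = p

    # P(X1, ..., Xp) = sum over classes of P(X1, ..., Xp | Y = c) P(Y = c)
    return numerators[positive] / sum(numerators.values())

# Step 3 (illustrative usage): classify with the default cutoff of 0.5
# prob = naive_bayes_prob(train, "fraud", {"charges": "y", "size": "small"}, positive=1)
# label = 1 if prob >= 0.5 else 0
```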
Note that the probability estimate does not differ greatly from the exact one. All records are used in the calculations, not just those matching the predictor values; this makes the calculations practical in most circumstances. The method relies on the assumption of independence between predictor variables within each class.
Independence Assumption
Not strictly justified (variables are often correlated with one another), but often good enough.
Worked example: exact versus naïve Bayes estimates for the ten-firm fraud data.

Counts of (truthful, fraudulent) firms by Charges and Size:

            Small   Large   Total
Charges Y   (1,1)   (0,2)   (1,3)
Charges N   (3,0)   (2,1)   (5,1)
Total       (4,1)   (2,3)   (6,4)

Exact estimates from the matching cells: P(F | C=y, S=small) = 0.5 and P(F | C=n, S=small) = 0.

Naïve Bayes pieces for the fraud class: P(C=y | F) = 0.75, P(C=n | F) = 0.25, P(S=small | F) = 0.25, P(S=large | F) = 0.75, P(F) = 0.40. For example, P(C=y, S=small | F) P(F) = 0.75 * 0.25 * 0.40 = 0.075; the other cells are 0.225 (y, large), 0.025 (n, small), and 0.075 (n, large).

For the truthful class: P(C=y | T) = 0.17, P(C=n | T) = 0.83, P(S=small | T) = 0.67, P(S=large | T) = 0.33, P(T) = 0.60, giving P(C, S | T) P(T) of 0.067 (y, small), 0.034 (y, large), 0.334 (n, small), and 0.164 (n, large).

Combining: P(F | C, S) = P(C, S | F) P(F) / P(C, S) = P(C | F) P(S | F) P(F) / P(C, S), where P(C, S) = P(C, S | F) P(F) + P(C, S | T) P(T). For a small firm with charges filed this gives 0.075 / (0.075 + 0.067), about 0.53, close to the exact value of 0.50.
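The same numbers can be checked with a few lines of arithmetic, using the values from the tables above:

```python
# Naive Bayes estimate for a small firm with charges filed
p_f = 0.40                            # P(F): 4 of 10 firms are fraudulent
num_fraud    = (3/4) * (1/4) * p_f    # P(C=y|F) * P(S=small|F) * P(F) = 0.075
num_truthful = (1/6) * (4/6) * 0.60   # P(C=y|T) * P(S=small|T) * P(T) ~ 0.067

p_cs = num_fraud + num_truthful       # P(C=y, S=small) under the independence assumption
print(num_fraud / p_cs)               # ~0.53, close to the exact estimate of 0.50
```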
The good: simple; can handle a large number of predictors; high performance accuracy when the goal is ranking; pretty robust to the independence assumption!
The bad: need to categorize continuous predictors; predictors with rare categories lead to zero probabilities (if such a category is important, this is a problem); gives biased probabilities of class membership; no insight about the importance/role of each predictor.
XLMiner output (sheet NNB-Output1) for an example with two binary predictors, Online and CreditCard, and outcome accept:
Prior: P(accept = 1) = 0.095
Conditional probabilities for class 1: P(Online = 0 | 1) = 0.374, P(Online = 1 | 1) = 0.626, P(CreditCard = 0 | 1) = 0.699, P(CreditCard = 1 | 1) = 0.301. The corresponding table for class 0 is also reported.
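A table like this could be reproduced in pandas along the following lines; the DataFrame and column names in the usage comment are hypothetical, and the numbers above come from the XLMiner run:

```python
import pandas as pd

def nb_summary(train, y_col, x_cols):
    """Prior and per-predictor conditional probability tables, in the style shown above."""
    prior = train[y_col].value_counts(normalize=True)                  # e.g. P(accept = 1)
    cond = {
        x: pd.crosstab(train[y_col], train[x], normalize="index")      # P(X = value | class)
        for x in x_cols
    }
    return prior, cond

# Hypothetical usage on the training partition:
# prior, cond = nb_summary(train_df, "accept", ["Online", "CreditCard"])
# prior[1]               -> about 0.095 in the run shown above
# cond["Online"].loc[1]  -> P(Online = 0 | accept = 1), P(Online = 1 | accept = 1)
```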
Scored validation data (sheet NNB-ValidScore1), first records:
Row Id   Predicted Class   Actual Class   Online   CreditCard
2        0                 0              0        0
3        0                 0              0        0
7        0                 0              1        0
8        0                 0              0        1
11       0                 0              0        0
13       0                 0              0        0
14       0                 0              1        0
15       0                 0              0        0
16       0                 0              1        1
K-Nearest Neighbors
Basic Idea
For a given record to be classified, identify nearby records. "Near" means records with similar predictor values X1, X2, ..., Xp. Classify the record as whatever the predominant class is among the nearby records (the neighbors).
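A minimal from-scratch sketch of the idea; Euclidean distance is assumed, and in practice the predictors should be on comparable scales (or standardized) before computing distances:

```python
import numpy as np
from collections import Counter

def knn_classify(X_train, y_train, x_new, k=3):
    """Classify x_new as the predominant class among its k nearest training records."""
    dists = np.linalg.norm(X_train - x_new, axis=1)   # distance to every training record
    nearest = np.argsort(dists)[:k]                   # indices of the k closest records
    votes = Counter(y_train[i] for i in nearest)      # count classes among the neighbors
    return votes.most_common(1)[0][0]                 # majority class wins

# Example usage: X_train is an (n, p) array of predictor values, y_train the known classes
# label = knn_classify(X_train, y_train, np.array([60.0, 18.4]), k=3)
```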
Choosing k
k = 1 means use the single nearest record; k = 5 means use the 5 nearest records.
Typically choose that value of k which has lowest error rate in validation data
[Scatter plot of predictors X1 and X2 illustrating classification of a new record by its k = 3 nearest neighbors.]
Low values of k (1, 3, ...) capture local structure in the data (but also noise). High values of k provide more smoothing and less noise, but may miss local structure. Note: the extreme case of k = n (i.e., the entire data set) is the same thing as the naïve rule (classify all records according to the majority class).
Data: 24 households classified as owning or not owning riding mowers. Predictors: Income, Lot Size.
Income   Lot_Size   Ownership
60.0     18.4       owner
85.5     16.8       owner
64.8     21.6       owner
61.5     20.8       owner
87.0     23.6       owner
110.1    19.2       owner
108.0    17.6       owner
82.8     22.4       owner
69.0     20.0       owner
93.0     20.8       owner
51.0     22.0       owner
81.0     20.0       owner
75.0     19.6       non-owner
52.8     20.8       non-owner
64.8     17.2       non-owner
43.2     20.4       non-owner
84.0     17.6       non-owner
49.2     17.6       non-owner
59.4     16.0       non-owner
66.0     18.4       non-owner
47.4     16.4       non-owner
33.0     18.8       non-owner
51.0     14.0       non-owner
63.0     14.8       non-owner
XLMiner Output
For each record in the validation data (6 records), XLMiner finds neighbors among the training data (18 records). The record is scored for k = 1, 2, ..., 18. The best k seems to be k = 8; k = 9, k = 10, and k = 14 also share the low error rate, but it is best to choose the lowest such k.
Value of k   % Error Training   % Error Validation
1            0.00               33.33
2            16.67              33.33
3            11.11              33.33
4            22.22              33.33
5            11.11              33.33
6            27.78              33.33
7            22.22              33.33
8            22.22              16.67   <-- best k
9            22.22              16.67
10           22.22              16.67
11           16.67              33.33
12           16.67              16.67
13           11.11              33.33
14           11.11              16.67
15           5.56               33.33
16           16.67              33.33
17           11.11              33.33
18           50.00              50.00
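The same search could be done in scikit-learn along these lines; this is a sketch that assumes an 18/6 train/validation split of the riding-mower data, with standardization since Income and Lot_Size are on different scales:

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

def best_k(X_train, y_train, X_valid, y_valid, k_max=18):
    """Return the smallest k with the lowest validation error rate, plus all error rates."""
    scaler = StandardScaler().fit(X_train)                 # scale using training data only
    Xt, Xv = scaler.transform(X_train), scaler.transform(X_valid)

    errors = {}
    for k in range(1, k_max + 1):
        knn = KNeighborsClassifier(n_neighbors=k).fit(Xt, y_train)
        errors[k] = 1 - knn.score(Xv, y_valid)             # validation error rate
    lowest = min(errors.values())
    return min(k for k, e in errors.items() if e == lowest), errors

# Hypothetical usage on an 18/6 split of the riding-mower data:
# k, errors = best_k(X_train, y_train, X_valid, y_valid)
```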
For a numerical response, instead of a majority vote determining the class, use the average of the neighbors' response values. This may be a weighted average, with the weight decreasing with distance.
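scikit-learn's KNeighborsRegressor covers both variants; a minimal sketch (training data and new records are left as hypothetical placeholders):

```python
from sklearn.neighbors import KNeighborsRegressor

# Plain average of the k nearest neighbors' response values
knn_avg = KNeighborsRegressor(n_neighbors=5)

# Weighted average, with weight decreasing as distance increases
knn_wtd = KNeighborsRegressor(n_neighbors=5, weights="distance")

# Hypothetical usage, given training predictors X_train and a numerical response y_train:
# knn_wtd.fit(X_train, y_train)
# y_pred = knn_wtd.predict(X_new)
```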
Advantages
Simple. No assumptions required about normal distributions, etc. Effective at capturing complex interactions among variables without having to define a statistical model.
Shortcomings
A very large training set is needed as the number of predictors p grows. This is because the expected distance to the nearest neighbor increases with p (with a large vector of predictors, all records end up far away from each other). In addition, in a large training set it takes a long time to compute the distances to all the neighbors and then identify the nearest one(s). Together these issues constitute the curse of dimensionality.
Remedies: reduce the dimension of the predictors (e.g., with PCA), or use computational shortcuts that settle for "almost nearest" neighbors.
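One way to apply the first remedy in scikit-learn is to put PCA in front of the classifier; this is a sketch, and the number of components is an assumption that would be tuned in practice:

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.neighbors import KNeighborsClassifier

# Reduce the predictors to a few principal components before the distance calculations
knn_reduced = make_pipeline(
    StandardScaler(),                  # PCA and k-NN are both scale-sensitive
    PCA(n_components=5),               # assumed number of components
    KNeighborsClassifier(n_neighbors=8),
)

# Hypothetical usage: knn_reduced.fit(X_train, y_train); knn_reduced.predict(X_new)
```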
Summary
Naïve rule: a benchmark. Naïve Bayes and K-NN are two variations on the same theme: classify a new record according to the class of similar records. No statistical models are involved. These methods pay attention to complex interactions and local structure. Computational challenges remain.