FDP Day1
FDP Day1
Data Mining
and
Machine Learning
Dr.B.Santhosh Kumar,
Associate Professor,
G. Pulla Reddy Engineering College(Autonomous),
Kurnool.
Introduction
What is Data Mining?
Targeted marketing
identify likely responders to promotions
Data reduction
Dimensionality reduction
Data compression
Data transformation
Normalization
Forms of Data Preprocessing
20
Data Cleaning
Data in the Real World Is Dirty: Lots of potentially
incorrect data, e.g., instrument faulty, human or
computer error, transmission error
incomplete: lacking attribute values, lacking certain
attributes of interest
e.g., Occupation=“ ” (missing data)
73,600 54,000
1.225
Ex. Let μ = 54,000, σ = 16,000. Then 16,000
Normalization by decimal scaling
v
v' j Where j is the smallest integer such that Max(|ν’|) < 1
10
22
The Traditional Approach
Use of Machine Learning
Automatic Adaptation
Machine Learning helps Humans
Learn
Types of Machine Learning Algorithms
Supervised learning