CE0716-Data Warehouse and Mining_Compulsory
Subject Code: CE0716
Subject: Data Warehouse & Mining
Program: B.Tech CE
Semester: VII
Teaching Scheme (Hours per week)
Lecture: 3
Tutorial: 0
Practical: 2
Credits: 4
Examination Evaluation Scheme (Marks)
University Theory Examination: 40
University Practical Examination: 40
Continuous Internal Evaluation (CIE)-Theory: 60
Continuous Internal Evaluation (CIE)-Practical: 60
Total: 200
Course Objectives:
1. To learn how to gather and analyze large sets of data to gain useful business understanding
and how to produce a quantitative analysis report/memo with the necessary information to
make decisions.
2. To develop and apply critical thinking, problem-solving, and decision-making skills, and to
define knowledge discovery and data mining for skill development.
3. To recognize the key areas and issues in data mining.
4. To apply the techniques of clustering, classification, association finding, feature selection and
visualization to real-world data for employability.
5. To determine whether a real-world problem has a data mining solution.
6. To apply evaluation metrics to select data mining techniques.
CONTENTS
UNIT-I
Importance of Data Mining, Data Mining Functionalities, Classification of Data Mining Systems,
Data Mining Architecture, Major Issues in Data Mining, Applications of Data Mining, Social
Impacts of Data Mining.
Introduction to Data Warehouse and OLAP Technology for Data Mining
Data Warehouse, From Data Warehousing to Data Mining, OLAP versus OLTP, Data
Warehouse Architecture, Data Warehouse Development Approach, Multidimensional data
Model, Data Warehouse Design Schema
UNIT-II
Data Cleaning: Filling In Missing Values, Noisy Data Removal, Outlier Analysis, Data
Cleaning as a Process; Data Integration: Correlation Techniques, Entity Identification Problem,
Tuple Duplication Problem; Data Reduction: Principal Component Analysis, Sampling, Attribute
Subset Selection, Histograms; Data Transformation: Normalization, Concept Hierarchy
Generation, Aggregation and Discretization
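As an illustration of the Normalization topic listed in this unit, the following is a minimal sketch of two common techniques, min-max scaling and z-score standardization. It is not part of the prescribed syllabus material; the function names and the sample income values are hypothetical and chosen only for the example.

```python
from statistics import mean, pstdev


def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Linearly rescale values into the range [new_min, new_max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so map everything to the lower bound.
        return [new_min for _ in values]
    return [(v - lo) / (hi - lo) * (new_max - new_min) + new_min for v in values]


def z_score_normalize(values):
    """Standardize values to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # No spread in the data, so every value sits exactly at the mean.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


# Hypothetical attribute values, used only for illustration.
incomes = [12000, 73600, 98000]
scaled = min_max_normalize(incomes)
standardized = z_score_normalize(incomes)
```

Min-max scaling preserves the relative spacing of values but is sensitive to outliers, because a single extreme value stretches the range. Z-score standardization is often preferred when the minimum and maximum are unknown or unreliable.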
UNIT-III
Market Basket Analysis, Association Rule Mining, Association Rule Mining Algorithms: Apriori
Algorithm, FP Growth Algorithm; Mining of: Single-dimensional Association Rules, Multilevel
Association Rules, Multidimensional Association Rules and Constraint-based Association Rules
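As an illustration of the Apriori Algorithm listed in this unit, the following is a minimal sketch of level-wise frequent itemset mining with a join step and a prune step, plus a helper that computes rule confidence. It is not part of the prescribed syllabus material; the function names, the market-basket data, and the absolute support threshold are hypothetical, and the sketch favours readability over efficiency.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support_count} for itemsets meeting min_support (absolute count)."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1 candidates: every distinct item as a 1-itemset.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Count how many transactions contain each candidate.
        counts = {c: sum(1 for t in transactions if c <= t) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        k += 1
        # Join step: merge frequent (k-1)-itemsets into k-itemsets.
        joined = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: keep a candidate only if all its (k-1)-subsets are frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


def confidence(frequent, antecedent, consequent):
    """Confidence of antecedent -> consequent; both itemsets must be frequent."""
    a = frozenset(antecedent)
    return frequent[a | frozenset(consequent)] / frequent[a]


# Hypothetical market-basket data, used only for illustration.
baskets = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]
frequent = apriori(baskets, min_support=3)
```

With this data, calling `confidence(frequent, {"diapers"}, {"beer"})` divides the support count of {diapers, beer} by that of {diapers}. The prune step relies on the Apriori property that every subset of a frequent itemset must itself be frequent.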
UNIT-IV
Introduction to Spatial Data Mining, Multimedia Data Mining, Temporal Data Mining, Text and
Web Mining
Course Outcomes:
Text Books:
1. Data Mining: Concepts and Techniques by Jiawei Han and Micheline Kamber, Elsevier.
Reference Books:
Web Resources
LIST OF EXPERIMENTS
1 Study Practical: Introduction to Weka
To learn how to gather and analyze large sets of data to gain useful business understanding and
how to produce a quantitative analysis report/memo with the necessary information to make
decisions.
2 Study Practical: Introduction to RStudio
To learn how to gather and analyze large sets of data to gain useful business understanding and
how to produce a quantitative analysis report/memo with the necessary information to make
decisions.