Unit 1 DWDM
Unit 1 DWDM
Course Name:
Data Ware Housing and Data Mining
Credits : 4
Question No. 1 should be compulsory and cover
the entire syllabus.
1. Discrete Mathematics
2. Information System Concept
COURSE OUTCOMES (COs)
1. Understand the various component of Datawarehouse
2. Appreciate the strengths and limitations of various data
mining and data warehousing models
3. Critically evaluate data quality to advocate application
of data pre processing techniques.
4. Describe different methodologies used in data mining
and data ware housing.
5. Design a data mart or data warehouse for any
organization
6. Test real data sets using popular data mining tools such
as WEKA
UNIT–I
Chapter/Book Reference: TB3[Chapters - 1,2,3]
ETL tools: -
Defining The Business Requirements:
Dimensional Analysis,
Information Packages – A New Concept,
Requirements Gathering Methods,
Requirements Definition:
Scope And Content
TEXTBOOKS:
RB2. Ian H., Eibe Frank, Mark A. Hall, Christopher Pal “Data Mining:
Practical Machine Learning Tools and Techniques” Published by Morgan
Kaufmann; 4th edition ,December 1, 2016
RB3. G. K. Gupta, “Introduction to Data Mining with Case Studies”, PHI, 2006
RB4. Alex Berson and Stephen J.Smith, “Data Warehousing, Data Mining &
OLAP”, Tata McGraw Hill, 1 July 2017
•Improved Decision-Making:
By providing a centralized view of data,
data warehouses enable organizations to make
informed decisions based on comprehensive analysis.
Enhanced Data
Quality: The
ETL process
helps ensure that
data is cleaned,
transformed, and
standardized,
leading to higher
data quality.
Historical
Analysis: Data
warehouses
store historical
data, allowing
organizations to
analyze trends
over time and
make forecasts.
Performance Optimization: Data warehouses are optimized for
read-heavy operations,
allowing for faster query performance compared to traditional databases.
Difference between
Database System
and
Data Warehouse
Definition
Database System
A Database System is a software application
that allows users
to create,
read,
update, and
delete data in a structured format.
Data Warehouse
Data Warehouses often employ a denormalized data structure,
such as star or snowflake schemas,
to optimize query performance.