BI Introduction
BI Introduction
DATA WAREHOUSE
Goals
After completing this chapter, you will be able to:
Describe the role of business intelligence in providing
comprehensive business decision support
Define BI architecture and its components
Summarize main applications and business value for BI
Differentiate between operational data and decision
support data
Identify the purpose, characteristics, and components of a
data warehouse
2
THE BUSINESS DEMAND FOR DATA,
INFORMATION, AND ANALYTICS
Data → information → knowledge → data…
knowledge that helps Organizations make informed
decisions → to understand their operations, customers,
competitors, suppliers, partners, employees, and
stockholders...:
Insight into the past: what is happening in the business?
Understand the future: What might happen?
Advice on possible outcomes: what action to take?
3
Need of IS use at high level
Wisdom
→ action my experience says this will cause severe flight
delays.
knowledge
Heathrow weather station;
visibility 15 km, sky completely cloudy; wind
direction north west, speed 85 kts; temperature
Information 15.7 degrees C
4
Data
Data is raw, random, and unorganized
Data: a collection of ingredients sitting on the counter. They
include carrots, onions, leeks, garlic, and potatoes from the
farmer’s market, and a package of chicken, a box of rice, and
some cans of broth from the grocery store
5
Information
Information is data that has been organized,
structured, and processed. Information is what
you use to gain knowledge.
then you get everything ready by washing,
peeling, and cutting up the vegetables, cutting up
the chicken, and opening the cans of broth. You
put it all in the pot and turn on the heat where it
cooks and becomes soup.
➔ In the DW/BI world, the data has been moved
into the ETL (extract, transform, and load) system
and is transformed into information
6
Knowledge
Knowledge: Now the soup is ready to be put into bowls
and eaten.
→ In the DW/BI world business people consume the
information in reports to gain knowledge that helps them
make informed business decisions.
7
THE BUSINESS DEMAND FOR DATA,
INFORMATION, AND ANALYTICS
Organizations tend to grow and prosper as they gain a better
understanding of their environment
Evaluate through tracking daily transactions and analyzing
company data
Organizations are always looking for a competitive advantage
Product development, market positioning, sales promotions,
and customer service
Companies and software vendors addressed these multilevel
decision support needs by creating autonomous applications
for particular groups of users
This more comprehensive and integrated decision support
framework within organizations became known as business
intelligence
8
THE BUSINESS DEMAND FOR DATA,
INFORMATION, AND ANALYTICS
While enterprises still need leaders and decision-makers
with intuition, they depend on data to validate their
intuitions
Data becomes a strategic guide that helps executives see
patterns they might not otherwise notice.
→ TOO MUCH DATA, TOO LITTLE INFORMATION
→ the importance of analytics
9
THE BUSINESS DEMAND FOR DATA,
INFORMATION, AND ANALYTICS
BI Analytics again ranks number one in Gartner CIO
Survey
https://www.concorn.com/bi-analytics-gartner-cio-survey/
data and analytics is the most important technology
initiative for 2014, with 72% of CIOs surveyed stating that it
is a critical or high priority
now many more people in an organization need the
information that comes from all this data.
10
Problem: Heterogeneous Information Sources
“Heterogeneities are
everywhere” Personal
Databases
World
Scientific Databases
Wide
Web
Digital Libraries
Different interfaces
Different data representations
Duplicate and inconsistent information
Slide credit: J. Hamme
IS 257 – Fall 2005
Problem: Data Management in Large Enterprises
Integration System
World
Wide
Personal
Web
Digital Libraries Scientific Databases Databases
...
...
Source Source Source
Slide credit: J. Hamme
IS 257 – Fall 2005
Disadvantages of Query-Driven Approach
...
...
Source Source Source
Slide credit: J. Hamme
IS 257 – Fall 2005
BI solution
• Multidimensional aggregation
and allocation
• Realtime reporting with
analytical alert
• Key performance indicators
optimization
• ….
DS S → E IS → B I
20
A Framework for Business Intelligence
Source: Eckerson, W. Smart Companies in the 21st Century: The Secrets of Creating Successful Business
Intelligent Solutions. The Data Warehousing Institute, Seattle, WA, 2003, p. 32
21
A Framework for Business Intelligence
22
A Framework for Business Intelligence
A Framework for Business Intelligence
A Framework for Business Intelligence
f06-03
Operational data store
Inmon’s CIF architecture
Kimball’s enterprise data bus
architecture
Source Systems
Many possible sources – (ERP, CRM, legacy system,
unstructured data, etc.)
Many platforms – IBM, Oracle, Microsoft, Sybase, SAS
Many formats – Relational, Hierarchical, Columnar, Multi-
dimensional, Big data MapReduce Databases, Unstructured
text data
29
BI Services components
Integration Services (ETL, Operational Data Feeds,
Enterprise Application Integration, Enterprise Information
Integration)
Data Management Services (data warehouse, data marts,
federated data marts, OLAP cubes, etc.)
Reporting and Analytical Services (Analytical Reporting,
ad-hoc query and batch reporting, dashboards/scorecards,
predictive and prescriptive modeling, data & text
mining/forecasting)
Information Delivery and Consumption Services (Web
portals, subscription, direct user access, internal portals
30
Types of BI users
IT developers
Analysts
Information workers
Managers and executives
Front line workers
Suppliers, customers, and regulators
Source Watson, H. J., "Tutorial: Business Intelligence –Past, Present, and Future," Communications of the Association for Information Systems: Vol. 25, Article 39, 2009.
31
BI Business Value
According to Williams (2004), BI can add value to:
Management Processes:
Planning budgeting, performance monitoring/assessment, process
improvement, cost analysis, optimization, etc.
Revenue Generating Processes:
Customer segmentation, campaign management, channel
management, sales management, etc.
Resource Consumption Processes:
Product/service development, order management,
manufacturing/operations, supply chain, purchasing, etc.
32
Business intelligence
Business intelligence (BI) is a broad category of
applications, technologies, and processes for gathering,
storing, accessing, and analyzing data to help business
users make better decisions.
Watson, Hugh J. (2009) " T utorial: Business Intelligence – Past, Present, and Future,"
Communications of the Association for Information Systems: V ol. 25, Article 39.
33
Business Intelligence
34
Business Intelligence
36
Operational Data versus Decision
Support Data
37
Operational Data versus Decision
Support Data
(Ralph Kimball)
Building the Data Warehouse, Fourth Edition( John Wiley, 2005)
(W.H. Inmon)
The Data Warehouse ETL Toolkit(John Wiley,2004)
40
Data warehouse - Schema
Dimensional approach:
Star schema
Snowflake schema
Normalized approach:
3NF model
42
Star Schemas
Data-modeling technique
Maps multidimensional decision support data into a
relational database
Creates the near equivalent of multidimensional
database schema from existing relational database
Yields an easily implemented model for
multidimensional data analysis
43
Star Schemas
45
BI - Online Analytical Processing (OLAP)
46
Example: Olap Usage of an Automobile Marketer
The Story
An automobile marketer wants to improve business
activity. Therefore he wants to view sales figures
from different perspectives.
➢Sales by model
The Data Needs
➢Sales by dealership
➢Sales by color
➢Sales over time
➢etc.
A Question
What is the trend in sales volumes over a period of
time for a specific model and color across a
specific group of dealerships ?
Sales
Volumes
M Van
O
D
Coupe
E
L Smith
Sedan Clyde
Miller
Blue Red White DEALERSHIP
COLOR
Adopted from Teradata University Network presentation
on OLAP.
OLAP Features: “Slicing and Dicing“ the Data
M Van
O Coupe Clyde
D
Coupe
E
Blue White
L Smith
Sedan Clyde “Sliced and
Diced“ Data
Miller
Blue Red White DEALERSHIP
COLOR
COLOR DEALERSHIP
Adopted from Teradata University Network presentation
on OLAP.
OLAP Features: Drill-Down and Roll-Up
Data can be disaggregated and aggregated along a dimension
according to their natural hierarchy
Georgia
State
Drill-Down
Adopted from Teradata University Network presentation
on OLAP.
OLAP SERVER
52
Relational OLAP
53
Multidimensional OLAP
54
BI - mining
1. Define what we want to
achieve:
Is there a correlation
between the sales of music,
film, and audio book product
types and the customer
interest or occupation?”
2. Prepare the data
3. Build the mining models
4. Deploy and maintain the
models in production
55
56
BI - mining
Mục tiêu: nhanh chóng tìm ra luật kết hợp trên 1 dữ liệu khá lớn
(vd: thông tin bán hàng của wall mart qua nhiều năm)
57
Bài toán minh hoạ
Cây phân lớp ( cây quyết định)
Buyers(<attributes>, purchase)
Cần dự đoán việc mua hàng dựa vào <attributes>
Clustering
Buyers(<attributes>)
Tự động nhóm các Buyers vào N nhóm tương tự
Top-N items
Purchase(salesID, item)
N item được mua nhiều nhất? (theo salesID)
58
BI - dashboard
Dashboards are a category of business intelligence
applications that give a quick high-level summaryof
business performance in graphical gadgets, typically
gauges, charts, indicators, and color-coded maps
https://public.datapine.com/?_ga=2.10453493.1940943450.1585127327-933313889.1565257605#board/DnjvEBVsJRVZteO3gGbSWA/null
61
Actual vs Forecast Financial Dashboard
https://public.datapine.com/#board/COcn8yeYXCcmXfzllGV5Ac
62
BI reporting
Print-perfect Operational
Reports
• Via Web and Print
• Easy Navigation Through
Hundreds of Report Pages
• Parameter Prompting Lets
Users Specify Report Content
• Pixel-perfect Business Reports
• …..
(Business Intelligence Concepts, Tools, and Applications )
63
Alerting and Proactive Notification
Delivers Information via E-Mail, Print, or File
64
Preferences
Business intelligent guide book - Rick Sherman
https://www.logility.com/blog/descriptive-predictive-
and-prescriptive-analytics-explained/
Database Systems Design, Implementation, Management
(13th Edition)
lecture notes of the course: Business Intelligence Concepts,
Tools, and Applications - University of Colorado Denver
66
67