Unit 4

Dimensional modeling is a data representation technique used in OLAP that organizes data into fact and dimension tables, allowing for efficient querying and analysis. It has advantages such as ease of understanding for end-users and efficient query performance, but also faces challenges like data integrity maintenance and adaptability to business changes. The document also discusses various schemas including star and snowflake schemas, which structure data differently to optimize performance and reduce redundancy.

Uploaded by

priya

What is Dimensional Modeling?

• Dimensional modeling represents data as a cube, which gives a logical data
representation well suited to OLAP data management. The concept of
dimensional modeling was developed by Ralph Kimball, and a dimensional
model consists of "fact" and "dimension" tables.
• In dimensional modeling, a transaction record is divided into
either "facts," which are typically numerical transaction data,
or "dimensions," which are the reference information that gives context to
the facts. For example, a sale transaction can be broken down into facts such
as the number of products ordered and the price paid for the products, and
into dimensions such as order date, user name, product number, order
ship-to and bill-to locations, and the salesman responsible for receiving the
order.
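As a minimal sketch of that split, the Python below separates one flat sale record into its facts and its dimensions. The field names and values are hypothetical, chosen only to mirror the sale example above, and the choice of which fields count as measures is an assumption.

```python
# Hypothetical flat sale transaction; field names and values are illustrative only.
sale = {
    "order_date": "2024-01-15",
    "user_name": "A. Kumar",
    "product_number": "P-100",
    "ship_to": "Delhi",
    "bill_to": "Mumbai",
    "salesman": "R. Singh",
    "quantity_ordered": 3,
    "price_paid": 450.0,
}

# Assumption: these two numeric fields are the measures (facts).
FACT_FIELDS = {"quantity_ordered", "price_paid"}


def split_transaction(record):
    """Return (facts, dimensions) for one transaction record."""
    facts = {k: v for k, v in record.items() if k in FACT_FIELDS}
    dimensions = {k: v for k, v in record.items() if k not in FACT_FIELDS}
    return facts, dimensions


facts, dimensions = split_transaction(sale)
print(facts)
print(dimensions)
```

In a real warehouse the dimension values would usually be replaced by keys into separate dimension tables rather than stored inline.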
Objectives of Dimensional Modeling

The purposes of dimensional modeling are:

• To produce a database architecture that is easy for end users to
understand and to write queries against.
• To maximize the efficiency of queries. Dimensional modeling achieves both
goals by minimizing the number of tables and the relationships between them.
Advantages of Dimensional Modeling
Disadvantages of Dimensional Modeling
• Loading the data warehouse with records from various operational
systems is complicated, because the integrity of facts and dimensions
must be maintained.
• It is difficult to modify the data warehouse if the organization
adopting the dimensional technique changes the way in which it does
business.
Elements of Dimensional Modeling

Fact
• It is a collection of associated data items, consisting of measures and
context data. It typically represents business items or business transactions.
Dimensions
• It is a collection of data that describes one business dimension.
Dimensions provide the contextual background for the facts, and they are
the framework over which OLAP is performed.
Measure
• It is a numeric attribute of a fact, representing the performance or behavior
of the business relative to the dimensions.
Fact Table

• Fact tables are used to store facts or measures in the business.
Characteristics of the Fact table
• The fact table includes numerical values of what we measure. For
example, a fact value of 20 might mean that 20 widgets have been
sold.
• Each fact table includes the keys to associated dimension tables. These
are known as foreign keys in the fact table.
• Fact tables typically include a small number of columns.
• Compared to dimension tables, fact tables have a large
number of rows.
Dimension Table

• Dimension tables establish the context of the facts. Dimension tables store fields that
describe the facts.
Characteristics of the Dimension table
• Dimension tables contain the details about the facts. These details enable
business analysts to understand the data and their reports better.
• The dimension tables include descriptive data about the numerical values in the fact table.
That is, they contain the attributes of the facts. For example, the dimension tables for a
marketing analysis function might include attributes such as time, marketing region, and
product type.
• Since the records in a dimension table are denormalized, the table usually has a large number of
columns. The dimension tables include significantly fewer rows of information than the fact
table.
• The attributes in a dimension table are used as row and column headings in a document or
query results display.
• Example: A store summary in a fact table can be viewed by city and state.
An item summary can be viewed by brand, color, etc. Customer
information can be viewed by name and address.
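The idea of viewing a store summary by city or by state can be sketched in a few lines of Python. The store dimension, the fact rows, and all names here (`store_dim`, `sales_fact`, `units_sold`) are hypothetical; the point is only that the fact rows carry a key and a measure, while the descriptive attributes used as headings live in the dimension.

```python
from collections import defaultdict

# Hypothetical store dimension: one row per store, keyed by store_key.
store_dim = {
    1: {"store_name": "Store A", "city": "Delhi", "state": "Delhi"},
    2: {"store_name": "Store B", "city": "Pune", "state": "Maharashtra"},
    3: {"store_name": "Store C", "city": "Mumbai", "state": "Maharashtra"},
}

# Hypothetical fact rows: a foreign key into store_dim plus one measure.
sales_fact = [
    {"store_key": 1, "units_sold": 20},
    {"store_key": 2, "units_sold": 5},
    {"store_key": 3, "units_sold": 7},
    {"store_key": 2, "units_sold": 10},
]


def summarize_by(attribute):
    """Total units_sold grouped by one attribute of the store dimension."""
    totals = defaultdict(int)
    for row in sales_fact:
        label = store_dim[row["store_key"]][attribute]
        totals[label] += row["units_sold"]
    return dict(totals)


by_city = summarize_by("city")
by_state = summarize_by("state")
print(by_city)
print(by_state)
```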
Hierarchy
• A hierarchy is a directed tree whose nodes are dimensional attributes
and whose arcs model many-to-one associations between dimensional
attributes. It contains a dimension, positioned at the tree's root,
and all of the dimensional attributes that define it.
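A many-to-one arc can be modeled as a plain lookup from a child value to its single parent. The sketch below assumes a hypothetical location hierarchy (city, state, country) with made-up members; it is one way to illustrate the definition, not a prescribed representation.

```python
# Hypothetical location hierarchy. Each arc maps a child value to exactly one
# parent (many-to-one), so several cities can share one state.
city_to_state = {"Pune": "Maharashtra", "Mumbai": "Maharashtra", "Jaipur": "Rajasthan"}
state_to_country = {"Maharashtra": "India", "Rajasthan": "India"}

# Ordered arcs of the hierarchy: city -> state -> country.
HIERARCHY = [
    ("city", "state", city_to_state),
    ("state", "country", state_to_country),
]


def roll_up(city, target_level):
    """Follow the many-to-one arcs from a city up to target_level."""
    level, value = "city", city
    for child, parent, mapping in HIERARCHY:
        if level == target_level:
            break
        if level == child:
            level, value = parent, mapping[value]
    if level != target_level:
        raise ValueError(f"unknown level: {target_level}")
    return value


print(roll_up("Pune", "state"))
print(roll_up("Jaipur", "country"))
```

Because every arc is many-to-one, each city rolls up to exactly one state and one country, which is what makes aggregation along the hierarchy unambiguous.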
What is Multi-Dimensional Data Model?
• A multidimensional model views data in the form of a data cube. A data cube enables data
to be modeled and viewed in multiple dimensions. It is defined by dimensions and facts.
• The dimensions are the perspectives or entities concerning which an organization keeps
records. For example, a shop may create a sales data warehouse to keep records of the
store's sales for the dimensions time, item, and location. These dimensions allow the store to
keep track of things, for example, monthly sales of items and the locations at which the
items were sold. Each dimension has a table related to it, called a dimensional table, which
describes the dimension further. For example, a dimensional table for an item may contain
the attributes item_name, brand, and type.
• A multidimensional data model is organized around a central theme, for example, sales. This
theme is represented by a fact table. Facts are numerical measures. The fact table contains
the names of the facts, or measures, as well as keys to each of the related dimensional tables.
• Consider the data of a shop for items sold per quarter in the city of
Delhi. The data is shown in the table. In this 2D representation, the
sales for Delhi are shown for the time dimension (organized in
quarters) and the item dimension (classified according to the types of
item sold). The fact or measure displayed is rupee_sold (in
thousands).
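The 2-D view described above (quarters against item types, with rupee_sold in each cell) can be sketched as a small pivot. The rows below are invented for illustration and are not the figures from the original table; the item types are also assumptions.

```python
from collections import defaultdict

# Hypothetical rows for the Delhi shop: (quarter, item type, rupee_sold in thousands).
rows = [
    ("Q1", "phone", 120), ("Q1", "laptop", 300),
    ("Q2", "phone", 150), ("Q2", "laptop", 280),
    ("Q1", "phone", 30),
]


def pivot(rows):
    """Build a quarter x item table of summed rupee_sold."""
    table = defaultdict(lambda: defaultdict(int))
    for quarter, item, rupee_sold in rows:
        table[quarter][item] += rupee_sold
    return {q: dict(items) for q, items in table.items()}


view_2d = pivot(rows)

# Print the table with quarters as rows and item types as columns.
items = sorted({item for _, item, _ in rows})
print("quarter", *items)
for quarter in sorted(view_2d):
    print(quarter, *(view_2d[quarter].get(item, 0) for item in items))
```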
What is Data Cube?

• Data that is grouped or combined in multidimensional matrices is called a
data cube. The data cube method has a few alternative names or variants,
such as "multidimensional databases," "materialized views," and "OLAP
(On-Line Analytical Processing)."
• The general idea of this approach is to materialize certain expensive
computations that are frequently queried.
• For example, a relation with the schema sales(part, supplier, customer,
sale-price) can be materialized into a set of eight views as shown in the
figure, where psc indicates a view consisting of aggregate function values
(such as total-sales) computed by grouping on the three attributes part,
supplier, and customer, p indicates a view composed of the corresponding
aggregate function values calculated by grouping on part alone, etc.
• A data cube is created from a subset of attributes in the database. Specific attributes
are chosen to be measure attributes, i.e., the attributes whose values are of interest.
Other attributes are selected as dimensions or functional attributes. The measure
attributes are aggregated according to the dimensions.
For example, XYZ may create a sales data warehouse to keep records of the store's
sales for the dimensions time, item, branch, and location. These dimensions enable
the store to keep track of things like monthly sales of items, and the branches and
locations at which the items were sold. Each dimension may have a table associated
with it, known as a dimensional table, which describes the dimension. For example,
a dimension table for items may contain the attributes item_name, brand, and type.
• The data cube method is an interesting technique with many applications. Data cubes
can be sparse in many cases because not every cell in each dimension has
corresponding data in the database.
• Techniques should be developed to handle sparse cubes efficiently.
• If a query contains constants at even lower levels than those provided in a
data cube, it is not clear how to make the best use of the precomputed
results stored in the data cube.
• The multidimensional model views data in the form of a data cube. OLAP tools
are based on the multidimensional data model. Data cubes usually model
n-dimensional data.
• A data cube enables data to be modeled and viewed in multiple dimensions.
A multidimensional data model is organized around a central theme, like
sales and transactions. A fact table represents this theme. Facts are
numerical measures. Thus, the fact table contains measures (such as Rs_sold)
and keys to each of the related dimensional tables.
• Dimensions define the structure of a data cube. Facts are generally
quantities, which are used for analyzing the relationships between
dimensions.
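The eight views of the sales(part, supplier, customer, sale-price) example follow from grouping on every subset of the three dimension attributes (2^3 = 8). The sketch below materializes them over a few invented rows; the data, the `materialize` helper, and the use of a sum as the aggregate are assumptions, and `sale_price` is spelled with an underscore only to make it a valid Python key style.

```python
from collections import defaultdict
from itertools import combinations

DIMENSIONS = ("part", "supplier", "customer")

# Hypothetical rows of sales(part, supplier, customer, sale_price).
sales = [
    {"part": "p1", "supplier": "s1", "customer": "c1", "sale_price": 10},
    {"part": "p1", "supplier": "s2", "customer": "c1", "sale_price": 20},
    {"part": "p2", "supplier": "s1", "customer": "c2", "sale_price": 5},
]


def materialize(group_by):
    """Total sale_price for each combination of values of the group_by attributes."""
    totals = defaultdict(int)
    for row in sales:
        key = tuple(row[d] for d in group_by)
        totals[key] += row["sale_price"]
    return dict(totals)


# One view per subset of the three dimensions: 2**3 = 8 views, from the empty
# grouping (grand total) up to (part, supplier, customer), i.e. the psc view.
views = {
    group_by: materialize(group_by)
    for size in range(len(DIMENSIONS) + 1)
    for group_by in combinations(DIMENSIONS, size)
}

for group_by, view in views.items():
    print(group_by, view)
```

Once these views are stored, a query such as "total sales per part" reads the p view directly instead of rescanning the sales relation.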
3-Dimensional Cuboids

• Suppose we would like to view the sales data with a third
dimension. For example, suppose we would like to view the data
according to time and item, as well as location, for the cities Chicago,
New York, Toronto, and Vancouver. The measure displayed is dollars
sold (in thousands). These 3-D data are shown in the table. The 3-D
data of the table are represented as a series of 2-D tables.
• In data warehousing, the data cubes are n-dimensional. The cuboid
which holds the lowest level of summarization is called the base cuboid.
• For example, the 4-D cuboid in the figure is the base cuboid for the
given time, item, location, and supplier dimensions.
• Figure: A 4-D data cube representation of sales data, according to the
dimensions time, item, location, and supplier. The measure displayed
is dollars sold (in thousands).
• The topmost 0-D cuboid, which holds the highest level of
summarization, is known as the apex cuboid. In this example, this is
the total sales, or dollars sold, summarized over all four dimensions.
• The lattice of cuboids forms a data cube. The figure shows the lattice of
cuboids forming a 4-D data cube for the dimensions time, item,
location, and supplier. Each cuboid represents a different degree of
summarization.
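The lattice for the four dimensions above can be enumerated directly: a k-D cuboid keeps k of the four dimensions and summarizes over the rest. This sketch assumes no concept hierarchies within a dimension, so the lattice has 2^4 = 16 cuboids; the helper name `cuboids_by_level` is hypothetical.

```python
from itertools import combinations

DIMENSIONS = ("time", "item", "location", "supplier")


def cuboids_by_level(dimensions):
    """Map k -> list of k-D cuboids, each named by the dimensions it keeps."""
    return {
        k: list(combinations(dimensions, k))
        for k in range(len(dimensions) + 1)
    }


lattice = cuboids_by_level(DIMENSIONS)
apex = lattice[0][0]                 # 0-D cuboid: no dimensions kept (grand total)
base = lattice[len(DIMENSIONS)][0]   # 4-D cuboid: all dimensions kept
total = sum(len(level) for level in lattice.values())

for k, level in lattice.items():
    print(f"{k}-D: {len(level)} cuboid(s)")
print("total:", total)
```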
What is Star Schema?

• A star schema is the elementary form of a dimensional model, in which
data are organized into facts and dimensions. A fact is an event that is
counted or measured, such as a sale or a login. A dimension includes
reference data about the fact, such as date, item, or customer.
• A star schema is a relational schema whose design represents a
multidimensional data model. The star schema is the simplest data
warehouse schema. It is known as a star schema because the
entity-relationship diagram of this schema resembles a star, with points
diverging from a central table. The center of the schema consists of a large
fact table, and the points of the star are the dimension tables.
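A star schema can be sketched with Python's built-in `sqlite3` module. The table and column names (`fact_sales`, `dim_date`, `dim_item`, `dim_customer`, `units_sold`) and the sample rows are assumptions made for illustration; they follow the date, item, and customer dimensions mentioned above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_date     (date_key INTEGER PRIMARY KEY, quarter TEXT);
CREATE TABLE dim_item     (item_key INTEGER PRIMARY KEY, item_name TEXT, brand TEXT);
CREATE TABLE dim_customer (customer_key INTEGER PRIMARY KEY, customer_name TEXT);
-- Central fact table: one foreign key per dimension plus the measure.
CREATE TABLE fact_sales (
    date_key     INTEGER REFERENCES dim_date(date_key),
    item_key     INTEGER REFERENCES dim_item(item_key),
    customer_key INTEGER REFERENCES dim_customer(customer_key),
    units_sold   INTEGER
);
INSERT INTO dim_date VALUES (1, 'Q1'), (2, 'Q2');
INSERT INTO dim_item VALUES (1, 'widget', 'Acme'), (2, 'gadget', 'Acme');
INSERT INTO dim_customer VALUES (1, 'Asha'), (2, 'Ben');
INSERT INTO fact_sales VALUES (1, 1, 1, 20), (1, 2, 2, 5), (2, 1, 2, 7);
""")

# Every dimension is exactly one join away from the fact table.
query = """
SELECT d.quarter, i.item_name, SUM(f.units_sold)
FROM fact_sales AS f
JOIN dim_date AS d ON d.date_key = f.date_key
JOIN dim_item AS i ON i.item_key = f.item_key
GROUP BY d.quarter, i.item_name
ORDER BY d.quarter, i.item_name
"""
result = conn.execute(query).fetchall()
print(result)
```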
Fact Tables

• A fact table is a table in a star schema that contains facts and is
connected to dimensions. A fact table has two types of columns: those that
include facts and those that are foreign keys to the dimension tables. The
primary key of a fact table is generally a composite key that is
made up of all of its foreign keys.
• A fact table might contain either detail-level facts or facts that have
been aggregated (fact tables that contain aggregated facts are often
called summary tables instead). A fact table generally contains facts
with the same level of aggregation.
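The composite-key point can be checked with a small `sqlite3` sketch. The fact table and its three key columns are hypothetical, and the dimension tables are left out for brevity; the example only shows that a row is identified by the combination of its foreign keys.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
CREATE TABLE fact_sales (
    date_key   INTEGER NOT NULL,
    item_key   INTEGER NOT NULL,
    store_key  INTEGER NOT NULL,
    units_sold INTEGER,
    PRIMARY KEY (date_key, item_key, store_key)
)
""")
conn.execute("INSERT INTO fact_sales VALUES (1, 1, 1, 20)")
# Differs only in store_key, so the composite key is still unique.
conn.execute("INSERT INTO fact_sales VALUES (1, 1, 2, 5)")

# Same (date_key, item_key, store_key) as the first row: SQLite rejects it.
try:
    conn.execute("INSERT INTO fact_sales VALUES (1, 1, 1, 99)")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

row_count = conn.execute("SELECT COUNT(*) FROM fact_sales").fetchone()[0]
print(duplicate_rejected, row_count)
```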
What is Snowflake Schema?

• A snowflake schema is a variant of the star schema. "A schema is
known as a snowflake if one or more dimension tables do not connect
directly to the fact table but must join through other dimension tables."
• The snowflake schema is an expansion of the star schema where each
point of the star explodes into more points. It is called a snowflake schema
because its diagram resembles a snowflake. Snowflaking is a method of
normalizing the dimension tables in a star schema. When we normalize all
the dimension tables entirely, the resultant structure resembles a
snowflake with the fact table in the middle.
• Snowflaking is used to improve the performance of specific queries. The
schema is diagrammed with each fact surrounded by its associated
dimensions, and those dimensions are related to other dimensions,
branching out into a snowflake pattern.
• In a snowflake schema, tables are normalized to remove redundancy:
dimension tables are broken down into multiple dimension
tables.
• The figure shows a simple star schema for sales in a manufacturing
company. The sales fact table includes quantity, price, and other
relevant metrics. SALESREP, CUSTOMER, PRODUCT, and TIME are the
dimension tables.
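To illustrate snowflaking, the sketch below normalizes a product dimension by moving the category into its own table, so that the category no longer connects directly to the fact table. The `dim_category` table, the column names, and the sample rows are assumptions; only the PRODUCT dimension and the quantity and price measures come from the example above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Snowflaked product dimension: category is normalized into its own table.
CREATE TABLE dim_category (category_key INTEGER PRIMARY KEY, category_name TEXT);
CREATE TABLE dim_product (
    product_key  INTEGER PRIMARY KEY,
    product_name TEXT,
    category_key INTEGER REFERENCES dim_category(category_key)
);
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    quantity    INTEGER,
    price       REAL
);
INSERT INTO dim_category VALUES (1, 'tools'), (2, 'toys');
INSERT INTO dim_product VALUES (1, 'hammer', 1), (2, 'wrench', 1), (3, 'kite', 2);
INSERT INTO fact_sales VALUES (1, 2, 10.0), (2, 1, 15.0), (3, 4, 3.0);
""")

# dim_category has no direct link to fact_sales; it is reached through dim_product.
query = """
SELECT c.category_name, SUM(f.quantity)
FROM fact_sales AS f
JOIN dim_product AS p ON p.product_key = f.product_key
JOIN dim_category AS c ON c.category_key = p.category_key
GROUP BY c.category_name
ORDER BY c.category_name
"""
by_category = conn.execute(query).fetchall()
print(by_category)
```

The category name is stored once per category instead of once per product, at the cost of an extra join in queries that group by category.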
Difference between Star and Snowflake Schemas
