Snowflake Schema: The Snowflake Schema Is An Extension of Star Schema. in A Snowflake
Snowflake Schema: The Snowflake Schema Is An Extension of Star Schema. in A Snowflake
Snowflake Schema: The Snowflake Schema Is An Extension of Star Schema. in A Snowflake
Ans: Star Schema: The star schema architecture is easy to design. It is called a star schema because
diagram resembles a star,with points radiating from a center. The center of the star consists of the
fact table, and the points of the star is dimension tables.
The fact tables in a star schema which is third normal form whereas dimensional tables are de-
normalized.
In the following Star Schema example, the fact table is at the center which contains keys to
every dimension table like Dealer_ID, Model ID, Date_ID, Product_ID, Branch_ID & other
attributes like Units sold and revenue.
Ans: ROLAP: ROLAP stands for Relational Online Analytical Processing. ROLAP stores
data in columns and rows (also known as relational tables) and retrieves the information on
demand through user submitted queries. A ROLAP database can be accessed through
complex SQL queries to calculate information. ROLAP can handle large data volumes, but
the larger the data, the slower the processing times.
Because queries are made on-demand, ROLAP does not require the storage and pre-
computation of information. However, the disadvantage of ROLAP implementations are the
potential performance constraints and scalability limitations that result from large and
inefficient join operations between large tables. Examples of popular ROLAP products
include Metacube by Stanford Technology Group, Red Brick Warehouse by Red Brick
Systems, and AXSYS Suite by Information Advantage.
MOLAP: MOLAP stands for Multidimensional Online Analytical Processing. MOLAP uses
a multidimensional cube that accesses stored data through various combinations. Data is pre-
computed, pre-summarized, and stored (a difference from ROLAP, where queries are served
on-demand).
A multicube approach has proved successful in MOLAP products. In this approach, a series
of dense, small, precalculated cubes make up a hypercube. Tools that incorporate MOLAP
include Oracle Essbase, IBM Cognos, and Apache Kylin.
Its simple interface makes MOLAP easy to use, even for inexperienced users. Its speedy data
retrieval makes it the best for “slicing and dicing” operations. One major disadvantage of
MOLAP is that it is less scalable than ROLAP, as it can handle a limited amount of data.
HOLAP: HOLAP stands for Hybrid Online Analytical Processing. As the name suggests,
the HOLAP storage mode connects attributes of both MOLAP and ROLAP. Since HOLAP
involves storing part of your data in a ROLAP store and another part in a MOLAP store,
developers get the benefits of both.
With this use of the two OLAPs, the data is stored in both multidimensional databases and
relational databases. The decision to access one of the databases depends on which is most
appropriate for the requested processing application or type. This setup allows much more
flexibility for handling data. For theoretical processing, the data is stored in a
multidimensional database. For heavy processing, the data is stored in a relational database.
Microsoft Analysis Services and SAP AG BI Accelerator are products that run off HOLAP.
Q3) What is Knowledge discovery in database? How it is related to data mining? (2+3=5)
Q4) a)Given the Following measurement for the variable age: 18,22,25,42,28,43,33,35,56,28
Standardize the variable by the following
1. Computer the mean absolute deviation of age
2. Computer the Z-score for the first four measurements. (6)
b) Explain K- Mediods. (2)