Capstone Project
Data warehousing concepts
Operational / informational data:
❑ Operational data is the data you use to run your business. This data is what is
typically stored, retrieved, and updated by your Online Transactional
Processing (OLTP) system. An OLTP system may be, for example, a
reservations system, an accounting application, or an order entry application.
❑ Informational data is created from the wealth of operational data that exists in your business, together with external data useful for analyzing your business.
Informational data is what makes up a data warehouse. Informational data is
typically:
➢ Summarized operational data
➢ De-normalized and replicated data
➢ Infrequently updated from the operational systems
➢ Optimized for decision support applications
➢ Possibly "read only" (no updates allowed)
➢ Stored on separate systems to lessen impact on operational systems
Operational vs. Informational Data

Operational                               Informational
Primarily primitive, highly detailed      Primarily derived, somewhat summarized
Current; guaranteed accurate now          Historical; accuracy maintained over time
Constantly updated                        Infrequently updated
Minimal redundancy                        Managed redundancy
Static structure, dynamic content         Dynamic structure, static content
Referential integrity                     Historical integrity
Supports day-to-day business functions    Supports long-term informational requirements
The Complete DWH System
[Figure: operational databases are extracted, transformed, loaded, and periodically refreshed into the warehouse and its data marts, which in turn serve query/reporting tools, OLAP servers (e.g., ROLAP), data mining tools, etc.]
Three-Tier DWH System
• Warehouse database server
– Almost always a relational or analytical DBMS; rarely flat files
• OLAP servers
– Relational OLAP (ROLAP): extended relational DBMS that maps operations on multidimensional data to standard relational operators (see the sketch after this list)
– Multidimensional OLAP (MOLAP): special-purpose server
that directly implements multidimensional data and operations
• Clients
– Query and reporting tools
– Analysis tools
– Data mining tools
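As a rough illustration of how a ROLAP server maps a multidimensional question onto standard relational operators, here is a minimal sketch that uses pandas as a stand-in for the relational engine; the star-schema tables, column names, and values are invented for illustration only.

```python
import pandas as pd

# Hypothetical star schema: a sales fact table plus a product dimension.
fact_sales = pd.DataFrame({
    "product_id": [1, 1, 2, 2, 3],
    "store_id":   [10, 20, 10, 30, 20],
    "amt":        [56, 4, 11, 8, 50],
})
dim_product = pd.DataFrame({
    "product_id": [1, 2, 3],
    "brand":      ["BrandA", "BrandA", "BrandB"],
})

# The multidimensional request "total sales rolled up to brand" becomes
# ordinary relational operators: a join followed by a GROUP BY / SUM.
by_brand = (fact_sales.merge(dim_product, on="product_id")
                      .groupby("brand", as_index=False)["amt"].sum())
print(by_brand)
```

In SQL terms this is simply a join of the fact table to the product dimension followed by GROUP BY brand; a MOLAP server would instead answer the same question from a pre-built multidimensional array.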
Sample Unified BI Architecture: Investment Banking Domain
[Figure: heterogeneous source systems (operational databases, Oracle, flat files for risk and profitability, banking feeds, and reference/master data) deliver data feeds over FTP into a staging area holding a source image. ETL servers load a normalized atomic-level schema and star schemas in the DWH database server, which feed analytics models, an OLAP server (optional), and a report builder. A presentation layer provides MIS/operational and ad hoc reports through a portal behind a security layer, and a publish layer with a report publish server distributes customized reports. Data validation components sample data at points A and B.]
➢ The numbered paths 1, 2, and 3 are alternative routes in the report generation process; if a bank does not need a given path, the components on that path are omitted from the setup.
➢ Data validation: Option A samples data from the staging area; Option B samples data from the generated report. The two samples are compared to check the validity of the data, and the result is stored in a log file.
Sample: Data Marts Blocks
[Figure: flat-file systems are two-dimensional (records by character positions); relational data is likewise two-dimensional (rows by columns).]
MOLAP: Dimensional Modeling Using the Multidimensional Model
[Figure: a fact table view versus a multidimensional cube. With two dimensions the data is a flat table; with three dimensions (Time, Product, Store) it becomes a 3-D cube. Dimensions carry attributes, e.g., Product (upc, price, ...), and hierarchies such as Product → Brand → ..., Day → Week → Quarter, and Store → Region → Country, which support roll-ups to brand, week, or region. Example cell: 56 units of bread sold in the JU/Kolkata store on Monday.]
Cube Aggregation: Roll-up
[Figure: day-level sales cells for products p1, p2 across stores s1, s2, s3 are rolled up over the day dimension to a (product, store) table (p1: 56, 4, 50; p2: 11, 8), then further to per-store sums (67, 12, 50), per-product sums (p1 = 110, p2 = 19), and the grand total 129. Drill-down is the reverse operation.]
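A small sketch of this roll-up in Python (pandas is assumed as the computation engine; the numbers are the ones readable from the figure):

```python
import pandas as pd

# Sales at the (product, store) level, as in the figure
# (missing cells are simply absent rows).
sales = pd.DataFrame({
    "product": ["p1", "p1", "p1", "p2", "p2"],
    "store":   ["s1", "s2", "s3", "s1", "s2"],
    "amt":     [56, 4, 50, 11, 8],
})

by_store   = sales.groupby("store")["amt"].sum()    # s1=67, s2=12, s3=50
by_product = sales.groupby("product")["amt"].sum()  # p1=110, p2=19
grand      = sales["amt"].sum()                     # 129
print(by_store, by_product, grand, sep="\n")
```

Drill-down is simply the reverse: going back from these aggregates to the more detailed (product, store) or day-level rows.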
Cube Operators for Roll-up
[Figure: the same data written with cube operators, where '*' means "aggregated over that dimension": sale(s1, *, *) sums all sales in store s1, sale(s2, p2, *) sums sales of product p2 in store s2 over all days, and sale(*, *, *) is the grand total, 129.]
Extended Cube
[Figure: the cube extended with '*' slices that store the aggregates alongside the base cells, e.g., sale(*, p2, *) = 19, the per-product totals 110 and 19, the per-store totals 67, 12, and 50, and the grand total sale(*, *, *) = 129.]
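The extended cube corresponds closely to a pivot table with margins: the '*' row and column hold the aggregates next to the base cells. A minimal sketch, again assuming pandas and the same figures as above:

```python
import pandas as pd

sales = pd.DataFrame({
    "product": ["p1", "p1", "p1", "p2", "p2"],
    "store":   ["s1", "s2", "s3", "s1", "s2"],
    "amt":     [56, 4, 50, 11, 8],
})

# margins=True adds the '*' row and column of the extended cube:
# per-store totals, per-product totals, and the grand total (129).
cube = sales.pivot_table(index="product", columns="store",
                         values="amt", aggfunc="sum",
                         margins=True, margins_name="*")
print(cube)
```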
Aggregates
• Add up amounts for day 1
• In SQL: SELECT sum(amt) FROM SALE WHERE date = 1
Aggregation Using Hierarchies
[Figure: the day-level sales cells are rolled up along the Store → Region → Country hierarchy. With store s1 in region A and stores s2, s3 in region B, the region-level table is p1: region A = 56, region B = 54; p2: region A = 11, region B = 8.]
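Rolling up along a hierarchy can be sketched the same way: each store is replaced by its region using the Store → Region mapping from the figure, and the measure is re-aggregated (pandas assumed, numbers as in the figure):

```python
import pandas as pd

sales = pd.DataFrame({
    "product": ["p1", "p1", "p1", "p2", "p2"],
    "store":   ["s1", "s2", "s3", "s1", "s2"],
    "amt":     [56, 4, 50, 11, 8],
})
# Store -> Region level of the hierarchy: s1 is in region A, s2 and s3 in region B.
store_region = {"s1": "region A", "s2": "region B", "s3": "region B"}

sales["region"] = sales["store"].map(store_region)
by_region = sales.pivot_table(index="product", columns="region",
                              values="amt", aggfunc="sum")
print(by_region)  # p1: A=56, B=54; p2: A=11, B=8
```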
Points to be noticed about MOLAP
• Pre-calculating (pre-consolidating) transactional data improves query speed.
BUT
• Because MDDs fully pre-consolidate the incoming data, they require an enormous amount of overhead, both in processing time and in storage: an input file of 200 MB can easily expand to 5 GB (a rough calculation follows below).
• MDDs are therefore good candidates for department-level data marts smaller than about 50 GB.
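A back-of-the-envelope sketch of this data explosion (the fact count and dimension cardinalities below are made up purely for illustration): a fully pre-consolidated cube materializes one cell for every member combination, including the "all" level in each dimension, whether or not a base fact exists for it.

```python
from math import prod

# Hypothetical sizes: one million base-level facts spread sparsely
# over these dimension cardinalities.
n_facts = 1_000_000
cardinalities = {"product": 1000, "store": 300, "day": 365}

# Cells in a fully pre-consolidated cube, with an extra 'all' (*) member
# per dimension.
cube_cells = prod(n + 1 for n in cardinalities.values())
print(f"base facts:             {n_facts:,}")
print(f"pre-consolidated cells: {cube_cells:,} (~{cube_cells / n_facts:.0f}x the input)")
```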
1) Performance:
[Figure: a sales cube with dimensions Product (Household, Telecomm, Video, Audio), Sales Channel (Retail, Direct, Special), and Time ('97, '98, '99), the kind of cube that MOLAP pre-consolidation is designed to answer quickly.]
Multidimensional Analysis
[Figure series: the same sales data viewed along different dimension combinations. Starting from a cube of Product (Household, Telecomm, Video, Audio) × Sales Channel (Retail, Direct, Special) × Time (1995, 1996, ...), the Time dimension is swapped for Customer Information (geographic location, customer type, income bracket); the cube is then sliced and diced by Customer Type (Government, Commercial, Individual) and by geography (Europe (EC), Asia Pacific) across the same products and sales channels. The resulting cross-tab of product and customer type by sales channel is shown below.]
Multidimensional Analysis
                       Retail   Direct   Special
Household               $300     $200      $100
  Government            $100      $50       $10
  Commercial            $100      $75       $70
  Individual            $100      $75       $30
Telecomm               $6000    $3000      $400
  Government           $2000     $500      $100
  Commercial           $1000    $1500      $100
  Individual           $3000    $1000      $200
Video                  $4444    $2222      $777
  Government           $1030     $150      $222
  Commercial           $1311    $1175      $111
  Individual           $2103     $897      $444
Audio                    $50      $75       $25
  Government             $10      $25       $20
  Commercial             $10      $25        $0
  Individual             $30      $25        $5
Tools available
• ROLAP:
– ORACLE 8i
– ORACLE Reports; ORACLE Discoverer
– ORACLE Warehouse Builder
– Arbor Software’s Essbase
• MOLAP:
– ORACLE Express Server
– ORACLE Express Clients (C/S and Web)
– MicroStrategy’s DSS server
– Platinum Technology’s Platinum InfoBeacon
• HOLAP:
– ORACLE 8i
– ORACLE Express Server
– ORACLE Relational Access Manager
– ORACLE Express Clients (C/S and Web)
Conclusion
• ROLAP: RDBMS -> star/snowflake schema
• ROLAP or MOLAP: the data models used play a major role in the performance differences
Simple Example of Parallel Storage and Parallel Processing
• A farmer wants to harvest grapes and sell all of the fruit in the nearby town.
• After harvesting, he stores the produce in a single storage room.
• Challenge: the single storage room becomes the bottleneck for storing and accessing all of the fruit in one place.
• So the farmer decides to distribute the storage, giving each worker a separate storage area.
• This high-yield storage and processing approach is a business decision that helps them complete orders on time without any hassle.
• This is how they keep growing, deliver more and more fruit baskets, and eventually form a large farm.
• The single storage unit became a bottleneck, which generated network overhead.
• The solution was to use distributed storage, one unit per processor.
• This made it easy to store and retrieve data.
• With this method, no network overhead is generated.
• There is no bottleneck for pushing or pulling data, so lead times stay low.
• This combination of parallel processing over distributed storage is the idea behind MapReduce.
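A toy sketch of this idea in Python (no Hadoop involved; the fruit data and its partitioning are invented to mirror the farmer example): each distributed "storage area" is counted independently by a map step, and a reduce step merges the partial results.

```python
from collections import defaultdict

# Three distributed "storage areas", each holding part of the harvest.
partitions = [
    ["grape", "grape", "apple"],
    ["grape", "banana"],
    ["apple", "grape", "banana", "grape"],
]

def map_count(partition):
    """Map step: count fruit within one partition (runs independently,
    and in parallel in a real system)."""
    counts = defaultdict(int)
    for fruit in partition:
        counts[fruit] += 1
    return counts

def reduce_counts(partials):
    """Reduce step: merge the partial counts from every partition."""
    total = defaultdict(int)
    for partial in partials:
        for fruit, n in partial.items():
            total[fruit] += n
    return dict(total)

print(reduce_counts(map_count(p) for p in partitions))
# {'grape': 5, 'apple': 2, 'banana': 2}
```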
What is in Hadoop
1. Storage unit of Hadoop - HDFS (Hadoop Distributed File System)
- The Hadoop Distributed File System (HDFS) is a distributed file system designed to run on commodity hardware, i.e., hardware based on open standards. This means the system can run on different operating systems (OSes), such as Windows or Linux, without requiring special drivers.
- It is specially designed for storing huge data sets on commodity hardware.
2. Processing unit of Hadoop - MapReduce
- MapReduce is a programming model (or pattern) within the Hadoop framework that is used to access and process the big data stored in the Hadoop Distributed File System (HDFS). It is a core component, integral to the functioning of the Hadoop framework.
There is only one NameNode, but there can be multiple DataNodes; the master node (NameNode) and the slave nodes (DataNodes) together form the HDFS cluster.
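As a concrete illustration of the MapReduce pattern over data in HDFS, here is a minimal word count written in the Hadoop Streaming style: two small Python scripts that read from standard input and write tab-separated key/value pairs. The script names and the surrounding job wiring are assumptions for illustration, not taken from the slides.

```python
# mapper.py -- emit one "<word>\t1" line for every word read from stdin.
import sys

for line in sys.stdin:
    for word in line.split():
        print(f"{word}\t1")
```

```python
# reducer.py -- Hadoop delivers the mapper output sorted by key,
# so counts can be summed per word in a single pass.
import sys

current_word, current_count = None, 0
for line in sys.stdin:
    word, count = line.rstrip("\n").split("\t")
    if word == current_word:
        current_count += int(count)
    else:
        if current_word is not None:
            print(f"{current_word}\t{current_count}")
        current_word, current_count = word, int(count)
if current_word is not None:
    print(f"{current_word}\t{current_count}")
```

In a Hadoop cluster these two scripts would be submitted with the Hadoop Streaming jar, reading their input from and writing their output to HDFS paths; the NameNode tracks where the file blocks live, and the DataNodes hold the blocks the mappers actually read.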