0% found this document useful (0 votes)
3 views

Data Warehouse

A data warehouse is a centralized repository that stores large amounts of data from various sources, designed for analysis and reporting to inform business decisions. It integrates and organizes data to facilitate better decision-making, allowing users to uncover trends and insights. Key characteristics include being subject-oriented, integrated, historical, and read-only, which differentiates it from traditional databases.

Uploaded by

nics1425
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Data Warehouse

A data warehouse is a centralized repository that stores large amounts of data from various sources, designed for analysis and reporting to inform business decisions. It integrates and organizes data to facilitate better decision-making, allowing users to uncover trends and insights. Key characteristics include being subject-oriented, integrated, historical, and read-only, which differentiates it from traditional databases.

Uploaded by

nics1425
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 22

DATA WAREHOUSING:

A data warehouse is a central repository that


stores large amounts of data from various sources
within an organization. This data is specifically
designed and organized for analysis and reporting,
enabling users to uncover trends, patterns, and
other insights that can inform business decisions.
Introduction:
In today’s rapidly changing corporate environment,
organizations are turning to cloud-based technologies for
convenient data collection, reporting and analysis.
Organization needs to have reliable reporting and analysis of
large amounts of data.
Businesses need their data to be consolidated and integrated for
different levels of aggregation, from customer service to
partner integration to top-level executive business decisions.
This is where data warehousing comes in as it makes reporting
and analysis easier.
Data Warehousing comes in as a core component of business
intelligence that enables businesses to enhance their
performance.
What Is a Data Warehouse?
The data warehouse (DWH) is a repository where an organization
electronically stores data by extracting it from operational
systems, and making it available for ad-hoc queries and
scheduled reporting.
The process of building a data warehouse entails designing a data
model that can quickly generate insights.
Data stored in the DWH is different from data found in the
operational environment.
It is organized so that relevant data is clustered together to
facilitate day-to-day operations, analysis, and reporting.
This helps determine the trends over time and allows users to
create plans based on that information.
Hence, reinforcing the importance of data warehouse use in
• Data warehouses serve as a central repository for storing
and analyzing information to make better informed
decisions.
• An organization's data warehouse receives data from a
variety of sources, typically on a regular basis, including
transactional systems, relational databases, and other
sources.
• A data warehouse is a centralized storage system that allows
for the storing, analyzing, and interpreting of data in order
to facilitate better decision-making.
• Transactional systems, relational databases, and other
sources provide data into data warehouses on a regular
basis.
• A data warehouse is a type of data management
system that facilitates and supports business
intelligence (BI) activities, specifically analysis.
• Data warehouses are primarily designed to facilitate
searches and analyses and usually contain large
amounts of historical data.
• A data warehouse can be defined as a collection of
organizational data and information extracted from
operational sources and external data sources.
• The data is periodically pulled from various internal
applications like sales, marketing, and finance;
customer-interface applications; as well as external
partner systems.
• This data is then made available for decision-makers
to access and analyze.

Definition:
A data warehouse is a comprehensive repository of
current and historical information that is designed to
enhance an organization’s performance.
Key characteristics of a data warehouse:
• Centralized: Data is gathered from multiple sources and stored in a
single location.
• Integrated: Data goes through a cleaning and transformation process
to ensure consistency across different sources.
• Subject-oriented: The data is organized around specific business
subjects, such as customers, products, or sales.
• Historical: Data warehouses typically store historical data, allowing for
analysis of trends over time.
• Read-only: Data warehouses are primarily used for analysis, so the
data is usually read-only.
Characteristics of Data Warehouse:
Subject-Oriented:
• A data warehouse is subject-oriented since it provides
topic-wise information rather than the overall
processes of a business.
• Such subjects may be sales, promotion, inventory, etc.
• For example, if you want to analyze your company’s
sales data, you need to build a data warehouse that
concentrates on sales.
• Such a warehouse would provide valuable information
like ‘who was your best customer last year?’ or ‘who is
likely to be your best customer in the coming year?’
Integrated:
• A data warehouse is developed by integrating data from varied
sources into a consistent format.
• The data must be stored in the warehouse in a consistent and
universally acceptable manner in terms of naming, format, and
coding.
• This facilitates effective data analysis.
Non-Volatile:
• Data once entered into a data warehouse must remain
unchanged.
• All data is read-only.
• Previous data is not erased when current data is entered.
• This helps you to analyze what has happened and when.
Time-Variant:
• The data stored in a data warehouse is documented
with an element of time, either explicitly or implicitly.
• An example of time variance in Data Warehouse is
exhibited in the Primary Key, which must have an
element of time like the day, week, or month.
Database vs. Data Warehouse
• Although a data warehouse and a traditional database share
some similarities, they need not be the same idea.
• The main difference is that in a database, data is collected
for multiple transactional purposes.
• However, in a data warehouse, data is collected on an
extensive scale to perform analytics.
• Databases provide real-time data, while warehouses store
data to be accessed for big analytical queries.
• Data warehouse is an example of an OLAP system or an
online database query answering system.
• OLTP is an online database modifying system, for example,
ATM.
Data Warehouse Architecture
• A data warehouse architecture uses dimensional
models to identify the best technique for extracting
meaningful information from raw data and translating
it into an easy-to-understand structure.
Three main types of architecture when designing a
business-level real-time data warehouse;
• Single-tier Architecture
• Two-tier Architecture
• Three-tier Architecture
Data Warehouse Architecture:
Data warehouse architecture comprises a three-tier structure;
Bottom Tier:
• The bottom tier or data warehouse server usually represents a relational
database system. Back-end tools are used to cleanse, transform and feed data
into this layer.
Middle Tier:
• The middle tier represents an OLAP server that can be implemented in two ways.
• The ROLAP or Relational OLAP model is an extended relational database
management system that maps multidimensional data process to standard
relational process.
• The MOLAP or multidimensional OLAP directly acts on multidimensional data
and operations.
Top Tier:
• This is the front-end client interface that gets data out from the data warehouse.
• It holds various tools like query tools, analysis tools, reporting tools, and data
mining tools.
How Data Warehouse Works
• Data Warehousing integrates data and information collected
from various sources into one comprehensive database.
• For example, a data warehouse might combine customer
information from an organization’s point-of-sale systems, its
mailing lists, website, and comment cards.
• It might also incorporate confidential information about
employees, salary information, etc.
• Businesses use such components of data warehouse to
analyze customers.
• Data mining is one of the features of a data warehouse that
involves looking for meaningful data patterns in vast volumes
of data and devising innovative strategies for increased sales
and profits.
Types of Data Warehouse:
There are three main types of data warehouse;
Enterprise Data Warehouse (EDW):
• This type of warehouse serves as a key or central database that
facilitates decision-support services throughout the enterprise.
• The advantage to this type of warehouse is that it provides access to
cross-organizational information, offers a unified approach to data
representation, and allows running complex queries.
Operational Data Store (ODS):
• This type of data warehouse refreshes in real-time.
• It is often preferred for routine activities like storing employee
records.
• It is required when data warehouse systems do not support reporting
needs of the business.
Data Mart:
• A data mart is a subset of a data warehouse built to
maintain a particular department, region, or business
unit.
• Every department of a business has a central
repository or data mart to store data.
• The data from the data mart is stored in the ODS
periodically.
• The ODS then sends the data to the EDW, where it is
stored and used.
Data Warehouse Examples:
• Investment and Insurance companies use data warehouses
to primarily analyze customer and market trends and allied
data patterns.
• In sub-sectors like Forex and stock markets, data warehouse
plays a significant role because a single point difference can
result in huge losses across the board.
• Retail chains use data warehouses for marketing and
distribution, so they can track items, examine pricing policies
and analyze buying trends of customers.
• They use data warehouse models for business intelligence
and forecasting needs.
• Healthcare companies, use data warehouse concepts
to generate treatment reports, share data with
insurance companies and in research and medical
units.
• Healthcare systems depend heavily upon enterprise
data warehouses because they need the latest,
updated treatment information to save lives.
Data Warehousing Tools:
• Data warehouse tools are software components used to perform
several operations on an extensive data set.
• These tools help to collect, read, write and transfer data from various
sources.
• They are designed to support operations like data sorting, filtering,
merging, etc.
Data warehouse tools can be categorized as:
• Query and reporting tools
• Application Development tools
• Data mining tools
• OLAP tools
Some popular data warehouse tools are Xplenty, Amazon Redshift,
Teradata, Oracle 12c, Informatica, IBM Infosphere, Cloudera, and
Panoply.
Functions of Data Warehouse Tools and Utilities
• The following are the functions of data warehouse tools and
utilities −
• Data Extraction − Involves gathering data from multiple
heterogeneous sources.
• Data Cleaning − Involves finding and correcting the errors in data.
• Data Transformation − Involves converting the data from legacy
format to warehouse format.
• Data Loading − Involves sorting, summarizing, consolidating,
checking integrity, and building indices and partitions.
• Refreshing − Involves updating from data sources to warehouse.
Note − Data cleaning and data transformation are important steps in
improving the quality of data and data mining results.
Benefits of Data Warehouse:
• Improved data consistency
• Better business decisions
• Easier access to enterprise data for end-users
• Better documentation of data
• Reduced computer costs and higher productivity
• Enabling end-users to ask ad-hoc queries or reports without deterring
the performance of operational systems
• Collection of related data from various sources into a place
Companies having dedicated Data Warehouse teams emerge ahead of
others in key areas of product development, pricing, marketing,
production time, historical analysis, forecasting, and customer
satisfaction. Though data warehouses can be slightly expensive, they
pay in the long run.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy