DWDM-Unit-5 Notes Mr. Rohit Pratap Singh
DWDM-Unit-5 Notes Mr. Rohit Pratap Singh
By
Rohit Pratap Singh
Assistant Professor (Senior Scale)
CSE-AIML
ABESEC, Ghaziabad
Data Aggregation
• Data aggregation is any process in which information is gathered
and expressed in a summary form, for purposes such as
statistical analysis.
• A common aggregation purpose is to get more information
about particular groups based on specific variables such as age,
profession, or income.
• The information about such groups can then be used for Web
site personalization to choose content and advertising likely to
appeal to an individual belonging to one or more groups for
which data has been collected. For example, a site that sells
music CDs might advertise certain CDs based on the age of the
user and the data aggregate for their age group.
• Online analytic processing (OLAP) is a simple type of data
aggregation in which the marketer uses an online reporting
mechanism to process the information.
Data Aggregation
Unit testing:
• In unit testing, each component is separately
tested.
• Each module, i.e., procedure, program, SQL
Script, Unix shell is tested.
• This test is performed by the developer.
Data Warehousing - Testing
Integration Testing
• In integration testing, the various modules of the
application are brought together and then tested
against the number of inputs.
• It is performed to test whether the various
components do well after integration.
Data Warehousing - Testing
System Testing
• In system testing, the whole data warehouse
application is tested together.
• The purpose of system testing is to check whether
the entire system works correctly together or not.
• System testing is performed by the testing team.
• Since the size of the whole data warehouse is very
large, it is usually possible to perform minimal
system testing before the test plan can be enacted.
Data Warehousing - Testing
Test Schedule
• First of all, the test schedule is created in the process of
developing the test plan. In this schedule, we predict the
estimated time required for the testing of the entire data
warehouse system.
• There are different methodologies available to create a test
schedule, but none of them are perfect because the data
warehouse is very complex and large. Also the data
warehouse system is evolving in nature. One may face the
following issues while creating a test schedule −
• A simple problem may have a large size of query that can
take a day or more to complete, i.e., the query does not
complete in a desired time scale.
• There may be hardware failures such as losing a disk or
human errors such as accidentally deleting a table or
overwriting a large table.
Data Warehousing - Testing
Testing Backup Recovery
Testing the backup recovery strategy is extremely
important. Here is the list of scenarios for which this
testing is needed −
• Media failure
• Loss or damage of table space or data file
• Loss or damage of redo log file
• Loss or damage of control file
• Instance failure
• Loss or damage of archive file
• Loss or damage of table
• Failure during data failure
Data Warehousing - Testing
Testing Operational Environment
• There are a number of aspects that need to be
tested. These aspects are listed below.
• Security − A separate security document is required
for security testing. This document contains a list of
disallowed operations and devising tests for each.
• Scheduler − Scheduling software is required to
control the daily operations of a data warehouse. It
needs to be tested during system testing. The
scheduling software requires an interface with the
data warehouse, which will need the scheduler to
control overnight processing and the management
of aggregations.
Data Warehousing - Testing
• Disk Configuration. − Disk configuration also
needs to be tested to identify I/O bottlenecks.
The test should be performed with multiple times
with different settings.
• Management Tools. − It is required to test all the
management tools during system testing. Here is
the list of tools that need to be tested.
– Event manager
– System manager
– Database manager
– Configuration manager
– Backup recovery manager
Applications of Data Warehouse
Applications of Data Warehouse
1. Banking Industry
• In the banking industry, concentration is given to risk management
and policy reversal as well analyzing consumer data, market
trends, government regulations and reports, and more
importantly financial decision making.
2. Finance Industry
• Similar to the applications seen in banking, mainly revolve around
evaluation and trends of customer expenses which aids in
maximizing the profits earned by their clients.
3. Consumer Goods Industry
• They are used for prediction of consumer trends, inventory
management, market and advertising research. In-depth analysis
of sales and production is also carried out. Apart from these,
information is exchanged business partners and clientele.
Applications of Data Warehouse
4. Government and Education
• The federal government utilizes the warehouses for research in
compliance, whereas the state government uses it for services related to
human resources like recruitment, and accounting like payroll
management.
5. Healthcare
• One of the most important sector which utilizes data warehouses is the
Healthcare sector. All of their financial, clinical, and employee records are
fed to warehouses as it helps them to strategize and predict outcomes,
track and analyze their service feedback, generate patient reports, share
data with tie-in insurance companies, medical aid services, etc.
Applications of Data Warehouse
6. Hospitality Industry
• A major proportion of this industry is dominated by hotel and
restaurant services, car rental services, and holiday home services.
They utilize warehouse services to design and evaluate their
advertising and promotion campaigns where they target
customers based on their feedback and travel patterns.
7. Insurance
• As the saying goes in the insurance services sector, “Insurance can
never be bought, it can be only be sold”, the warehouses are
primarily used to analyze data patterns and customer trends,
apart from maintaining records of already existing participants.
The design of tailor-made customer offers and promotions is also
possible through warehouses.
Applications of Data Warehouse
8. Hospitality Industry
• A major proportion of this industry is dominated by hotel and
restaurant services, car rental services, and holiday home services.
They utilize warehouse services to design and evaluate their
advertising and promotion campaigns where they target
customers based on their feedback and travel patterns.
9. Insurance
• As the saying goes in the insurance services sector, “Insurance can
never be bought, it can be only be sold”, the warehouses are
primarily used to analyze data patterns and customer trends,
apart from maintaining records of already existing participants.
The design of tailor-made customer offers and promotions is also
possible through warehouses.
Applications of Data Warehouse
10. Manufacturing and Distribution Industry
• This industry is one of the most important sources of income for
any state. A manufacturing organization has to take several make-
or-buy decisions which can influence the future of the sector,
which is why they utilize high-end OLAP tools as a part of data
warehouses to predict market changes, analyze current business
trends, detect warning conditions, view marketing developments,
and ultimately take better decisions.
11. The Retailers
• Retailers serve as middlemen between producers and consumers.
It is important for them to maintain records of both the parties to
ensure their existence in the market.
• They use warehouses to track items, their advertising promotions,
and the consumers buying trends. They also analyze sales to
determine fast selling and slow selling product lines and
determine their shelf space through a process of elimination.
Applications of Data Warehouse
12. Services Sector
• Data warehouses find themselves to be of use in the service
sector for maintenance of financial records, revenue
patterns, customer profiling, resource management, and
human resources.