A It Report Final
A It Report Final
A It Report Final
DATABASE
Distributed Database:
A single logical database that is spread physically across
computers in multiple locations that are connected by a
data communications link.
Decentralized Database:
A collection of independent databases on non-networked
computers.
Distributed database environments (adapted from Bell and
Grimson, 1992)
Distributed Database Options
Horizontal partitioning
Different rows of a table distributed to different sites
Vertical partitioning
Different columns of a table distributed to different sites
Hybrid approach
Combination of few or all of the above techniques.
Data Replication
Advantages:
Reliability
Fast response
May avoid complicated distributed transaction integrity
routines (if replicated data is refreshed at scheduled
intervals)
Data Replication
Disadvantages:
Additional requirements for storage space
Additional time for update operations
Complexity and cost of updating
Integrity exposure of getting incorrect data if
replicated data is not updated simultaneously
Types of Data Replication
Push Replication:
updating site sends changes to other sites
Pull Replication:
receivingsites control when update messages
will be processed
Issues for Data Replication
Data timeliness – high tolerance for out-of-date data
may be required
DBMS capabilities – if DBMS cannot support multi-
node queries, replication may be necessary
Performance implications – refreshing may cause
performance problems for busy nodes
Network heterogeneity – complicates replication
Network communication capabilities – complete
refreshes place heavy demand on
telecommunications
Horizontal Partitioning
Different rows of a table at different sites
Advantages:
Data stored close to where it is used efficiency
Local access optimization better performance
Only relevant data is available security
Unions across partitions ease of query
Disadvantages:
Accessing data across partitions inconsistent access speed
No data replication backup vulnerability
Vertical Partitioning