Chapter 5- Distributed Database Systems
Chapter 5- Distributed Database Systems
Introduction
DBMS APPROACH
4
INTRODUCTION … (2)
A database is an ordered collection of related data that is built for a specific purpose.
A database may be organized as a collection of multiple tables, where a table represents a real world entity.
Each table has several different fields that represent the characteristic features of the entity.
A database management system is a collection of programs that enables creation and maintenance of a database.
DBMS is available as a software package that facilitates definition, construction, manipulation and sharing of data in
a database.
Construction of a database involves actual storing of the data in any storage medium.
Manipulation refers to the retrieving information from the database, updating the database and generating
reports.
5
CENTRALIZED DATABASE SYSTEMS … (1)
The data integrity is maximized as the whole database is stored at a single physical location. This means that it is
The data redundancy is minimal in the centralized database. All the data is stored together and not scattered
across different locations. So, it is easier to make sure there is no redundant data available.
Since all the data is in one place, there can be stronger security measures around it. So, the centralized database
is much more secure. Data is easily portable because it is stored at the same place.
The centralized database is cheaper than other types of databases as it requires less power and maintenance.
All the information in the centralized database can be easily accessed from the same location and at the same
time.
7
CENTRALIZED DATABASE SYSTEMS … (3)
Since all the data is at one location, it takes more time to search and access it. If the network is slow, this
There is a lot of data access traffic for the centralized database. This may create a bottleneck situation.
Since all the data is at the same location, if multiple users try to access it simultaneously it creates a
If there are no database recovery measures in place and a system failure occurs, then all the data in the
For proper functioning of any organization, there’s a need for a well-maintained database.
However, with the increase in globalization, organizations tend to be diversified across the globe.
They may choose to distribute data over local servers instead of a central database.
Data in each site can be managed by a DBMS independent of the other sites.
A distributed database incorporates transaction processing, but it is not synonymous with a transaction
processing system.
11
DISTRIBUTED DATABASE SYSTEMS … (3)
RELIABILITY: In distributed database system, if one system fails down or stops working for
AVAILABILITY: In distributed database system reliability can be achieved even if sever fails
locations. So the databases are available to every location which is easy to maintain.
12
DISTRIBUTED DBMS … (1)
A DDBMS is a software system that manages a distributed database, allowing data stored
DDBMS manages the distributed database and provides mechanisms so as to make the
In these systems, data is intentionally distributed among multiple nodes so that all computing
FEATURES
It synchronizes the database periodically and provides access mechanisms by the virtue of which the
It is used in application areas where large volumes of data are processed and accessed by numerous
users simultaneously.
Autonomy: Departments or subsidiaries may need independent control over their data while still
Need for Sharing of Data - The multiple organizational units often need to communicate with each
other and share their data and resources. This demands common databases or replicated databases
Reduced Network Load: Storing data close to users minimizes latency and bandwidth usage.
Parallel Processing: Queries can be executed across multiple nodes, improving response times
Horizontal Scalability: New nodes/sites can be added easily to handle growing data volumes.
No Single Point of Failure: Data replication across sites ensures availability even if one node fails.
Disaster Recovery: Distributed systems inherently support backup and recovery strategies.
16
DISTRIBUTED DBMS … (4)
Real-Time Analytics: Distributed systems enable faster processing of large-scale data (e.g., IoT,
big data).
High Availability: 24/7 access expectations (e.g., e-commerce, banking) necessitate distributed
redundancy.
Database Recovery
One of the common techniques used in DDBMS is replication of data across different sites.
Replication of data automatically helps in data recovery if database in any site is damaged.
Users can access data from other sites while the damaged site is being reconstructed. Thus,
Improved Reliability and Availability: If one site fails in a distributed system, the remaining sites can
continue to operate, ensuring system availability. This reduces the risk of a single point of failure, which
Better Performance: Queries can be processed at the location where the data resides, minimizing data
transfer and reducing query response time. This local processing enhances the system’s overall
performance.
Scalability: Distributed databases can be easily expanded by adding more sites without significantly
affecting performance. This allows organizations to scale their systems as data or user demands grow.
18
BENEFITS OF DISTRIBUTED DATABASE SYSTEMS … (2)
Modular Growth: New nodes can be added incrementally without disturbing the existing system,
making distributed databases more modular and flexible than centralized ones.
Data Localization: Frequently accessed data can be stored at the site closer to where it is needed,
Reduced Communication Costs: Since local sites handle local queries, less data is transmitted over
Improved Resource Sharing: Resources such as storage and processing power are distributed across
Increased Autonomy: Each site can operate independently and manage its own data, which allows for
complex than a centralized one. It involves data distribution, synchronization, concurrency control, and
Higher Setup and Maintenance Cost: Distributed databases require more hardware, software, and
Data Integrity and Consistency Challenges: Ensuring data consistency across all sites is difficult,
environment is more complex due to the involvement of multiple nodes, networks, and data replicas.
20
LIMITATION OF DISTRIBUTED DATABASE SYSTEMS … (1)
Security Risks: More access points and data transfers across networks increase the system’s exposure
Difficult Backup and Recovery: Coordinating a consistent backup or recovery process across
Version Management: Keeping software versions, schemas, and configurations synchronized across
In a homogeneous distributed database, all the sites use identical DBMS and operating systems.
The sites use identical DBMS or DBMS from the same vendor.
Each site is aware of all other sites and cooperates with other sites to process user requests.
In a heterogeneous distributed database, different sites have different operating systems, DBMS
The system may be composed of a variety of DBMSs like relational, network, hierarchical or object
oriented.
A site may not be aware of other sites and so there is limited co-operation in processing user quests.
25
TYPES OF DISTRIBUTED DATABASE … (5)
26
DISTRIBUTED DATABASE STORAGE … (1)
o REPLICATION
o FRAGMENTATION
27
DISTRIBUTED DATABASE STORAGE … (2)
REPLICATION
The advantage of database replication is that it increases data availability on different sites and allows for
However, database replication means that data requires constant updates and synchronization with other
Any changes made on one site must be recorded on other sites, or else inconsistencies occur.
Constant updates cause a lot of server overhead and complicate concurrency control, as a lot of concurrent
FRAGMENTATION
When it comes to fragmentation of distributed database storage, the relations are fragmented, which
The prerequisite for fragmentation is to make sure that the fragments can later be reconstructed into the
The advantage of fragmentation is that there are no data copies, which prevents data inconsistency.
2. Vertical fragmentation
30
DISTRIBUTED DATABASE STORAGE … (5)
TEACHING YOU IS GOOD LUCK