0% found this document useful (0 votes)
6 views

Chapter 5- Distributed Database Systems

The document provides an overview of distributed database systems, detailing their features, benefits, and limitations compared to centralized database systems. It explains the role of Distributed Database Management Systems (DDBMS) in managing data across multiple locations and highlights the types of distributed databases, including homogeneous and heterogeneous systems. Additionally, it discusses data storage methods such as replication and fragmentation, emphasizing the importance of reliability, availability, and performance in modern database applications.

Uploaded by

natnaelabera96
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

Chapter 5- Distributed Database Systems

The document provides an overview of distributed database systems, detailing their features, benefits, and limitations compared to centralized database systems. It explains the role of Distributed Database Management Systems (DDBMS) in managing data across multiple locations and highlights the types of distributed databases, including homogeneous and heterogeneous systems. Additionally, it discusses data storage methods such as replication and fragmentation, emphasizing the importance of reliability, availability, and performance in modern database applications.

Uploaded by

natnaelabera96
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

HARAMAYA UNIVERSITY

COLLEGE OF COMPUTING AND INFORMATICS


DEPARTMENT OF SOFTWARE ENGINEERING

ADVANCED DATABASE SYSTEMS (SENG 3072)

CHAPTER FIVE: DISTRIBUTED DATABASE SYSTEM


2 CONTENTS

 Introduction

 Centralized Database System

 Distributed Database System

 Distributed Database Management System (DDBMS)

 Benefits and Limitations of Distributed Databases

 Types of Distributed Databases Systems

 Distributed Database Architecture

 Distributed Data Storage


3
INTRODUCTION … (1)

DBMS APPROACH
4
INTRODUCTION … (2)

 A database is an ordered collection of related data that is built for a specific purpose.

 A database may be organized as a collection of multiple tables, where a table represents a real world entity.

 Each table has several different fields that represent the characteristic features of the entity.

 A database management system is a collection of programs that enables creation and maintenance of a database.

 DBMS is available as a software package that facilitates definition, construction, manipulation and sharing of data in

a database.

 Definition of a database includes description of the structure of a database.

 Construction of a database involves actual storing of the data in any storage medium.

 Manipulation refers to the retrieving information from the database, updating the database and generating

reports.
5
CENTRALIZED DATABASE SYSTEMS … (1)

A centralized database: is a database that is located,

stored, and maintained in a single location. This location

is most often a central computer or database system, for

example a desktop or server, or a mainframe computer


6
CENTRALIZED DATABASE SYSTEMS … (2)

BENEFITS OF CENTRAL DATABASE SYSTEM

 The data integrity is maximized as the whole database is stored at a single physical location. This means that it is

easier to coordinate the data and it is as accurate and consistent as possible.

 The data redundancy is minimal in the centralized database. All the data is stored together and not scattered

across different locations. So, it is easier to make sure there is no redundant data available.

 Since all the data is in one place, there can be stronger security measures around it. So, the centralized database

is much more secure. Data is easily portable because it is stored at the same place.

 The centralized database is cheaper than other types of databases as it requires less power and maintenance.

 All the information in the centralized database can be easily accessed from the same location and at the same

time.
7
CENTRALIZED DATABASE SYSTEMS … (3)

LIMITATION OF CENTRAL DATABASE SYSTEM

 Since all the data is at one location, it takes more time to search and access it. If the network is slow, this

process takes even more time.

 There is a lot of data access traffic for the centralized database. This may create a bottleneck situation.

 Since all the data is at the same location, if multiple users try to access it simultaneously it creates a

problem. This may reduce the efficiency of the system.

 If there are no database recovery measures in place and a system failure occurs, then all the data in the

database will be destroyed.


8
CENTRALIZED DATABASE SYSTEMS … (4)

For proper functioning of any organization, there’s a need for a well-maintained database.

 In the recent past, databases used to be centralized in nature.

 However, with the increase in globalization, organizations tend to be diversified across the globe.

 They may choose to distribute data over local servers instead of a central database.

 Thus, arrived the concept of Distributed Databases.

 A distributed database is a collection of multiple interconnected databases, which are spread

physically across various locations that communicate via a computer network.


9
DISTRIBUTED DATABASE SYSTEMS … (1)
10
DISTRIBUTED DATABASE SYSTEMS … (2)

FEATURES OF DISTRIBUTED DATABAE SYSTEMS

 Databases in the collection are logically interrelated with each other.

 Often they represent a single logical database.

 Data is physically stored across multiple sites.

 Data in each site can be managed by a DBMS independent of the other sites.

 The processors in the sites are connected via a network.

 They do not have any multiprocessor configuration.

 A distributed database is not a loosely connected file system.

 A distributed database incorporates transaction processing, but it is not synonymous with a transaction

processing system.
11
DISTRIBUTED DATABASE SYSTEMS … (3)

THE CONCEPT OF DISTRIBUTED DATABASE WAS BUILT WITH A GOAL TO IMPROVE:

RELIABILITY: In distributed database system, if one system fails down or stops working for

some time another system can complete the task.

AVAILABILITY: In distributed database system reliability can be achieved even if sever fails

down. Another system is available to serve the client request.

PERFORMANCE: Performance can be achieved by distributing database over different

locations. So the databases are available to every location which is easy to maintain.
12
DISTRIBUTED DBMS … (1)

 A DDBMS is a software system that manages a distributed database, allowing data stored

across multiple locations to be accessed and manipulated as if it were a single, centralized

database, while ensuring coordination, consistency, and transparency.

 DDBMS manages the distributed database and provides mechanisms so as to make the

databases transparent to the users.

 In these systems, data is intentionally distributed among multiple nodes so that all computing

resources of the organization can be optimally used.


13
DISTRIBUTED DBMS … (2)

FEATURES

 It is used to create, retrieve, update and delete distributed databases.

 It synchronizes the database periodically and provides access mechanisms by the virtue of which the

distribution becomes transparent to the users.

 It ensures that the data modified at any site is universally updated.

 It is used in application areas where large volumes of data are processed and accessed by numerous

users simultaneously.

 It is designed for heterogeneous database platforms.

 It maintains confidentiality and data integrity of the databases.


14
DISTRIBUTED DBMS … (3)

FACTORS ENCOURAGING DDBMS

The following factors encourage moving over to DDBMS

 Distributed Nature of Organizational Units:

 Geographical Distribution: Businesses with multiple branches, remote offices, or global

operations require local data access while maintaining centralized control.

 Autonomy: Departments or subsidiaries may need independent control over their data while still

participating in a unified system.

 Need for Sharing of Data - The multiple organizational units often need to communicate with each

other and share their data and resources. This demands common databases or replicated databases

that should be used in a synchronized manner.


15
DISTRIBUTED DBMS … (4)

Performance and Scalability:

 Reduced Network Load: Storing data close to users minimizes latency and bandwidth usage.

 Parallel Processing: Queries can be executed across multiple nodes, improving response times

for large datasets.

 Horizontal Scalability: New nodes/sites can be added easily to handle growing data volumes.

Reliability and Fault Tolerance-

 No Single Point of Failure: Data replication across sites ensures availability even if one node fails.

 Disaster Recovery: Distributed systems inherently support backup and recovery strategies.
16
DISTRIBUTED DBMS … (4)

Modern Application Demand

 Real-Time Analytics: Distributed systems enable faster processing of large-scale data (e.g., IoT,

big data).

 High Availability: 24/7 access expectations (e.g., e-commerce, banking) necessitate distributed

redundancy.

Database Recovery

 One of the common techniques used in DDBMS is replication of data across different sites.

 Replication of data automatically helps in data recovery if database in any site is damaged.

 Users can access data from other sites while the damaged site is being reconstructed. Thus,

database failure may become almost inconspicuous to users.


17
BENEFITS OF DISTRIBUTED DATABASE SYSTEMS … (1)

Following are the advantages of distributed databases over centralized databases.

 Improved Reliability and Availability: If one site fails in a distributed system, the remaining sites can

continue to operate, ensuring system availability. This reduces the risk of a single point of failure, which

is common in centralized databases.

 Better Performance: Queries can be processed at the location where the data resides, minimizing data

transfer and reducing query response time. This local processing enhances the system’s overall

performance.

 Scalability: Distributed databases can be easily expanded by adding more sites without significantly

affecting performance. This allows organizations to scale their systems as data or user demands grow.
18
BENEFITS OF DISTRIBUTED DATABASE SYSTEMS … (2)

 Modular Growth: New nodes can be added incrementally without disturbing the existing system,

making distributed databases more modular and flexible than centralized ones.

 Data Localization: Frequently accessed data can be stored at the site closer to where it is needed,

improving access speed and reducing network load.

 Reduced Communication Costs: Since local sites handle local queries, less data is transmitted over

the network, resulting in reduced communication costs.

 Improved Resource Sharing: Resources such as storage and processing power are distributed across

multiple sites, allowing for better utilization and sharing.

 Increased Autonomy: Each site can operate independently and manage its own data, which allows for

greater autonomy and local control.


19
LIMITATION OF DISTRIBUTED DATABASE SYSTEMS … (1)

 Complexity of Implementation: Designing and managing a distributed system is significantly more

complex than a centralized one. It involves data distribution, synchronization, concurrency control, and

recovery across multiple sites.

 Higher Setup and Maintenance Cost: Distributed databases require more hardware, software, and

networking infrastructure, increasing the cost of setup and ongoing maintenance.

 Data Integrity and Consistency Challenges: Ensuring data consistency across all sites is difficult,

especially when data is replicated or accessed simultaneously by multiple users.

 Difficult Troubleshooting and Debugging: Identifying and resolving issues in a distributed

environment is more complex due to the involvement of multiple nodes, networks, and data replicas.
20
LIMITATION OF DISTRIBUTED DATABASE SYSTEMS … (1)

 Security Risks: More access points and data transfers across networks increase the system’s exposure

to security threats, such as unauthorized access or data interception.

 Concurrency Control Overhead: Coordinating transactions across distributed sites requires

sophisticated concurrency control mechanisms, which can affect performance.

 Difficult Backup and Recovery: Coordinating a consistent backup or recovery process across

multiple sites is more challenging than in a centralized database.

 Version Management: Keeping software versions, schemas, and configurations synchronized across

distributed systems adds operational burden.


21
TYPES OF DISTRIBUTED DATABASE … (1)
22
TYPES OF DISTRIBUTED DATABASE … (2)

HOMOGENEOUS DISTRIBUTED DATABASES

 In a homogeneous distributed database, all the sites use identical DBMS and operating systems.

 Its properties are

 The sites use very similar software.

 The sites use identical DBMS or DBMS from the same vendor.

 Each site is aware of all other sites and cooperates with other sites to process user requests.

 The database is accessed through a single interface as if it is a single database.


23
TYPES OF DISTRIBUTED DATABASE … (3)
24
TYPES OF DISTRIBUTED DATABASE … (4)

HETEROGENEOUS DISTRIBUTED DATABASES

 In a heterogeneous distributed database, different sites have different operating systems, DBMS

products and data models.

 Its properties are:

 Different sites use dissimilar schemas and software.

 The system may be composed of a variety of DBMSs like relational, network, hierarchical or object

oriented.

 Query processing is complex due to dissimilar schemas.

 Transaction processing is complex due to dissimilar software.

 A site may not be aware of other sites and so there is limited co-operation in processing user quests.
25
TYPES OF DISTRIBUTED DATABASE … (5)
26
DISTRIBUTED DATABASE STORAGE … (1)

Distributed database storage is managed in two ways:

o REPLICATION

o FRAGMENTATION
27
DISTRIBUTED DATABASE STORAGE … (2)

REPLICATION

 In database replication, the systems store copies of data on different sites.

 If an entire database is available on multiple sites, it is a fully redundant database.

 The advantage of database replication is that it increases data availability on different sites and allows for

parallel query requests to be processed.

 However, database replication means that data requires constant updates and synchronization with other

sites to maintain an exact database copy.

 Any changes made on one site must be recorded on other sites, or else inconsistencies occur.

 Constant updates cause a lot of server overhead and complicate concurrency control, as a lot of concurrent

queries must be checked in all available sites.


28
DISTRIBUTED DATABASE STORAGE … (3)
29
DISTRIBUTED DATABASE STORAGE … (4)

FRAGMENTATION

 When it comes to fragmentation of distributed database storage, the relations are fragmented, which

means they are split into smaller parts.

 Each of the fragments is stored on a different site, where it is required.

 The prerequisite for fragmentation is to make sure that the fragments can later be reconstructed into the

original relation without losing data.

 The advantage of fragmentation is that there are no data copies, which prevents data inconsistency.

 There are two types of fragmentation:


1. Horizontal fragmentation

2. Vertical fragmentation
30
DISTRIBUTED DATABASE STORAGE … (5)
TEACHING YOU IS GOOD LUCK

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy