0% found this document useful (0 votes)

63 views

Distributed Databases: CMP-3440 - Database Systems

The document discusses distributed databases and distributed processing. Key points include: 1) A distributed database is a single logical database that is physically spread across multiple connected computers. 2) Distributed databases allow for data to be replicated, partitioned horizontally or vertically across sites. 3) A distributed database management system (DDBMS) manages the distributed database and makes the distribution transparent to users.

Uploaded by

Asad Tahir Kalyar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views

Distributed Databases: CMP-3440 - Database Systems

Uploaded by

Asad Tahir Kalyar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

CMP-3440 – Database Systems

Distributed Processing and Distributed Databases

Lecture 11

Distributed Databases
• A large organization having multiple locations may choose to use a central
database server or to distribute database to local servers.

• The distributed database is still centrally administered.

• The network must allow the users to share the data.

• A distributed database required multiple instances of a DBMS.

2
Evolution
• Merging of two diverging concepts:
1. Integration through the use of database technology; and
2. Distribution through the use of data communication technology.

Concepts
• A distributed database is a single logical database that is spread physically across
computers in multiple locations that are connected by a communication network.

• Software system that permits the management of distributed database and

makes the distribution transparent tousers.

• Data splits into fragments.

• Fragments may be replicated.

• Each DBMS participates at least one global application.

4
Environment
1. Homogeneous: The same DBMS is used at each node. Two types:
i. Autonomous: each DBMS works independently, passing messages back and
forth to share data updates.
ii. Nonautonomous: a central, or master, DBMS coordinates database access
and updates across the nodes.

2. Heterogeneous: potentially different DBMSs are used at each node.

i. Systems: support some or all of the functionality of one logical database.
ii. Gateways: simple paths are created to other databases, without the
benefits of one logical database.

Advantages Disadvantages
• Transparency. • Software cost and complexity.
• Local autonomy. • Processing overhead.
• Increased reliability and availability. • Data integrity.
• Modular growth. • Slow response
• Lower communication cost.
• Faster response.

6
Strategies for Distributing
1. Data Replication.

2. Horizontal Partitioning.

3. Vertical Partitioning.

4. Combination of the above.

Strategies for Distributing

Data Replication.
• Updates data copies. Can use either synchronous or asynchronous distributed
database technologies.
i. Snapshot Replication: simple table copying or periodic snapshots from
multiple sites are collected at a master/primary database to; then the snapshot
is sent periodically to each site where there is a copy. Can be full, differential or
incremental.
ii. Near-Real-Time Replication: messages for each completed transaction are
triggered for broadcast across the network informing all nodes to update data
as soon as possible; without forcing a confirmation to the originating node.
iii. Pull Replication: the target, not the source node, controls when a local
database is updated. Thus the target database determines when it needs to be
updated/refreshed and requests a snapshot.

8
Strategies for Distributing
Horizontal Partitioning.
• Some of the rows of a table/relation are put into a base relation at one site,and
other rows are put into a base relation at anothersite.
• The transactions are processed locally to minimize response time. (normal
pattern for persons using ATM).

Strategies for Distributing

Vertical Partitioning.
• Some of the columns of a table/relation are put into a base relation at one site,
and other columns are put into a base relation at anothersite.

10
Strategies for Distributing
Combination of Operations.
• Almost unlimited combinations of strategies.
i. Engineering parts, accounting, customer data can be vertically partitioned?
ii. Standard parts data can be horizontally partitioned among 3 locations?
iii. Standard price list data can be replicated in all 3 locations?

Selecting right Data Distribution Strategy

1. Totally centralized at one location, accessed from many geographically
distributed sites.
2. Partially or totally replicated across geographically distributed sites, with each
copy periodically updated with snapshots.
3. Partially or totally replicated across geographically distributed sites, with near-
real-time synchronization of updates.
4. Partitioned into segments at different geographically distributed sites, but still
within one logical database and one distributed DBMS.
5. Partitioned into independent, non-integrated segments spanning multiple
computers and database software.

12
Comparison of Data Distribution Strategy

Functions of a Distributed DBMS

i. Keep track of where data are located in a distributed data dictionary.
ii. Determine the location from which to retrieve data and the location at which
to process each part of a distributed query.
iii. Provide security, concurrency and dead-lock control, global query optimization.
iv. Provide consistency among copies of data across remote sites.
v. Present a single logical database that is physically distributed. Using global
primary key control.
vi. Provide scalability and transparency.
vii. Permit different nodes to run different DBMS.

14
Location Transparency
• Users can act as if all the data were located at a single node.

• Querying does not require the user to know where the data are physically stored.

• Administrator does not need to create a view using UNION operator.

• To achieve location transparency the distributed DBMS must have access to an

accurate and current data dictionary/directory that indicates locations of all data.

Replication Transparency
• Users may treat the item as if it were a single item at a single node.

• The distributed DBMS will consult the data dictionary and determine that this is a
local transaction or a copy has been replicated locally.

• If data are replicated at some sites but not at all, that request will have to be
routed to another site.

• The DDBMS will select the fastest route of response without letting the user
know whether replication was done or not.

16
Failure Transparency
• Each node in a DDBMS is subject to the same types of failure as in a centralized
system; with some additional risk of failures of a communication link.

• Error detection and system reconfiguration are probably the functions of

communications controller or processor, however the DDBMS is responsible for
data recovery when a failure has occurred.

• DDBMS at each node has a component called transaction manager to:

i. Maintain a log of transactions and before & after databaseimages.
ii. Maintain concurrency control scheme to ensure integrity during parallel
transactions.

Commit Protocol
• Transaction Manager executes a commit protocol; which is a well-defined procedure to ensure
that global transaction is either successfully completed at each site or else aborted.

• Most widely used two-phase commit protocol:

i. First the originating site of global transaction sends a request to each of the sites that will
process some portion of the transaction.
ii. Each site locks its portion of database being updated; and processes the sub-transaction but
does not immediately commit to local database. Instead the result is stored in temporary file.
iii. Each site notifies originating site when it has completed its sub-transaction.
iv. When all sites have responded, a message is broadcasted to all participating sites to ask
whether they want to commit; each site returns an “OK” or “NOT OK” message.
v. If all “OK” are received, it broadcasts message to commit their portions, if one or more “NOT
OK” are received, it broadcasts message to abort transaction.

18
Concurrency Transparency
• Concurrency control is more complexed in a DDBMS; because concurrent users
are spread out among multiple sites and the data are often replicated at several
sites.

• Transaction managers at each site must cooperate to provide concurrency using:

i. Locking.
ii. Versioning.
iii. Time-Stamping.

Time-Stamping
• Even if two events occur simultaneously at different sites, each will have a unique
time-stamp.

• To ensure that transactions are processed in serial order; thus avoiding the need
of locks (and possible deadlocks).

20
Query Optimization
• With DDBMS the response of a query may require the DBMS to assemble data
from several different sites.

• Suppose:
Supplier_T(SupplierNumber, City) 10,000 records, stored in Lahore

Part_T(PartNumber, Color) 100,000 records, stored in Faisalabad

Shipment_T(SupplierNumber, PartNumber) 1,000,000 records, stored in Rawalpindi

Query Optimization
• A query, written by a user from Lahore is:

SELECT Supplier_T.SupplierNumber
FROM Supplier_T, Shipment_T, Part_T
WHERE Supplier_T.City = ‘Karachi’
AND Shipment_T.PartNumber = Part_T.PartNumber
AND Part_T.Color = ‘Red’;

22
Oracle Replication
• Still an emerging technology, rather than established. Current releases do not provide all
of the features.

• Oracle GoldenGate:
• (Heterogeneous) Middleware to replicate data between oracle and non-oracle data-stores.
• Oracle Streams:
• (Homogeneous) Built-in feature of the Oracle database, is a data replication and integration
feature.
• Snapshot Replication:
• Materialized views are mostly used for unidirectional (one-way) replication; for pulling data.
• Advanced Replication:
• Supports unidirectional replication, multiple masters, conflict resolution.

Oracle Replication Manager

• GUI for setting up, managing and monitoring a replicationenvironment.

Software Review Testing - SRT Question and Answers - Trenovision
50% (4)
Software Review Testing - SRT Question and Answers - Trenovision
46 pages
Easy2boot Usb Multiboot
57% (7)
Easy2boot Usb Multiboot
41 pages
Distributed Database Concepts
No ratings yet
Distributed Database Concepts
52 pages
Examination Seating Arrangement System
58% (43)
Examination Seating Arrangement System
58 pages
Chapter 4 - Distributed Database System
No ratings yet
Chapter 4 - Distributed Database System
52 pages
Distributed Databases: Not Just A Client/server System
No ratings yet
Distributed Databases: Not Just A Client/server System
43 pages
Distributed Databases: Not Just A Client/server System
No ratings yet
Distributed Databases: Not Just A Client/server System
43 pages
ch6 Distributed Database
No ratings yet
ch6 Distributed Database
25 pages
DDB Slides
No ratings yet
DDB Slides
67 pages
Distributed Database
100% (1)
Distributed Database
24 pages
Distributed Data Management: Distributed Systems Department of Computer Science UC Irvine
No ratings yet
Distributed Data Management: Distributed Systems Department of Computer Science UC Irvine
67 pages
10 Distributeddbms
No ratings yet
10 Distributeddbms
56 pages
Chapter 5 - Distributed Databases Roobera
No ratings yet
Chapter 5 - Distributed Databases Roobera
58 pages
Distributed Databases
No ratings yet
Distributed Databases
55 pages
Distributed Databases
No ratings yet
Distributed Databases
53 pages
DDBMS
No ratings yet
DDBMS
44 pages
Distributed DBMS
No ratings yet
Distributed DBMS
62 pages
Enterprise Systems: Distributed Databases and Systems - DT211 4
No ratings yet
Enterprise Systems: Distributed Databases and Systems - DT211 4
25 pages
DISTRIBUTED DATABASES Presentation
No ratings yet
DISTRIBUTED DATABASES Presentation
13 pages
Distributed Databases
No ratings yet
Distributed Databases
46 pages
ddb unit 1-5
No ratings yet
ddb unit 1-5
190 pages
Module 1
No ratings yet
Module 1
24 pages
Distributed Databases AND Client-Server Architechures
No ratings yet
Distributed Databases AND Client-Server Architechures
73 pages
Tybca Recent Trends in It Chpter 1
No ratings yet
Tybca Recent Trends in It Chpter 1
16 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
Advanced Database Chapter 6 and 7
No ratings yet
Advanced Database Chapter 6 and 7
30 pages
Distributed Database Systems
No ratings yet
Distributed Database Systems
50 pages
Chapter - 7 Distributed Database System
100% (1)
Chapter - 7 Distributed Database System
54 pages
Final
No ratings yet
Final
46 pages
DDIS U1-3
No ratings yet
DDIS U1-3
40 pages
DDB Slides
No ratings yet
DDB Slides
30 pages
CSE 453 Slide 1
No ratings yet
CSE 453 Slide 1
46 pages
Unit - I Distributed Data Processing
100% (2)
Unit - I Distributed Data Processing
27 pages
Lecture 1
No ratings yet
Lecture 1
46 pages
UNIT- 1 DDB
No ratings yet
UNIT- 1 DDB
34 pages
Distributed DBM S
No ratings yet
Distributed DBM S
67 pages
17 DatabaseArchitectures
No ratings yet
17 DatabaseArchitectures
41 pages
Unit-4-DDBMS (1)
No ratings yet
Unit-4-DDBMS (1)
58 pages
Data Communication Basics CH 7
No ratings yet
Data Communication Basics CH 7
27 pages
Distributed DB
No ratings yet
Distributed DB
16 pages
Distributed Database Systems (DDBS)
No ratings yet
Distributed Database Systems (DDBS)
30 pages
A Distributed Database Management System ('DDBMS') Is A Software System
No ratings yet
A Distributed Database Management System ('DDBMS') Is A Software System
5 pages
Topic 7 DDBMS
No ratings yet
Topic 7 DDBMS
28 pages
Advanced Data Base Management Systems
No ratings yet
Advanced Data Base Management Systems
35 pages
Basis For Distributed Database Technology
No ratings yet
Basis For Distributed Database Technology
35 pages
Distributed Database Management Systems (2)
No ratings yet
Distributed Database Management Systems (2)
73 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
55 pages
Ddbms Notes
No ratings yet
Ddbms Notes
21 pages
ch6 Distributed Database
No ratings yet
ch6 Distributed Database
35 pages
07-DistributedDataManagement
No ratings yet
07-DistributedDataManagement
44 pages
Distributed Databases: Benefits and Issues To Be Considered
No ratings yet
Distributed Databases: Benefits and Issues To Be Considered
25 pages
lecture-1-ho (1)
No ratings yet
lecture-1-ho (1)
62 pages
Lecture 1 Ho PDF
No ratings yet
Lecture 1 Ho PDF
62 pages
Distributed Database Management Systems
No ratings yet
Distributed Database Management Systems
123 pages
Distributed Database: Source
No ratings yet
Distributed Database: Source
19 pages
DB unit-2
No ratings yet
DB unit-2
27 pages
Chapter 4 Distributed Database Systems
No ratings yet
Chapter 4 Distributed Database Systems
69 pages
Unit - 2 (1) DBMS
No ratings yet
Unit - 2 (1) DBMS
25 pages
Chapter 6
No ratings yet
Chapter 6
45 pages
Introduction To Building Dapps: A Comprehensive Guide
From Everand
Introduction To Building Dapps: A Comprehensive Guide
Joshua Baba Adugibilla
No ratings yet
Database Management System
From Everand
Database Management System
Knowledge Flow
No ratings yet
Mastering Terraform A Comprehensive Guide to Infrastructure As Code
From Everand
Mastering Terraform A Comprehensive Guide to Infrastructure As Code
Mario Marinov
No ratings yet
Optimized Caching Techniques: Application for Scalable Distributed Architectures
From Everand
Optimized Caching Techniques: Application for Scalable Distributed Architectures
Peter Jones
No ratings yet
Logical Functions: Excel Easy
No ratings yet
Logical Functions: Excel Easy
4 pages
BeagleBone Black Cookbook - Sample Chapter
100% (2)
BeagleBone Black Cookbook - Sample Chapter
67 pages
Pdfmyurl Com
No ratings yet
Pdfmyurl Com
2 pages
Module 2 Unified - Process PDF
No ratings yet
Module 2 Unified - Process PDF
38 pages
Lee Strobel El Caso Del Creador Descargar
50% (2)
Lee Strobel El Caso Del Creador Descargar
4 pages
Mcafee Data Loss Prevention 11.6.x Interface Reference Guide 11-1-2022
No ratings yet
Mcafee Data Loss Prevention 11.6.x Interface Reference Guide 11-1-2022
294 pages
Huawei 2G, 3G, Lte
0% (1)
Huawei 2G, 3G, Lte
2 pages
MFRA
No ratings yet
MFRA
54 pages
Res 2 Dmod
100% (1)
Res 2 Dmod
12 pages
List of Civil Engineering Softwares - Online Civil
0% (1)
List of Civil Engineering Softwares - Online Civil
4 pages
Resume Final
No ratings yet
Resume Final
2 pages
Yukta Bhati Resume
No ratings yet
Yukta Bhati Resume
2 pages
User Manual: Smart Bracelet I5 Plus
No ratings yet
User Manual: Smart Bracelet I5 Plus
7 pages
AZ-204 Develop Azure Compute Solutions (25-30%)
No ratings yet
AZ-204 Develop Azure Compute Solutions (25-30%)
20 pages
Apache Karaf in Real Life - 1
No ratings yet
Apache Karaf in Real Life - 1
51 pages
Import Extport Facilities
No ratings yet
Import Extport Facilities
7 pages
BlazonEnterprise MSIPropertiesGuide
No ratings yet
BlazonEnterprise MSIPropertiesGuide
11 pages
List of Programming Languages
No ratings yet
List of Programming Languages
8 pages
CISA's Guidance Mobile Communications Best Practices
No ratings yet
CISA's Guidance Mobile Communications Best Practices
5 pages
Distributed Filesystems Review
No ratings yet
Distributed Filesystems Review
30 pages
C - Double Buffering On lpc1788 - Stack Overflow
No ratings yet
C - Double Buffering On lpc1788 - Stack Overflow
3 pages
C Exam
0% (1)
C Exam
8 pages
Jaamsim Instructor's Guide
No ratings yet
Jaamsim Instructor's Guide
7 pages
Getting Started: Nuget Package Manager
No ratings yet
Getting Started: Nuget Package Manager
2 pages
AP2152 - IT Support (How To Access VDI Environment)
No ratings yet
AP2152 - IT Support (How To Access VDI Environment)
15 pages
Uttam Resume
No ratings yet
Uttam Resume
4 pages
M.S. Excel PDF (Sscstudy - Com)
No ratings yet
M.S. Excel PDF (Sscstudy - Com)
50 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Distributed Databases: CMP-3440 - Database Systems

Uploaded by

Distributed Databases: CMP-3440 - Database Systems

Uploaded by

CMP-3440 – Database Systems

Distributed Processing and Distributed Databases

• The distributed database is still centrally administered.

• The network must allow the users to share the data.

• A distributed database required multiple instances of a DBMS.

• Software system that permits the management of distributed database and

• Data splits into fragments.

• Fragments may be replicated.

• Each DBMS participates at least one global application.

2. Heterogeneous: potentially different DBMSs are used at each node.

4. Combination of the above.

Strategies for Distributing

Strategies for Distributing

Selecting right Data Distribution Strategy

Functions of a Distributed DBMS

• Administrator does not need to create a view using UNION operator.

• To achieve location transparency the distributed DBMS must have access to an

• Error detection and system reconfiguration are probably the functions of

• DDBMS at each node has a component called transaction manager to:

• Most widely used two-phase commit protocol:

• Transaction managers at each site must cooperate to provide concurrency using:

Part_T(PartNumber, Color) 100,000 records, stored in Faisalabad

Shipment_T(SupplierNumber, PartNumber) 1,000,000 records, stored in Rawalpindi

Oracle Replication Manager

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.