0% found this document useful (0 votes)

8 views14 pages

Lecture 8

The document provides an overview of distributed data replication, detailing its definition, goals, types, consistency models, challenges, examples, and trade-offs. It emphasizes the importance of replication for reliability, performance, and fault tolerance, while discussing various replication methods such as synchronous, asynchronous, and multi-master replication. Additionally, it addresses challenges like the CAP theorem, conflict resolution, and the implications of performance versus consistency in real-world applications.

Uploaded by

Kashmala Alam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views14 pages

Lecture 8

Uploaded by

Kashmala Alam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Distributed Data Replication

Prepared By Miangul Shafiq Ahmad Jan

Contents:

• Introduction to Distributed Data Replication

• Goals of Distributed Data Replication
• Types of Replication in Distributed Systems
• Consistency Models in Distributed Replication
• Challenges in Distributed Data Replication
• Examples and Use Cases
• Trade-Offs and Real-World Considerations
1. Introduction to Distributed Data Replication
Definition:
Distributed data replication is the process of storing copies of data on multiple nodes within a
distributed system. These nodes could be located in the same data center or across different geographic
locations.
Why Replicate Data?
• Reliability and Availability: If one node fails, data is still accessible from another node.
• Performance Optimization: Replication reduces latency by bringing data closer to users.
• Fault Tolerance: Replication helps protect against data loss and enhances disaster recovery.
Example:
Think about a global social media platform like Facebook. If you access it from Europe, the server
handling your requests might be closer to Europe, while someone in Asia would connect to a server in
Asia. The data (like profile info and posts) is replicated across these servers for faster access and
reliability.
2. Goals of Distributed Data Replication
• High Availability: Ensuring data can be accessed even in the event of system
failures.
• Fault Tolerance: Allowing the system to recover from hardware failures.
• Scalability: Distributing the load across multiple nodes to handle larger amounts of
data and traffic.
• Low Latency: Bringing data closer to users to reduce delays.
3. Types of Replication in Distributed Systems
1. Synchronous Replication:
o Changes are instantly copied to all replicas.
o Pros: Strong consistency across replicas, as updates are immediately reflected
everywhere.
o Cons: Increased latency since changes must be confirmed by all replicas before
the system considers an operation complete.
o Example: Banks may use synchronous replication for transactions to ensure
that account balances remain consistent across all replicas.
3. Types of Replication in Distributed Systems(cont.)

2. Asynchronous Replication:

o Changes are made to a primary replica and then propagated to other replicas over time.

o Pros: Faster writes because the system does not wait for all replicas to confirm.

o Cons: Temporary inconsistency since other replicas might not have the latest data
immediately.

o Example: Social media posts may use asynchronous replication, where updates can
propagate over a few seconds without causing issues.
3. Types of Replication in Distributed Systems(cont.)

3. Multi-Master Replication:

• Allows updates on multiple replicas simultaneously, which are then synchronized across
replicas.

• Pros: Enables high availability and distributed workloads.

• Cons: Conflict resolution is challenging, as multiple replicas might have conflicting updates.

• Example: Collaboration tools like Google Docs use multi-master replication to allow multiple
users to edit documents simultaneously.
3. Types of Replication in Distributed Systems(cont.)

4. Primary-Backup Replication (Master-Slave):

• A primary replica handles all writes, and other replicas (backups) receive updates from the
primary.

• Pros: Simple conflict management, as only one replica accepts writes.

• Cons: The primary node can be a bottleneck and a single point of failure.

• Example: A website with a master database that replicates data to read-only replicas to
handle traffic more efficiently.
4. Consistency Models in Distributed Replication
Consistency refers to the state of data across replicas. Different models offer different levels of consistency:
1. Strong Consistency:
• Guarantees that all replicas reflect the latest write before any read operation.
• Use Case: Critical systems where accurate data is essential, like online banking.
2. Eventual Consistency:
• Guarantees that, in the absence of new updates, all replicas will eventually become consistent.
• Use Case: Social media platforms where a few seconds of inconsistency don’t cause issues.
3. Causal Consistency:
• Ensures that if one operation causally affects another, the system respects this order in replication.
• Use Case: Messaging applications where responses should follow messages in order.
4. Read-After-Write Consistency:
• Ensures that once a write completes, the client can read the updated data immediately.
• Use Case: Blog platforms, where users should see their posts immediately after publishing.
5. Challenges in Distributed Data Replication
1. Consistency vs. Availability (CAP Theorem):

o The CAP Theorem states that in any distributed data system, you can only choose two of
the following three guarantees:

▪ Consistency: Every read receives the most recent write.

▪ Availability: Every request receives a response.

▪ Partition Tolerance: The system continues to operate even if there’s a

communication breakdown between nodes.

o Implication: Choosing between these factors impacts the design of the distributed
system. For instance, systems that prioritize availability and partition tolerance may
sacrifice consistency (eventual consistency).
5. Challenges in Distributed Data Replication (cont.)
2. Conflict Resolution:
o Techniques to resolve conflicts include:
▪ Timestamping: Keeping the most recent update.
▪ Vector Clocks: Tracking the order of operations across nodes.
▪ Application Logic: Using domain-specific rules (like majority vote) to handle conflicts.
3. Data Latency:
o Replication adds network overhead, as data needs to be copied to multiple locations.
o Trade-offs must be considered between data accuracy and the speed of access.
4. Fault Detection and Failover:
o Detecting failed nodes and rerouting requests to healthy replicas requires careful coordination.
o Load balancers or distributed algorithms (like Paxos or Raft) help manage failover and maintain
system stability.
6. Examples and Use Cases
• Google Search Infrastructure: Google replicates search data across multiple data
centers globally, ensuring fast access and reliability.
• Netflix: Uses distributed data replication to store and stream media content across
the world, reducing latency and providing a seamless user experience.
• Amazon Web Services (AWS) S3: AWS S3 replicates data in multiple geographic
locations for redundancy and disaster recovery.
7. Trade-Offs and Real-World Considerations
• Performance vs. Consistency:
A highly available system may prioritize quick responses over having the latest data,
while a highly consistent system may experience delays.
• Cost:
Data replication requires additional storage, bandwidth, and hardware, which can
increase operational costs.
• Geographic Distribution and Legal Considerations:
Some regions have regulations requiring data to stay within specific geographic
boundaries, influencing where and how data is replicated.

Ch02 - Big Data Storage Concepts
No ratings yet
Ch02 - Big Data Storage Concepts
23 pages
Distributed Systems Assignment 2
100% (1)
Distributed Systems Assignment 2
3 pages
2002 Replication Techniques in Distributed Systems Advances in Database Systems.9780792398004.29706
100% (1)
2002 Replication Techniques in Distributed Systems Advances in Database Systems.9780792398004.29706
166 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
48 pages
DRKP Module 2 1
No ratings yet
DRKP Module 2 1
77 pages
Chapter 7-Consistency and Replication
No ratings yet
Chapter 7-Consistency and Replication
73 pages
Distributed DBMS
No ratings yet
Distributed DBMS
62 pages
IAU ST Lecture5
No ratings yet
IAU ST Lecture5
50 pages
6 Replication Nhom3
No ratings yet
6 Replication Nhom3
44 pages
CH-07 Replication
No ratings yet
CH-07 Replication
35 pages
Nosql 1
No ratings yet
Nosql 1
40 pages
REPLICATION
No ratings yet
REPLICATION
20 pages
DS Chapter V7replication
No ratings yet
DS Chapter V7replication
33 pages
Lecture 10 - Replication
No ratings yet
Lecture 10 - Replication
37 pages
Hugoguerreiro Midterm
No ratings yet
Hugoguerreiro Midterm
32 pages
Consistency and Replication1
No ratings yet
Consistency and Replication1
30 pages
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
No ratings yet
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
27 pages
Chapter 7 - Consistency and Replication
No ratings yet
Chapter 7 - Consistency and Replication
28 pages
Unit 4
No ratings yet
Unit 4
24 pages
Unit - IV Notes
No ratings yet
Unit - IV Notes
42 pages
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
No ratings yet
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
28 pages
Replication
No ratings yet
Replication
21 pages
Ds Chapter 6
No ratings yet
Ds Chapter 6
23 pages
Introduction To Distributed Computing
No ratings yet
Introduction To Distributed Computing
57 pages
(Ci) 09 (Ci) 12
No ratings yet
(Ci) 09 (Ci) 12
31 pages
Fault Tolerance Unit 3-4
No ratings yet
Fault Tolerance Unit 3-4
32 pages
Lec 3 - Basic Concepts
No ratings yet
Lec 3 - Basic Concepts
32 pages
DS CH6 - Consistency and Replication
No ratings yet
DS CH6 - Consistency and Replication
18 pages
DS (Unit 4)
No ratings yet
DS (Unit 4)
32 pages
Consistency and Replication55
No ratings yet
Consistency and Replication55
17 pages
Replication
No ratings yet
Replication
16 pages
Distributed Shared Memory
No ratings yet
Distributed Shared Memory
51 pages
Chapter 7 Consistency and Replication
No ratings yet
Chapter 7 Consistency and Replication
43 pages
DBMS
No ratings yet
DBMS
16 pages
Consistency Models
No ratings yet
Consistency Models
15 pages
UNIT 3 Mob. Comp.
No ratings yet
UNIT 3 Mob. Comp.
12 pages
Module 2
No ratings yet
Module 2
40 pages
Testing
No ratings yet
Testing
11 pages
Distributed Shared Memory
No ratings yet
Distributed Shared Memory
18 pages
Distributed DBMS Replication
No ratings yet
Distributed DBMS Replication
10 pages
Unit I
No ratings yet
Unit I
17 pages
DSC5
No ratings yet
DSC5
13 pages
Chapter 7kec
No ratings yet
Chapter 7kec
8 pages
DC Mod 5
No ratings yet
DC Mod 5
12 pages
Advanced Distributed Systems Replication: What Is Replication? Reasons For Replication
No ratings yet
Advanced Distributed Systems Replication: What Is Replication? Reasons For Replication
20 pages
DS Unit5
No ratings yet
DS Unit5
13 pages
Unit 5
No ratings yet
Unit 5
12 pages
Distributed System Notes
No ratings yet
Distributed System Notes
24 pages
Distributed 3
No ratings yet
Distributed 3
5 pages
Css Essay Paper 2025 All Topic Outline
No ratings yet
Css Essay Paper 2025 All Topic Outline
31 pages
No SQL Ia-01 - Micro
No ratings yet
No SQL Ia-01 - Micro
6 pages
Distributed 5
No ratings yet
Distributed 5
5 pages
DATA Replication
No ratings yet
DATA Replication
4 pages
11 DS - Ch18
No ratings yet
11 DS - Ch18
9 pages
NoSQL - Unit 2
No ratings yet
NoSQL - Unit 2
11 pages
NoSQL - Unit2
No ratings yet
NoSQL - Unit2
8 pages
Module 2 Nosql
No ratings yet
Module 2 Nosql
10 pages
Distributed Systems As DS DS
No ratings yet
Distributed Systems As DS DS
7 pages
Full Stack Development With Spring Boot and React
100% (1)
Full Stack Development With Spring Boot and React
6 pages
Supplementary Assignment
No ratings yet
Supplementary Assignment
7 pages
Consistency Models in Distributed Systems
No ratings yet
Consistency Models in Distributed Systems
1 page
Synchronization
No ratings yet
Synchronization
3 pages
CIRRUS Photo - DICOM Conformance Statement
No ratings yet
CIRRUS Photo - DICOM Conformance Statement
93 pages
Xii Ip Study Material
No ratings yet
Xii Ip Study Material
92 pages
Online Graduation Clearance
No ratings yet
Online Graduation Clearance
39 pages
Flood Prediction Analysis
No ratings yet
Flood Prediction Analysis
42 pages
Lecture 04
No ratings yet
Lecture 04
23 pages
Lecture 3.10 - Deadlock and Multiple Granularity
No ratings yet
Lecture 3.10 - Deadlock and Multiple Granularity
23 pages
Italy - Periodic VAT Return: SAP Library Documentation
No ratings yet
Italy - Periodic VAT Return: SAP Library Documentation
11 pages
Ach 1039171363
No ratings yet
Ach 1039171363
25 pages
Lecture 4
No ratings yet
Lecture 4
19 pages
Salesforce Administrator Interview Questions and Answers
No ratings yet
Salesforce Administrator Interview Questions and Answers
38 pages
04.M.E. Remotesensing and Geomatics
No ratings yet
04.M.E. Remotesensing and Geomatics
57 pages
Data Warehousing
No ratings yet
Data Warehousing
61 pages
MongoDB Datatypes
No ratings yet
MongoDB Datatypes
14 pages
Practical File
No ratings yet
Practical File
32 pages
Syntax Tree
No ratings yet
Syntax Tree
24 pages
Climate Change Pakistan CSS Lecture
No ratings yet
Climate Change Pakistan CSS Lecture
10 pages
Hult Prize - Pitch Layout
No ratings yet
Hult Prize - Pitch Layout
9 pages
Dfa and Nfa
No ratings yet
Dfa and Nfa
50 pages
Gridding PDF
No ratings yet
Gridding PDF
91 pages
Entrepreneurship MidTerm Notes
No ratings yet
Entrepreneurship MidTerm Notes
3 pages
Query Optimization
No ratings yet
Query Optimization
9 pages
Genero Studio
No ratings yet
Genero Studio
18 pages
Groups of HCI BS-CS 8th (Morning)
No ratings yet
Groups of HCI BS-CS 8th (Morning)
2 pages
Jurnal Kesehatan: Perancangan Sistem Informasi Medical Check Up Guna Mempercepat Pelayanan MCU Di RSUD Brebes
No ratings yet
Jurnal Kesehatan: Perancangan Sistem Informasi Medical Check Up Guna Mempercepat Pelayanan MCU Di RSUD Brebes
16 pages
Answer Updated
No ratings yet
Answer Updated
35 pages
Chapter 5 Ensuring People Centered Clean and Efficient Governance
No ratings yet
Chapter 5 Ensuring People Centered Clean and Efficient Governance
15 pages
Senior Data Scientist
No ratings yet
Senior Data Scientist
3 pages
Mysql 2. Oracle 3. Microsoft SQL Server
No ratings yet
Mysql 2. Oracle 3. Microsoft SQL Server
11 pages
20761C TrainerPrepGuide PDF
No ratings yet
20761C TrainerPrepGuide PDF
7 pages
ERP Strategy MGMT
No ratings yet
ERP Strategy MGMT
5 pages
Process Control Narratives
No ratings yet
Process Control Narratives
7 pages
Airlift - DevOps Cloud - Case Study
No ratings yet
Airlift - DevOps Cloud - Case Study
4 pages
Access Full Complete Solution Manual Here: Chapter 1: Databases and Database Users
50% (2)
Access Full Complete Solution Manual Here: Chapter 1: Databases and Database Users
10 pages
Billing Management Console
No ratings yet
Billing Management Console
5 pages
Having Total 4+ Years of IT Experience and On Business Intelligence Tool-Tableau Desktop
No ratings yet
Having Total 4+ Years of IT Experience and On Business Intelligence Tool-Tableau Desktop
4 pages
Lab 1 Introduction To MS Access: Fig. 1 Database Window
No ratings yet
Lab 1 Introduction To MS Access: Fig. 1 Database Window
6 pages
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
From Everand
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
Byron Ellis
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lecture 8

Uploaded by

Lecture 8

Uploaded by

Distributed Data Replication

Prepared By Miangul Shafiq Ahmad Jan

• Introduction to Distributed Data Replication

• Pros: Enables high availability and distributed workloads.

4. Primary-Backup Replication (Master-Slave):

• Pros: Simple conflict management, as only one replica accepts writes.

▪ Consistency: Every read receives the most recent write.

▪ Availability: Every request receives a response.

▪ Partition Tolerance: The system continues to operate even if there’s a

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.