Consistency and Replication
Part I
CS403/534
Distributed Systems
Erkay Savas
Sabanci University
1
Consistency and Replication
• Reasons
1. Enhance reliability
2. Improve performance
• Major problems:
– Keep the replicated data consistent
– Extra bandwidth is needed between the replicas to
communicate updates
• Overview
– Consistency models: data-centric, client-centric
– Implementation of consistency models
2
Replication as a Scaling Technique
• Main problem: to keep replicas consistent.
– Updates must be propagated to every replica
– All conflicting (update) operations must be performed in
the same order everywhere
• Issues involved:
– Negative impact on bandwidth requirements
– Enforcing an ordered relationship between events
requires synchronization: physical or logical clocks
– How seriously we should take the strict (tight)
consistency approach
– To what extent consistency can be loosened depends highly
• on the purpose for which the data are used
• on the access & update patterns of the replicated
data
3
Data-Centric Consistency Models
• Data store:
– A term used to refer to physically distributed
shared data
– Read and write operations are of concern
– Each process has a replica nearby
[Figure: the logical data store is physically distributed and replicated; each process operates on its local (nearby) copy]
5
Strong Models: Strict Consistency
• Any read on a shared data item x returns a value
corresponding to the result of the most recent write
on x.
• All writes are instantaneously visible to all processes
• It is impossible to implement in distributed systems
since strict consistency relies on absolute global time
7
Sequential Consistency: Example
8
Sequential Consistency: Linearizability
• Linearizability is similar to sequential
consistency, but stronger than it (and weaker
than strict consistency)
• Operations are assumed to receive a timestamp
using a globally available clock
– one with finite precision
• If tsOP1(x) < tsOP2(x) then OP1(x) should
precede OP2(x) in the sequence
• Linearizable data is also sequentially consistent
• Processes use loosely synchronized clocks
9
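• A minimal Python sketch of this ordering rule (the Op class and the
linearize function below are illustrative assumptions, not from the slides):
each operation carries a timestamp from the globally available clock, and
the single agreed-upon sequence applies operations in timestamp order.

from dataclasses import dataclass
from typing import Optional

@dataclass
class Op:
    ts: float                  # timestamp from the (finite-precision) global clock
    kind: str                  # 'W' for write, 'R' for read
    var: str                   # shared data item, e.g. 'x'
    value: Optional[int] = None

def linearize(ops):
    """Apply operations in timestamp order: tsOP1(x) < tsOP2(x) => OP1 first."""
    store, reads = {}, []
    for op in sorted(ops, key=lambda o: o.ts):
        if op.kind == 'W':
            store[op.var] = op.value
        else:
            reads.append((op.ts, op.var, store.get(op.var)))
    return reads

# e.g. linearize([Op(2.0, 'R', 'x'), Op(1.0, 'W', 'x', 1)]) returns [(2.0, 'x', 1)]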
Sequential Consistency : Example (1)
Process P1: x = 1; print(y, z);
Process P2: y = 1; print(x, z);
Process P3: z = 1; print(x, y);
12
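• A small Python sketch of what this example permits (the code and the
assumption that x, y, z start at 0 are illustrative, not from the slides):
it enumerates every interleaving that respects each process's program order
and collects the print-output signatures that sequential consistency allows.

from itertools import permutations

# each process issues one assignment followed by one print (program order)
procs = {
    'P1': [('assign', 'x'), ('print', ('y', 'z'))],
    'P2': [('assign', 'y'), ('print', ('x', 'z'))],
    'P3': [('assign', 'z'), ('print', ('x', 'y'))],
}

def valid_interleavings():
    labels = [(p, i) for p in procs for i in range(2)]
    for order in permutations(labels):
        # keep only orderings in which each process's statements stay in issue order
        if all(order.index((p, 0)) < order.index((p, 1)) for p in procs):
            yield order

signatures = set()
for order in valid_interleavings():
    mem = {'x': 0, 'y': 0, 'z': 0}        # assumed initial values
    out = {}
    for p, i in order:
        op, arg = procs[p][i]
        if op == 'assign':
            mem[arg] = 1
        else:
            out[p] = ''.join(str(mem[v]) for v in arg)
    signatures.add(out['P1'] + out['P2'] + out['P3'])

print(len(signatures), 'distinct output signatures are sequentially consistent')

• Any signature outside this set (for example one in which every process
prints before seeing any other write) cannot be produced by a sequentially
consistent data store.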
Causal Consistency
• Necessary condition:
– Writes that are potentially causally related must be
seen by all processes in the same order.
– Concurrent writes (that are not causally related)
may be seen in a different order on different
machines.
• Two writes become causally related when one process
reads a value written by another and then issues a
write that may depend on what it read
• Vector timestamps are used to implement
causal consistency
13
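• A minimal vector-timestamp sketch (class and function names are
illustrative assumptions, not from the slides): each process advances its
own entry on a write and merges in the timestamp of any value it reads, so
a later write becomes causally dependent on the write it read.

class VectorClock:
    def __init__(self, num_procs, pid):
        self.v = [0] * num_procs
        self.pid = pid

    def local_write(self):
        self.v[self.pid] += 1             # a write advances this process's entry
        return list(self.v)               # timestamp attached to the write

    def observe(self, remote_ts):
        # reading a remotely written value merges its timestamp in
        self.v = [max(a, b) for a, b in zip(self.v, remote_ts)]

def causally_precedes(ts_a, ts_b):
    """ts_a happened-before ts_b iff ts_a <= ts_b componentwise and they differ."""
    return all(a <= b for a, b in zip(ts_a, ts_b)) and ts_a != ts_b

• Writes whose timestamps are ordered by causally_precedes must be applied
in that order at every replica; writes with incomparable timestamps are
concurrent and may be applied in different orders.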
Causal Consistency: Example (1)
14
Causal Consistency: Example (2)
15
FIFO Consistency
• Necessary Condition:
– Writes done by a single process are seen by all
other processes in the order in which they
were issued, but writes from different
processes may be seen in a different order by
different processes.
• Weaker form of strong consistency
• All writes generated by different
processes are concurrent
16
FIFO Consistency: Example (1)
[Figure: FIFO consistency event diagram; P1: W(x)a ...]
17
FIFO Consistency: Example (2)
Process P1: x = 1; print(y, z);
Process P2: y = 1; print(x, z);
Process P3: z = 1; print(x, y);
Statement execution as seen by each process:
As seen by P1: x = 1; print(y, z); y = 1; print(x, z); z = 1; print(x, y);
As seen by P2: x = 1; y = 1; print(x, z); print(y, z); z = 1; print(x, y);
As seen by P3: y = 1; print(x, z); z = 1; print(x, y); x = 1; print(y, z);
Process P1: x = 1; if (y == 0) kill(P2);
Process P2: y = 1; if (x == 0) kill(P1);
• Under sequential consistency at most one process can be
killed; under FIFO consistency both may be killed, since
each process may see the other's write only after its own test
19
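• A minimal Python sketch of the check behind Example (2) (the function and
the string encoding of statements are illustrative assumptions, not from the
slides): every view lists each process's statements in the order they were
issued, which is what the figure illustrates; FIFO consistency itself only
requires this for the writes of each single process.

def fifo_ok(programs, views):
    """Each view must list every process's statements in issue order."""
    for view in views.values():
        for prog in programs.values():
            pos = [view.index(stmt) for stmt in prog]
            if pos != sorted(pos):
                return False
    return True

programs = {                      # the three programs from the example
    'P1': ['x = 1', 'print(y, z)'],
    'P2': ['y = 1', 'print(x, z)'],
    'P3': ['z = 1', 'print(x, y)'],
}
views = {                         # statement order as seen by each process
    'P1': ['x = 1', 'print(y, z)', 'y = 1', 'print(x, z)', 'z = 1', 'print(x, y)'],
    'P2': ['x = 1', 'y = 1', 'print(x, z)', 'print(y, z)', 'z = 1', 'print(x, y)'],
    'P3': ['y = 1', 'print(x, z)', 'z = 1', 'print(x, y)', 'x = 1', 'print(y, z)'],
}
print(fifo_ok(programs, views))   # True, even though no single interleaving produces these views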
Weak Consistency Model
• Basic idea:
– individual read & write operations are not
immediately made known to other processes.
– Final effect is communicated (as in transactions)
– synchronization variable (S) (a lock or barrier)
– A synchronization variable has only one
operation: synchronize(S)
– A process gains exclusive access to a critical
region through synchronization operation
– While a process is in the critical region, its local
copy is allowed to be inconsistent with the others
– The synchronize operation pushes local updates to the
other replicas and brings remote updates into the
local replica
20
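• A minimal sketch of this idea (the LocalReplica class and its method
names are illustrative assumptions, not from the slides): reads and writes
touch only the local copy, and synchronize() both pushes pending local
updates to the other copies and pulls remote updates back in.

class LocalReplica:
    def __init__(self, shared_store):
        self.shared = shared_store        # dict standing in for all other copies
        self.local = dict(shared_store)   # this process's (possibly stale) view
        self.pending = {}                 # local writes not yet propagated

    def write(self, key, value):          # not immediately visible elsewhere
        self.local[key] = value
        self.pending[key] = value

    def read(self, key):                  # may return an outdated value
        return self.local.get(key)

    def synchronize(self):                # the single operation on S
        self.shared.update(self.pending)  # push local updates out
        self.pending.clear()
        self.local.update(self.shared)    # bring remote updates in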
Weak Consistency: Properties
1. Accesses to synchronization variables are sequentially consistent
• If processes P1 and P2 call synchronize(S), the
execution order of these operations will be the same
everywhere
2. Synchronization flushes the pipeline
• It forces all writes that are in progress, partially
completed, or completed at some local copies but not
all, to complete everywhere.
3. When data items are accessed, either for
reading or writing, all previous synchronizations
will have been completed.
• By doing a synchronization before reading shared
data, a process can be sure of getting the most
recent values.
21
Weak Consistency: Properties
• A good consistency model when isolated accesses
to shared data are rare
• With weak consistency, sequential consistency is
enforced between groups of operations
• Synchronization variables are used to delimit those
groups
• The weak consistency model tolerates a greater
degree of inconsistency for a limited amount of
time.
22
Weak Consistency: Example
[Figure: weak consistency example event diagram; P2: S R(x)b S]
23
Release Consistency (1)
• Drawbacks of weak consistency: When a synchronization
variable is accessed by a process, the data store does not
know if
– the process is finished writing data (exiting critical region)
– or it is just about to read data (entering critical region)
• Therefore, whenever an access to a synchronization variable
is made, the local data store must do two things:
1. Propagate all locally initiated writes to all other
copies
2. Gather in all writes from other copies
• If the data store could distinguish entering a critical
region from exiting one, only one of the two would be
needed each time
24
Release Consistency: Example
• Two types of synchronization operations are used:
1. Acquire
2. Release
• The programmer is responsible for calling acquire when
entering and release when exiting the critical region
[Figure: release consistency example event diagram; P3: R(x)a]
26
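• A minimal release-consistency sketch (class and method names are
illustrative assumptions, not from the slides): acquire() gathers other
copies' updates before the critical region, release() propagates local
updates when leaving it, and an ordinary lock provides the mutual exclusion.

from threading import Lock

class ReleaseConsistentReplica:
    def __init__(self, shared_store, lock: Lock):
        self.shared = shared_store
        self.lock = lock
        self.local = {}
        self.dirty = {}

    def acquire(self):
        self.lock.acquire()
        self.local.update(self.shared)    # gather writes from other copies

    def write(self, key, value):
        self.local[key] = value
        self.dirty[key] = value           # remember what must be propagated

    def read(self, key):
        return self.local.get(key)

    def release(self):
        self.shared.update(self.dirty)    # propagate locally initiated writes
        self.dirty.clear()
        self.lock.release()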
Entry Consistency
• With release consistency, all local updates are
made available to all copies during the release of
the lock
• With entry consistency, each individual shared
data item is associated with some
synchronization variable (e.g. lock or barrier)
• When acquiring the synchronization variable, the
most recent values of its associated shared data
item must be fetched
• Note: where release consistency affects all
shared data, entry consistency affects only
those associated with a synchronization variable.
27
Entry Consistency: Conditions
1. At an acquire, all remote changes to the
guarded data must be made visible
2. Before updating a shared data item, a
process must enter the critical region in
exclusive mode
3. If a process wants to enter a critical region
in nonexclusive mode, it must first check
with the owner of the synchronization
variable to fetch the most recent copies of
shared data.
28
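• A minimal entry-consistency sketch (class and method names are
illustrative assumptions, not from the slides): each shared item has its own
synchronization variable that records the current owner, acquiring it
fetches only that item's most recent value (conditions 1 and 3), and
updating requires exclusive mode (condition 2).

class SyncVariable:
    def __init__(self, item, value=None):
        self.item = item            # the single data item this variable guards
        self.value = value          # most recent released value of the item
        self.owner = None           # replica that last updated the item

class Replica:
    def __init__(self):
        self.cache = {}             # item name -> locally cached value
        self.exclusive = set()      # items currently held in exclusive mode

    def acquire(self, sv, exclusive=False):
        if sv.owner is not None and sv.owner is not self:
            self.cache[sv.item] = sv.value   # fetch the most recent copy
        if exclusive:
            sv.owner = self                  # take ownership before updating
            self.exclusive.add(sv.item)

    def write(self, sv, value):
        assert sv.item in self.exclusive     # condition 2: exclusive mode
        self.cache[sv.item] = value

    def release(self, sv):
        if sv.item in self.exclusive:
            sv.value = self.cache[sv.item]   # make the update available
            self.exclusive.discard(sv.item)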
Entry Consistency: Example
29
Summary of Consistency Models
Models that do not use synchronization operations.
Consistency | Description
Strict | Absolute time ordering of all shared accesses.
Linearizability | All processes must see all shared accesses in the same
order; accesses are furthermore ordered according to a global timestamp.
Sequential | All processes see all shared accesses in the same order;
accesses are not ordered in time.
Causal | All processes see causally-related shared accesses in the same
order.
FIFO | All processes see writes from each other in the order they were
used; writes from different processes may not always be seen in that order.
30
Summary of Consistency Models
Models with synchronization operations.
Consistency | Description
Weak | Shared data can be counted on to be consistent only after a
synchronization is done.
Release | Shared data are made consistent when a critical region is exited.
Entry | Shared data pertaining to a critical region are made consistent
when a critical region is entered.
31
Client-Centric Consistency Models
• Data-centric consistency: guarantees systemwide
consistency on data store
• Client-centric consistency: provides guarantee of
consistency for accesses of a single client
– Most large-scale distributed systems apply replication
for scalability,
– Simultaneous updates are rare, and when they happen
they are easy to resolve
– DNS: updates are done by a single process in a domain (no
write-write conflicts) and propagate slowly
– WWW: caches all over the place, but there is no
guarantee that you are reading the most recent version
of a page
32
Eventual Consistency
• Large-scale distributed and replicated
databases can tolerate a relatively high degree
of inconsistency.
• If no updates take place for a long time, all
replicas will gradually become 100% consistent.
• This model only guarantees that updates are
eventually propagated to all replicas.
• Write-write conflicts are often relatively easy
to solve assuming only a small group of processes
can perform updates.
• Very inexpensive to implement.
33
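• A minimal sketch of how convergence can be reached (the function and the
(timestamp, value) encoding are illustrative assumptions, not from the
slides): replicas periodically exchange state and keep the newer entry for
every key, so once updates stop, repeated exchanges make all replicas
identical.

def anti_entropy(replica_a, replica_b):
    """Each replica maps key -> (timestamp, value); keep the newer entry."""
    for key in set(replica_a) | set(replica_b):
        newest = max(replica_a.get(key, (0, None)),
                     replica_b.get(key, (0, None)),
                     key=lambda entry: entry[0])
        replica_a[key] = replica_b[key] = newest

a = {'x': (3, 'new')}
b = {'x': (1, 'old'), 'y': (2, 'only at b')}
anti_entropy(a, b)     # both now hold {'x': (3, 'new'), 'y': (2, 'only at b')}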
Consistency for Mobile Client
The client moves to another location and (transparently)
connects to another replica
[Figure: a mobile client accessing different replicas of a distributed database over a wide-area network; writes W(x1) and W(x2) occur at different local copies L1 and L2]
43
Monotonic Reads with Vector Timestamps
[Figure: monotonic reads with vector timestamps across server 1, server 2, and server 3; the client's Read_Set grows monotonically from (0, 0, 0) to (2, 1, 1) and then (2, 3, 1) as it reads, and a server synchronizes (step 4: Sync) before serving a read it cannot yet cover]
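• A minimal sketch of the protocol the figure illustrates (class and method
names are illustrative assumptions, not from the slides): the client carries
its read set as a vector timestamp, and a server must first catch up to
that vector before it is allowed to answer a read.

class Server:
    def __init__(self, num_servers):
        self.version = [0] * num_servers   # writes known here, per origin server
        self.data = {}

    def covers(self, read_set):
        return all(mine >= seen for mine, seen in zip(self.version, read_set))

    def sync_from(self, other):            # step 4 ("Sync") in the figure
        self.data.update(other.data)
        self.version = [max(a, b) for a, b in zip(self.version, other.version)]

class Client:
    def __init__(self, num_servers):
        self.read_set = [0] * num_servers  # writes relevant to the reads so far

    def read(self, key, server, other_servers):
        if not server.covers(self.read_set):
            for peer in other_servers:
                server.sync_from(peer)
        # record the writes this read now depends on
        self.read_set = [max(a, b) for a, b in zip(self.read_set, server.version)]
        return server.data.get(key)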