cs516 Unit III
CLOCK SYNCHRONIZATION
Contents
o Need for clock synchronization
o Lamport’s algorithm (logical clock)
o Physical clock synchronization algorithms
o Cristian's algorithm (Passive)
o Berkeley Algorithm (Active)
Need for clock synchronization
When each machine has its own clock, an event that occurred after another event may
nevertheless be assigned an earlier time
No common clock or other precise global time source exists in a distributed system
This could lead to unexpected behavior and various system failures
o Example: Building output using make tool, which relies on timestamp of source files
and object files
Lamport's Algorithm
Synchronizes logical clocks
happens-before relation
o a -> b read as a happens before b, means all processes agree that event a
happens before event b
o transitive relation : a -> b and b -> c, then a -> c
Each message carries the sending time as per its clock
If the receiver's clock shows a value prior to the sending time, it fast forwards its clock to
one more than the sending time
Between every two events, the clock must tick at least once (a short sketch of these rules follows the example below)
o Example: three processes, each with its own clock, ticking 6, 8, and 10 times per time unit
o A late-stamped message arriving at process 1 forces its reading of 56 up to 61; its subsequent readings become 69, 77, 85
o Similarly, process 0's reading of 54 is corrected to 70, and the next one to 76
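A minimal Python sketch of these clock rules (the class and method names are illustrative, not from the notes):

class LamportClock:
    def __init__(self):
        self.time = 0

    def tick(self):
        # Local event: between every two events the clock must tick at least once.
        self.time += 1
        return self.time

    def send(self):
        # Each message carries the sending time as per the local clock.
        return self.tick()

    def receive(self, sending_time):
        # If the local clock is behind the sending time, fast forward past it.
        self.time = max(self.time, sending_time) + 1
        return self.time

# Illustrative use: a receiver whose clock reads 56 gets a message stamped 60.
p = LamportClock()
p.time = 56
print(p.receive(60))   # 61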
Cristian's Algorithm
One machine has a WWV receiver (the time server) and the goal is to have all the other
machines keep synchronized with it
Periodically each machine sends a message to the time server asking it for the current
time
The time server responds as fast as it can with a message containing its current time
The time server is passive
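A sketch of one client round, assuming a request_server_time() stub that performs the call to the time server; the half-round-trip correction is the usual refinement for message delay and is not spelled out in the notes:

import time

def cristian_sync(request_server_time):
    t0 = time.monotonic()                # local time when the request is sent
    server_time = request_server_time()  # passive server replies with its current time
    t1 = time.monotonic()                # local time when the reply arrives
    # Assume the reply took roughly half the measured round trip.
    return server_time + (t1 - t0) / 2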
Berkeley Algorithm
The time server is active, polling every machine periodically and asking its time
Based on the answers, it computes an average time and tells all the other machines to
advance their clocks to the new time or slow their clocks down until some specified
reduction has been achieved
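An illustrative sketch of one polling round (the function name and time units are made up for the example); the full algorithm would additionally ignore clock readings that are too far off:

def berkeley_round(server_time, client_times):
    # The active server polls the clients, averages all clocks (its own included),
    # and tells each machine the relative adjustment to apply.
    all_times = [server_time] + list(client_times)
    average = sum(all_times) / len(all_times)
    return [average - t for t in all_times]   # adjustment per machine

# Server reads 3:00, clients read 3:05 and 2:55 (in minutes): everyone converges on 3:00.
print(berkeley_round(180, [185, 175]))   # [0.0, -5.0, 5.0]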
MUTUAL EXCLUSION
When one process enters a critical region to read or update certain shared data structures,
mutual exclusion ensures that no other process will use the shared data structures at the
same time
Centralized Algorithm
One process is elected as the coordinator (Ex: the one running on the machine with the
highest network address)
Whenever a process wants to enter a critical region, it sends a request message to the
coordinator stating which critical section it wants to enter and asking for permission
If no other process is currently in that critical region, the coordinator sends back a reply -
granting permission
When the reply arrives, the requesting process enters the critical region
If some other process is already in that critical region, the coordinator cannot grant permission
The actual method of denying permission is system dependent, e.g., sending no reply or a
"Permission denied" message
When the process exits the critical region, it sends a message to the coordinator releasing its
exclusive access
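A sketch of the coordinator's bookkeeping for a single critical region; message passing is abstracted into a grant callback and the names are illustrative:

from collections import deque

class Coordinator:
    def __init__(self):
        self.holder = None    # process currently in the critical region, if any
        self.queue = deque()  # requests that could not be granted yet

    def request(self, pid, grant):
        if self.holder is None:
            self.holder = pid
            grant(pid)                       # reply granting permission
        else:
            self.queue.append((pid, grant))  # deny by queueing (no reply for now)

    def release(self, pid):
        assert pid == self.holder
        if self.queue:
            self.holder, grant = self.queue.popleft()
            grant(self.holder)               # next waiter enters the critical region
        else:
            self.holder = None

c = Coordinator()
c.request(1, grant=lambda p: print("OK to", p))   # OK to 1
c.request(2, grant=lambda p: print("OK to", p))   # queued, no reply yet
c.release(1)                                      # OK to 2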
Distributed Algorithm
When a process wants to enter a critical section, it builds a message containing the name
of the critical region, its process number and the current time
It sends the message to all other processes, including itself
If the receiver is not in the critical region and does not want to enter it, it sends back an
OK message to the sender
If the receiver is already in the critical region, it does not reply. It queues the request.
If the receiver wants to enter the critical region, it compares the timestamp of the
incoming message with the one that it has sent. The lowest one wins.
If the incoming message is lower, the receiver sends back an OK message
If its own message has a lower timestamp, the receiver queues the incoming request and
sends nothing
After sending out requests, the process sits back and waits until everyone else has given
permission
After it exits the critical region, it sends OK message to all processes on its queue and
deletes them all
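A sketch of the decision a receiver makes for an incoming request in this distributed algorithm; using the process number to break timestamp ties is an assumption the notes leave implicit:

def handle_request(state, my_request, incoming):
    # state: 'RELEASED' (not in, not interested), 'HELD' (inside), or 'WANTED'.
    # my_request / incoming: (timestamp, process_number) pairs.
    if state == 'RELEASED':
        return 'OK'        # not in the region and not interested: reply OK
    if state == 'HELD':
        return 'QUEUE'     # already inside: queue the request, reply later
    # Both want the region: the lowest (timestamp, process number) wins.
    return 'OK' if incoming < my_request else 'QUEUE'

print(handle_request('WANTED', my_request=(12, 1), incoming=(9, 3)))   # OK
print(handle_request('WANTED', my_request=(8, 1), incoming=(9, 3)))    # QUEUE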
Token Ring Algorithm
The processes are organized in a logical ring around which a token circulates; only the process
holding the token may enter the critical region, and it passes the token along when it exits (or
when it does not want to enter)
ELECTION ALGORITHMS
A Ring Algorithm
When any process notices that the coordinator is not functioning, it builds an ELECTION
message containing its own process number and sends the message to its successor.
If the successor is down, the sender skips over the successor and goes to the next member
along the ring, or the one after that, until a running process is located.
At each step, the sender adds its own process number to the list in the message.
Eventually, the message gets back to the process that started it all.
That process recognizes this event when it receives an incoming message containing its own
process number.
At that point, the message type is changed to COORDINATOR and circulated once again, this
time to inform everyone else who the coordinator is (the list member with the highest
number) and who the members of the new ring are.
When this message has circulated once, it is removed and everyone goes back to work.
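A small simulation of one election pass around the ring (real message passing is replaced by a loop, and the starter is assumed to be alive):

def start_election(processes, alive, starter):
    # processes: process numbers in ring order; alive: the ones still running.
    n = len(processes)
    member_list = []
    i = processes.index(starter)
    while True:
        member_list.append(processes[i])     # add own process number to the message
        j = (i + 1) % n
        while processes[j] not in alive:     # skip over crashed successors
            j = (j + 1) % n
        if processes[j] == starter:          # message has come back to the starter
            break
        i = j
    coordinator = max(member_list)           # the list member with the highest number
    return coordinator, member_list

print(start_election([0, 1, 2, 3, 4], alive={0, 1, 3, 4}, starter=1))
# -> (4, [1, 3, 4, 0]): process 2 was skipped, 4 has the highest number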
ATOMIC TRANSACTIONS
The Transaction Model
o Stable Storage
o Transaction Primitives
o ACID Properties
o Nested Transaction
Implementation
o Private Workspace
o Writeahead Log
o Two-phase Commit Protocol
Concurrency Control
o Locking
o Two-phase locking
o Optimistic Concurrency Control
o Timestamps
A transaction is a higher-level abstraction that hides technical issues (like mutual exclusion,
critical region management, deadlock prevention, and crash recovery) and allows the
programmer to concentrate on the algorithms and how the processes work together in parallel.
Example banking application:
o Withdraw(amount, account1).
o Deposit(amount, account2).
If the telephone connection is broken after the first one but before the second one, the first
account will have been debited but the second one will not have been credited.
The money vanishes into thin air
The key is rolling back to the initial state if the transaction fails to complete
RAM memory
o wiped out when the power fails or a machine crashes
Disk storage
o survives CPU failures but can be lost in disk head crashes
Stable storage
o designed to survive anything except major calamities such as floods and earthquakes
o can be implemented with a pair of ordinary disks
Transaction Primitives
1. BEGIN_TRANSACTION: Mark the start of a transaction.
2. END_TRANSACTION: Terminate the transaction and try to commit.
3. ABORT_TRANSACTION: Kill the transaction; restore the old values.
4. READ: Read data from a file (or other object).
5. WRITE: Write data to a file (or other object).
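A sketch of how these primitives combine for the banking example above; tx is an assumed transaction object exposing the five primitives:

def transfer(tx, amount, account1, account2):
    tx.begin_transaction()
    try:
        tx.write(account1, tx.read(account1) - amount)  # withdraw(amount, account1)
        tx.write(account2, tx.read(account2) + amount)  # deposit(amount, account2)
        tx.end_transaction()        # try to commit: both updates happen, or neither
    except Exception:
        tx.abort_transaction()      # restore the old values
        raise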
Nested Transactions
Transactions may contain subtransactions, often called nested transactions.
The top-level transaction may fork off children that run in parallel with one another, on
different processors, to gain performance or simplify programming
Implementation
Private Workspace
when a process starts a transaction, it is given a private workspace containing all the files
(and other objects) to which it has access.
Until the transaction either commits or aborts, all of its reads and writes go to the private
workspace, rather than the "real" one, by which we mean the normal file system
Writeahead Log
sometimes called an intentions list
files are actually modified in place, but before any block is changed, a record is written to the
writeahead log on stable storage telling
o which transaction is making the change,
o which file and block is being changed
o what the old and new values are.
Only after the log has been written successfully is the change made to the file.
If the transaction succeeds and is committed, a commit record is written to the log
If the transaction aborts, the log can be used to back up to the original state (rollback)
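A minimal in-memory sketch of the idea (a Python list stands in for the log on stable storage; the record fields follow the three bullets above):

log = []                     # stand-in for the writeahead log on stable storage
data = {"x": 0, "y": 0}

def write(tid, key, new_value):
    log.append({"tid": tid, "key": key, "old": data[key], "new": new_value})
    data[key] = new_value    # change the data only after the log record is written

def rollback(tid):
    # Use the log to back up to the original state, newest record first.
    for record in reversed([r for r in log if r["tid"] == tid]):
        data[record["key"]] = record["old"]

write(1, "x", 3)
write(1, "y", 7)
rollback(1)
print(data)                  # back to {'x': 0, 'y': 0}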
Concurrency Control
When multiple transactions are executing simultaneously in different processes (on different
processors), some mechanism is needed to keep them out of each other's way.
Locking
when a process needs to read or write a file (or other object) as part of a transaction, it first
locks the file.
the lock manager maintains a list of locked files, and rejects all attempts to lock files that are
already locked by another process
The issue of how large an item to lock is called the granularity of locking.
The finer the granularity, the more precise the lock can be, and the more parallelism can be
achieved
Two-phase locking
in the first (growing) phase the transaction acquires all the locks it needs; in the second
(shrinking) phase it releases them, and no new lock is acquired after the first release
Timestamps
assign each transaction a timestamp at the moment it does BEGIN_TRANSACTION
Every file in the system has a read timestamp and a write timestamp associated with it
DEADLOCKS
Centralized Deadlock Detection
Consider a system with processes A and B running on machine 0, and process C running on
machine 1.
Three resources exist: R, S, and T.
A holds S but wants R, which it cannot have because B is using it
C has T and wants S
As soon as B finishes, A can get R and finish, releasing S for C
This configuration is safe.
After a while, B releases R and asks for T, a perfectly legal and safe swap.
Machine 0 sends a message to the coordinator announcing the release of R
Machine 1 sends a message to the coordinator announcing the fact that B is now waiting for
its resource, T.
Assume that the message from machine 1 arrives first, leading the coordinator to incorrectly
conclude that a deadlock exists and kill some process
Such a situation is called a false deadlock
Wait-die algorithm
When one process is about to block waiting for a resource that another process is using, a
check is made to see which has a larger timestamp (i.e., is younger).
We can then allow the wait only if the waiting process has a lower timestamp (is older) than
the process waited for.
In this manner, following any chain of waiting processes, the timestamps always increase, so
cycles are impossible.
Wound-wait algorithm
one transaction is supposedly wounded (it is actually killed) and the other waits.
If an old process wants a resource held by a young one, the old process preempts the young
one, whose transaction is then killed
The young one probably starts up again immediately, and tries to acquire the resource,
forcing it to wait.
Compared with Wait-die algorithm,
o if the young one wants a resource held by the old one, the young one is killed.
o It will undoubtedly start up again and be killed again.
o This cycle may go on many times before the old one releases the resource.
Wound-wait does not have this property
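The two rules side by side, as a sketch (the timestamps are the transactions' BEGIN_TRANSACTION timestamps, so smaller means older):

def wait_die(requester_ts, holder_ts):
    # Older (smaller timestamp) requesters may wait; younger ones are killed.
    return "WAIT" if requester_ts < holder_ts else "DIE"

def wound_wait(requester_ts, holder_ts):
    # Older requesters preempt (wound) the holder; younger ones wait.
    return "WOUND_HOLDER" if requester_ts < holder_ts else "WAIT"

# An old transaction (timestamp 10) wants a resource held by a young one (20):
print(wait_die(10, 20), wound_wait(10, 20))    # WAIT  WOUND_HOLDER
# A young transaction (20) wants a resource held by an old one (10):
print(wait_die(20, 10), wound_wait(20, 10))    # DIE   WAIT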
THREADS
Thread Usage
to allow parallelism to be combined with sequential execution and blocking system calls.
a) Dispatcher/worker model.
b) Team model.
c) Pipeline model.
Model Characteristics
Thread management
Two alternatives are possible here,
static threads
o the choice of how many threads there will be is made when the program is written or
when it is compiled.
o Each thread is allocated a fixed stack.
o This approach is simple, but inflexible
dynamic threads
o allow threads to be created and destroyed on-the-fly during execution
o The thread creation call usually specifies the thread's main program (as a pointer to a
procedure) and a stack size, and may specify other parameters as well, for example, a
scheduling priority.
o The call usually returns a thread identifier to be used in subsequent calls involving the
thread
o In this model, a process starts out with one (implicit) thread, but can create one or
more threads as needed, and these can exit when finished
Threads can be terminated in one of two ways.
o A thread can exit voluntarily when it finishes its job,
o it can be killed from outside
Shared Data
data that are shared among multiple threads, such as the buffers in a producer-consumer
system.
Access to shared data is usually programmed using critical regions, to prevent multiple
threads from trying to access the same data at the same time.
Critical regions are most easily implemented using semaphores, monitors, and similar
constructions
Mutex
one of two states, unlocked or locked
Operations: LOCK, UNLOCK, TRYLOCK
mutexes are used for short-term locking, mostly for guarding the entry to critical regions
Condition variable
used for long-term waiting until a resource becomes available
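A sketch of the usual pairing using Python's threading module: the mutex guards the shared buffer for a short time, while the condition variable lets a consumer wait long-term until an item becomes available:

import threading

buffer = []
mutex = threading.Lock()                       # short-term locking around the buffer
item_available = threading.Condition(mutex)    # long-term waiting for an item

def producer():
    with mutex:                          # lock mutex
        buffer.append("item")
        item_available.notify()          # wake up a waiting consumer
                                         # mutex is unlocked on leaving the with-block

def consumer():
    with mutex:                          # lock mutex
        while not buffer:
            item_available.wait()        # releases the mutex while waiting
        return buffer.pop(0)             # safe: the mutex is held again here

t = threading.Thread(target=producer)
t.start()
print(consumer())
t.join()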
Global Variable
variables that are global to a thread but not global to the entire program do cause trouble
Solutions
o prohibit global variables altogether
o assign each thread its own private global variables
Scheduling
Threads can be scheduled using various scheduling algorithms, including priority scheduling,
round robin, and others.
Threads packages often provide calls to give the user the ability to specify the scheduling
algorithm and set the priorities, if any
Threads and RPC
When a server thread, S, starts up, it exports its interface by telling the kernel about it.
The interface defines which procedures are callable, what their parameters are, and so on.
When a client thread C starts up, it imports the interface from the kernel and is given a
special identifier to use for the call.
The kernel now knows that C is going to call S later, and creates special data structures to
prepare for the call.
SYSTEM MODELS
1. Workstation model
2. Processor pool model
3. Hybrid form
Workstation Model
The system consists of workstations (high-end personal computers) scattered throughout a
building or campus and connected by a high-speed LAN.
1. Diskless workstations
a. Do not have local disks
b. File system must be implemented by one or more remote file servers
2. Diskful workstations / Disky workstations
a. Have local disks
When the workstations have private disks, these disks can be used in one of at least four ways
1. Paging and temporary files
2. Paging temporary and system binaries
3. Paging temporary files, system binaries and file caching
4. Complete local file system
Disk use | Advantages | Disadvantages
Diskless | Low cost, easy hardware and software maintenance, symmetry and flexibility | Heavy network usage; file servers may become bottlenecks
Paging, scratch files | Reduces network load over diskless case | Higher cost due to large number of disks needed
Paging, scratch files, binaries, file caching | Still lower network load; reduces load on file servers as well | Higher cost; cache consistency problems
A Hybrid Model
Provide each user with a personal workstation and to have a processor pool in addition
PROCESSOR ALLOCATION
Design considerations
Migratory or Nonmigratory nature of process
CPU Utilization
o Maximize this number
o Make sure that every CPU has something to do
Response time
o Minimize mean response time
Response ratio
o Amount of time it takes to run a process on some machine, divided by how long it
would take on some unloaded benchmark processor.
Centralized versus distributed algorithms
o Collecting all the information in one place allows a better decision to be made, but is
less robust and can put a heavy load on the central machine
Optimal versus suboptimal algorithms
Local versus global algorithms
o transfer policy
Sender-initiated versus receiver-initiated algorithms
o location policy
A Centralized Algorithm
heuristic algorithm that does not require any advance information
called up-down
a coordinator maintains a usage table with one entry per personal workstation
concerned with giving each workstation owner a fair share of the computing power
A Hierarchical Algorithm
organize them in a logical hierarchy independent of the physical structure of the network
Some of the machines are workers and others are managers
For each group of k workers, one manager machine (the "department head") is assigned the
task of keeping track of who is busy and who is idle
If the manager receiving the request thinks that it has too few processors available, it passes
the request upward in the tree to its boss
A Bidding Algorithm
The key players are the processes, which must buy CPU time to get their work done, and the
processors, which auction their cycles off to the highest bidder.
Each processor advertises its approximate price by putting it in a publicly readable file
Different processors may have different prices, depending on their
o speed,
o memory size,
o presence of floating-point hardware
o other features
o indication of the service provided like expected response time
When a process wants to start up a child process, it goes around and checks out who is
currently offering the service that it needs.
It then determines the set of processors whose services it can afford.
From this set, it computes the best candidate, where "best" may mean cheapest, fastest, or
best price/performance, depending on the application.
It then generates a bid and sends the bid to its first choice.
The bid may be higher or lower than the advertised price.
Processors collect all the bids sent to them and make a choice, picking the highest one.
The winners and losers are informed, and the winning process is executed.
The published price of the server is then updated to reflect the new going rate.
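An illustrative sketch of the client side of one bidding round; the data layout and the "cheapest processor that is fast enough" policy are assumptions made for the example:

def place_bid(advertised, budget, need_speed):
    # Keep only the processors whose advertised service the process can afford.
    affordable = {p: info for p, info in advertised.items()
                  if info["price"] <= budget and info["speed"] >= need_speed}
    if not affordable:
        return None
    best = min(affordable, key=lambda p: affordable[p]["price"])
    bid = affordable[best]["price"]      # the bid may also be above or below this
    return best, bid

advertised = {"cpu1": {"price": 5, "speed": 10},
              "cpu2": {"price": 3, "speed": 8},
              "cpu3": {"price": 9, "speed": 20}}
print(place_bid(advertised, budget=6, need_speed=8))   # ('cpu2', 3)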
SCHEDULING IN DISTRIBUTED SYSTEMS
Co-scheduling
takes interprocess communication patterns into account while scheduling, to ensure that all
members of a group run at the same time
(Figure: (a) Two jobs running out of phase with each other. (b) Scheduling matrix for eight
processors, each with six time slots; the Xs indicate allocated slots.)
have each processor use a round-robin scheduling algorithm with all processors first running
the process in slot 0 for a fixed period, then all processors running the process in slot 1 for a
fixed period, and so on.
A broadcast message could be used to tell each processor when to do process switching, to
keep the time slices synchronized.
A variant breaks the matrix into rows and concatenates the rows to form one long row. With k
processors, any k consecutive slots belong to different processors
FAULT TOLERANCE
Component Faults
System Failures
Synchronous versus Asynchronous Systems
Component Faults
A fault is a malfunction, possibly caused by
a design error
a manufacturing error
a programming error,
physical damage,
deterioration in the course of time,
harsh environmental conditions
unexpected inputs,
operator error,
rodents eating part of it, and
many other causes
Classified as
transient
o occur once and then disappear
o A bird flying through the beam of a microwave transmitter
intermittent
o occurs, then vanishes of its own accord, then reappears, and so on
o A loose contact on a connector
permanent
o continues to exist until the faulty component is repaired.
o Burnt-out chips, software bugs, and disk head crashes
System Failures
Two types of processor faults can be distinguished
Fail-silent faults
o a faulty processor just stops and does not respond to subsequent input or produce
further output, except perhaps to announce that it is no longer functioning
o also called fail-stop faults
Byzantine faults
o a faulty processor continues to run, issuing wrong answers to questions, and possibly
working together maliciously
Use of Redundancy
Three kinds are possible:
information redundancy
o extra bits are added to allow recovery from garbled bits
o Ex: Hamming code
time redundancy
o an action is performed, and then, if need be, it is performed again
physical redundancy
o extra equipment is added to make it possible for the system as a whole to tolerate the
loss or malfunctioning of some components
o two ways to organize these extra processors:
active replication
primary backup
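With active replication, the same request is run on several processors and their answers are voted on; a sketch of a majority voter (the function name and the error handling are illustrative):

from collections import Counter

def vote(replies):
    # Majority voting over the answers from actively replicated processors.
    # With triple replication, one Byzantine (wrong) answer is outvoted.
    value, count = Counter(replies).most_common(1)[0]
    if count > len(replies) // 2:
        return value
    raise RuntimeError("no majority - too many faulty replicas")

print(vote([42, 42, 41]))   # 42: the single faulty reply is masked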
REAL-TIME DISTRIBUTED SYSTEMS
DESIGN ISSUES
Clock Synchronization
Same as earlier
Predictability
it should be clear at design time that the system can meet all of its deadlines,
even at peak load
it is often known what the worst-case behavior of these processes is
Fault Tolerance
Many real-time systems control safety-critical devices
Active replication – sometimes used
Primary-backup schemes are less popular because deadlines may be missed during cutover to
the backup
fault-tolerant real-time systems must be able to cope with the maximum number of faults
and the maximum load at the same time
Some real-time systems have the property that they can be stopped cold when a serious
failure occurs. A system that can halt operation like this without danger is said to be fail-safe.
o Ex: if a railroad signaling system unexpectedly blacks out, all trains stop, so the failure is safe
Language Support
specialized real-time languages can potentially be of great assistance
it should be easy to express the work as a collection of short tasks (e.g., lightweight
processes or threads) that can be scheduled independently, subject to user-defined
precedence and mutual exclusion constraints
maximum execution time of every task can be computed at compile time
Recursion cannot be tolerated
need a way to deal with time itself
special variable, clock, should be available, containing the current time in ticks
Range of a 32-bit clock before overflowing for various resolutions
o 1 ns => 4 secs; 1 us => 72 mins; 1 ms => 50 days; 1 sec => 136 years
way to express minimum and maximum delays
way to express what to do if an expected event does not occur within a certain interval
useful to have a statement of the form every(25 msec) { ... } that causes the statements
within the curly brackets to be executed every 25 msec
Real-Time Communication
Time Division Multiple Access (TDMA)
Real-Time Scheduling
Characterized by
Rate monotonic algorithm
designed for preemptively scheduling periodic tasks with no ordering or mutual exclusion
constraints on a single processor
In advance, each task is assigned a priority equal to its execution frequency.
For example, a task run every 20 msec is assigned priority 50 and a task run every 100 msec
is assigned priority 10.
At run time, the scheduler always selects the highest priority task to run, preempting the
current task if need be.
Earliest deadline first
Whenever an event is detected, the scheduler adds it to the list of waiting tasks.
This list is always kept sorted by deadline, closest deadline first.
For a periodic task, the deadline is the next occurrence.
The scheduler then just chooses the first task on the list, the one closest to its deadline
Least laxity
algorithm first computes for each task the amount of time it has to spare, called the laxity
(slack).
o For a task that must finish in 200 msec but has another 150 msec to run, the laxity is
50 msec
This algorithm chooses the task with the least laxity,
o that is, the one with the least breathing room.
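A sketch contrasting the two deadline-driven choices on the example above (task A must finish by 200 msec and still needs 150 msec to run, so its laxity is 50 msec); task B is invented for the comparison:

def earliest_deadline_first(tasks, now):
    # tasks: list of (name, deadline, remaining_run_time); pick the closest deadline.
    return min(tasks, key=lambda t: t[1])

def least_laxity(tasks, now):
    # Pick the task with the least breathing room: laxity = deadline - now - remaining.
    return min(tasks, key=lambda t: t[1] - now - t[2])

tasks = [("A", 200, 150),    # must finish by 200, needs 150 more -> laxity 50
         ("B", 180, 20)]     # must finish by 180, needs 20 more  -> laxity 160
print(earliest_deadline_first(tasks, now=0))   # ('B', 180, 20)
print(least_laxity(tasks, now=0))              # ('A', 200, 150)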
Static Scheduling
Static scheduling is done before the system starts operating.
The input consists of a list of all the tasks and the times that each must run.
The goal is to find an assignment of tasks to processors and for each processor, a static
schedule giving the order in which the tasks are to be run.
Thanks to my family members who supported me while I spent hours and hours to prepare this.
Your feedback is welcome at GHCRajan@gmail.com