Introduction To Data-Link Layer
The TCP/IP protocol suite does not define any protocol in the data-link layer or
physical layer. These two layers are territories of networks that when connected
make up the Internet. These networks, wired or wireless, provide services to the upper
three layers of the TCP/IP suite. This may give us a clue that there are several standard
protocols in the market today. For this reason, we discuss the data-link layer in several
chapters. This chapter is an introduction that gives the general idea and common issues
in the data-link layer that relate to all networks.
❑ The first section introduces the data-link layer. It starts with defining the concept
of links and nodes. The section then lists and briefly describes the services pro-
vided by the data-link layer. It next defines two categories of links: point-to-point
and broadcast links. The section finally defines two sublayers at the data-link layer
that will be elaborated on in the next few chapters.
❑ The second section discusses link-layer addressing. It first explains the rationale
behind the existence of an addressing mechanism at the data-link layer. It then
describes three types of link-layer addresses to be found in some link-layer proto-
cols. The section discusses the Address Resolution Protocol (ARP), which maps
the addresses at the network layer to addresses at the data-link layer. This protocol
helps a packet at the network layer find the link-layer address of the next node for
delivery of the frame that encapsulates the packet. To show how the network layer
helps us to find the data-link-layer addresses, a long example is included in this
section that shows what happens at each node when a packet is travelling through
the Internet.
238 PART III DATA-LINK LAYER
9.1 INTRODUCTION
The Internet is a combination of networks glued together by connecting devices (rout-
ers or switches). If a packet is to travel from a host to another host, it needs to pass
through these networks. Figure 9.1 shows the same scenario we discussed in Chapter 3,
but we are now interested in communication at the data-link layer. Communication at
the data-link layer is made up of five separate logical connections between the data-link
layers in the path.
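[Figure 9.1: the communication path to Bob (at Scientific Books) through a national ISP, an ISP, and a switched WAN, with links to other ISPs. Bob's host shows all five layers (application, transport, network, data-link, physical); routers such as R5 and R7 show only the network, data-link, and physical layers. Legend: point-to-point WAN, LAN switch, WAN switch, router.]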
The data-link layer at Alice’s computer communicates with the data-link layer at router
R2. The data-link layer at router R2 communicates with the data-link layer at router R4,
CHAPTER 9 INTRODUCTION TO DATA-LINK LAYER 239
and so on. Finally, the data-link layer at router R7 communicates with the data-link
layer at Bob’s computer. Only one data-link layer is involved at the source or the desti-
nation, but two data-link layers are involved at each router. The reason is that Alice’s
and Bob’s computers are each connected to a single network, but each router takes
input from one network and sends output to another network. Note that although
switches are also involved in the data-link-layer communication, for simplicity we have
not shown them in the figure.
[Figure: nodes and links along the path; the figure labels two point-to-point networks.]
The first node is the source host; the last node is the destination host. The other
four nodes are four routers. The first, the third, and the fifth links represent the three
LANs; the second and the fourth links represent the two WANs.
9.1.2 Services
The data-link layer is located between the physical and the network layers. The data-
link layer provides services to the network layer; it receives services from the physical
layer. Let us discuss services provided by the data-link layer.
The duty scope of the data-link layer is node-to-node. When a packet is travelling
in the Internet, the data-link layer of a node (host or router) is responsible for delivering
a datagram to the next node in the path. For this purpose, the data-link layer of the
sending node needs to encapsulate the datagram received from the network in a frame,
and the data-link layer of the receiving node needs to decapsulate the datagram from
the frame. In other words, the data-link layer of the source host needs only to
encapsulate, the data-link layer of the destination host needs to decapsulate, but each
intermediate node needs to both encapsulate and decapsulate. One may ask why we
need encapsulation and decapsulation at each intermediate node. The reason is that
each link may be using a different protocol with a different frame format. Even if one
link and the next are using the same protocol, encapsulation and decapsulation are
needed because the link-layer addresses are normally different. An analogy may help in
this case. Assume a person needs to travel from her home to her friend’s home in
another city. The traveller can use three means of transportation. She can take a taxi to go to
the train station in her own city, then travel on the train from her own city to the city
where her friend lives, and finally reach her friend’s home using another taxi. Here we
have a source node, a destination node, and two intermediate nodes. The traveller needs
to get into the taxi at the source node, get out of the taxi and get into the train at the first
intermediate node (train station in the city where she lives), get out of the train and get
into another taxi at the second intermediate node (train station in the city where her
friend lives), and finally get out of the taxi when she arrives at her destination. A kind
of encapsulation occurs at the source node, encapsulation and decapsulation occur at
the intermediate nodes, and decapsulation occurs at the destination node. Our traveller
is the same throughout, but she uses three means of transportation to reach the destination.
Figure 9.3 shows the encapsulation and decapsulation at the data-link layer. For
simplicity, we have assumed that we have only one router between the source and des-
tination. The datagram received by the data-link layer of the source host is encapsulated
in a frame. The frame is logically transported from the source host to the router. The
frame is decapsulated at the data-link layer of the router and the datagram is encapsulated in another
frame. The new frame is logically transported from the router to the destination host.
Note that, although we have shown only two data-link layers at the router, the router
actually has three data-link layers because it is connected to three physical links.
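The encapsulation and decapsulation steps described for Figure 9.3 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Frame` class, the header strings, and the function names are hypothetical and do not correspond to any real link-layer protocol.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    header: str      # hypothetical link-specific header (differs per link)
    datagram: bytes  # network-layer datagram, carried unchanged


def encapsulate(datagram: bytes, header: str) -> Frame:
    """Sending side: wrap the datagram in a frame for one link."""
    return Frame(header=header, datagram=datagram)


def decapsulate(frame: Frame) -> bytes:
    """Receiving side: extract the datagram from the frame."""
    return frame.datagram


def forward_through_router(frame_in: Frame, out_header: str) -> Frame:
    """A router decapsulates on the incoming link and re-encapsulates
    the same datagram for the outgoing link."""
    return encapsulate(decapsulate(frame_in), out_header)


datagram = b"network-layer datagram"
frame1 = encapsulate(datagram, "header for link 1")            # source host
frame2 = forward_through_router(frame1, "header for link 2")   # router
delivered = decapsulate(frame2)                                # destination host
```

The datagram that reaches the destination is the one the source sent; only the frame around it changes from link to link.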
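[Figure 9.3: encapsulation and decapsulation at the data-link layer. The source host, the router, and the destination host each have a data-link layer; a frame consists of a data-link header (labelled 2) followed by the datagram. Legend: actual link, logical link.]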
With the contents of the above figure in mind, we can list the services provided by
a data-link layer as shown below.
Framing
Definitely, the first service provided by the data-link layer is framing. The data-link
layer at each node needs to encapsulate the datagram (packet received from the network
layer) in a frame before sending it to the next node. The node also needs to decapsulate
the datagram from the frame received on the logical channel. Although we have shown
only a header for a frame, we will see in future chapters that a frame may have both a
header and a trailer. Different data-link layers have different formats for framing.
Flow Control
Whenever we have a producer and a consumer, we need to think about flow control. If
the producer produces items that cannot be consumed, accumulation of items occurs.
The sending data-link layer at the end of a link is a producer of frames; the receiving
data-link layer at the other end of a link is a consumer. If the rate of produced frames is
higher than the rate of consumed frames, frames at the receiving end need to be buff-
ered while waiting to be consumed (processed). Definitely, we cannot have an unlim-
ited buffer size at the receiving side. We have two choices. The first choice is to let the
receiving data-link layer drop the frames if its buffer is full. The second choice is to let
the receiving data-link layer send feedback to the sending data-link layer to ask it to
stop or slow down. Different data-link-layer protocols use different strategies for flow
control. Since flow control also occurs at the transport layer, with a higher degree of
importance, we discuss this issue in Chapter 23 when we talk about the transport layer.
Error Control
At the sending node, a frame in a data-link layer needs to be changed to bits, trans-
formed to electromagnetic signals, and transmitted through the transmission media. At
the receiving node, electromagnetic signals are received, transformed to bits, and put
together to create a frame. Since electromagnetic signals are susceptible to error, a
frame is susceptible to error. The error needs first to be detected. After detection, it
needs to be either corrected at the receiver node or discarded and retransmitted by the
sending node. Since error detection and correction is an issue in every layer (node-to-
node or host-to-host), we have dedicated all of Chapter 10 to this issue.
Congestion Control
Although a link may be congested with frames, which may result in frame loss, most
data-link-layer protocols do not directly use congestion control to alleviate congestion,
although some wide-area networks do. In general, congestion control is considered an
issue in the network layer or the transport layer because of its end-to-end nature. We will
discuss congestion control in the network layer and the transport layer in later chapters.
have a data-link layer that uses only part of the capacity of the link. In other words, we
can have a point-to-point link or a broadcast link. In a point-to-point link, the link is
dedicated to the two devices; in a broadcast link, the link is shared between several
pairs of devices. For example, when two friends use the traditional home phones to
chat, they are using a point-to-point link; when the same two friends use their cellular
phones, they are using a broadcast link (the air is shared among many cell phone users).
We discuss the DLC and MAC sublayers later, each in a separate chapter. In addi-
tion, we discuss the issue of error detection and correction, a duty of the data-link and
other layers, also in a separate chapter.
The above discussion shows that we need another addressing mechanism in a con-
nectionless internetwork: the link-layer addresses of the two nodes. A link-layer
address is sometimes called a link address, sometimes a physical address, and some-
times a MAC address. We use these terms interchangeably in this book.
Since a link is controlled at the data-link layer, the addresses need to belong to the
data-link layer. When a datagram passes from the network layer to the data-link layer,
the datagram will be encapsulated in a frame and two data-link addresses are added to
the frame header. These two addresses are changed every time the frame moves from
one link to another. Figure 9.5 demonstrates the concept in a small internet.
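[Figure 9.5: IP addresses and link-layer addresses in a small internet.
Legend: N = IP address, L = link-layer address. In a datagram, the IP addresses are ordered source-destination; in a frame, the link-layer addresses are ordered destination-source.
Alice (N1, L1) is on link 1 with router R1 (N2, L2). R1 also has the interfaces (N3, L3), leading to another link, and (N4, L4) on link 2. Router R2 has the interfaces (N5, L5) on link 2, (N6, L6), leading to another network, and (N7, L7) on link 3. Bob (N8, L8) is on link 3.
Frame on link 1: L2 | L1 | N1 | N8 | Data
Frame on link 2: L5 | L4 | N1 | N8 | Data
Frame on link 3: L8 | L7 | N1 | N8 | Data]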
In the internet in Figure 9.5, we have three links and two routers. We also have
shown only two hosts: Alice (source) and Bob (destination). For each host, we have
shown two addresses, the IP addresses (N) and the link-layer addresses (L). Note
that a router has as many pairs of addresses as the number of links the router is con-
nected to. We have shown three frames, one in each link. Each frame carries the
same datagram with the same source and destination addresses (N1 and N8), but the
link-layer addresses of the frame change from link to link. In link 1, the link-layer
addresses are L1 and L2. In link 2, they are L4 and L5. In link 3, they are L7 and L8.
Note that the IP addresses and the link-layer addresses are not in the same order. For
IP addresses, the source address comes before the destination address; for link-layer
addresses, the destination address comes before the source. The datagrams and
frames are designed in this way, and we follow the design. We may raise several
questions:
❑ If the IP address of a router does not appear in any datagram sent from a source to a
destination, why do we need to assign IP addresses to routers? The answer is that in
some protocols a router may act as a sender or receiver of a datagram. For example,
in routing protocols we will discuss in Chapters 20 and 21, a router is a sender or a
receiver of a message. The communications in these protocols are between routers.
❑ Why do we need more than one IP address in a router, one for each interface? The
answer is that an interface is a connection of a router to a link. We will see that an
IP address defines a point in the Internet at which a device is connected. A router
with n interfaces is connected to the Internet at n points. This is the situation of a
house at the corner of a street with two gates; each gate has the address related to
the corresponding street.
❑ How are the source and destination IP addresses in a packet determined? The
answer is that the host should know its own IP address, which becomes the source
IP address in the packet. As we will discuss in Chapter 26, the application layer
uses the services of DNS to find the destination address of the packet and passes it
to the network layer to be inserted in the packet.
❑ How are the source and destination link-layer addresses determined for each link?
Again, each hop (router or host) should know its own link-layer address, as we dis-
cuss later in the chapter. The destination link-layer address is determined by using
the Address Resolution Protocol, which we discuss shortly.
❑ What is the size of link-layer addresses? The answer is that it depends on the protocol
used by the link. Although we have only one IP protocol for the whole Internet, we
may be using different data-link protocols in different links. This means that we can
define the size of the address when we discuss different link-layer protocols.
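The hop-by-hop change of link-layer addresses in Figure 9.5 can be made concrete with a short sketch. The symbolic addresses (N1, L1, and so on) come from the figure; the tuple layout and the function name are assumptions made only for this illustration.

```python
# Link-layer address pairs (destination, source) for the three links,
# taken from the description of Figure 9.5.
LINK_ADDRESSES = [("L2", "L1"), ("L5", "L4"), ("L8", "L7")]


def frames_along_path(source_ip, destination_ip, data, link_addresses):
    """Return one frame per link as (dest L, src L, src N, dest N, data)."""
    frames = []
    for destination_link, source_link in link_addresses:
        # The datagram part (source_ip, destination_ip, data) never changes;
        # only the two link-layer addresses in the frame header do.
        frames.append(
            (destination_link, source_link, source_ip, destination_ip, data)
        )
    return frames


frames = frames_along_path("N1", "N8", "Data", LINK_ADDRESSES)
for number, frame in enumerate(frames, start=1):
    print(f"link {number}:", " | ".join(frame))
```

Each printed frame carries the same N1, N8, and data, while the leading pair of link-layer addresses is different on every link.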
9.2.1 Three Types of Addresses
Some link-layer protocols define three types of addresses: unicast, multicast, and
broadcast.
Unicast Address
Each host or each interface of a router is assigned a unicast address. Unicasting means
one-to-one communication. A frame with a unicast destination address is destined only
for one entity in the link.
Example 9.1
As we will see in Chapter 13, the unicast link-layer addresses in the most common LAN,
Ethernet, are 48 bits (six bytes) that are presented as 12 hexadecimal digits separated by colons.
The second digit needs to be an even number in hexadecimal; for example, the following is a
link-layer address of a computer.
A2:34:45:11:92:F1
Multicast Address
Some link-layer protocols define multicast addresses. Multicasting means one-to-many
communication. However, the jurisdiction is local (inside the link).
Example 9.2
As we will see in Chapter 13, the multicast link-layer addresses in the most common LAN,
Ethernet, are 48 bits (six bytes) that are presented as 12 hexadecimal digits separated by colons.
The second digit, however, needs to be an odd number in hexadecimal. The following shows a
multicast address:
A3:34:45:11:92:F1
Broadcast Address
Some link-layer protocols define a broadcast address. Broadcasting means one-to-all
communication. A frame with a destination broadcast address is sent to all entities in
the link.
Example 9.3
As we will see in Chapter 13, the broadcast link-layer addresses in the most common LAN,
Ethernet, are 48 bits, all 1s, that are presented as 12 hexadecimal digits separated by colons. The
following shows a broadcast address:
FF:FF:FF:FF:FF:FF
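The three Ethernet address types can be told apart mechanically. The sketch below assumes the IEEE 802 convention that the least significant bit of the first byte is 0 for a unicast address and 1 for a group (multicast) address, with all 1s reserved for broadcast; the function name is hypothetical.

```python
def classify_ethernet_address(address: str) -> str:
    """Classify a colon-separated 48-bit Ethernet address."""
    octets = [int(part, 16) for part in address.split(":")]
    if len(octets) != 6 or any(not 0 <= octet <= 0xFF for octet in octets):
        raise ValueError(f"not a 48-bit address: {address!r}")
    if all(octet == 0xFF for octet in octets):
        return "broadcast"
    # Least significant bit of the first byte: 1 means a group address.
    return "multicast" if octets[0] & 1 else "unicast"


print(classify_ethernet_address("A2:34:45:11:92:F1"))  # unicast
print(classify_ethernet_address("A3:34:45:11:92:F1"))  # multicast
print(classify_ethernet_address("FF:FF:FF:FF:FF:FF"))  # broadcast
```

Testing the low bit of the first byte is the same as checking whether the second hexadecimal digit is even or odd.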
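[Figure: position of ARP in the TCP/IP protocol suite. ARP sits in the network layer alongside ICMP, IGMP, and IP; it takes an IP address and returns the corresponding link-layer address.]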
Anytime a host or a router needs to find the link-layer address of another host or router
in its network, it sends an ARP request packet. The packet includes the link-layer and IP
addresses of the sender and the IP address of the receiver. Because the sender does not
know the link-layer address of the receiver, the query is broadcast over the link using the
link-layer broadcast address, which we discuss for each protocol later (see Figure 9.7).
[Figure 9.7: ARP operation on a LAN with four nodes: System A (N1, L1), System B (N2, L2), and two other nodes (N3, L3) and (N4, L4).
a. ARP request is broadcast. Request: "Looking for link-layer address of a node with IP address N2."
b. ARP reply is unicast. Reply: "I am the node and my link-layer address is L2."]
Every host or router on the network receives and processes the ARP request
packet, but only the intended recipient recognizes its IP address and sends back an ARP
response packet. The response packet contains the recipient’s IP and link-layer
addresses. The packet is unicast directly to the node that sent the request packet.
In Figure 9.7a, the system on the left (A) has a packet that needs to be delivered
to another system (B) with IP address N2. System A needs to pass the packet to its
data-link layer for the actual delivery, but it does not know the physical address of
the recipient. It uses the services of ARP by asking the ARP protocol to send a
broadcast ARP request packet to ask for the physical address of a system with an IP
address of N2.
This packet is received by every system on the physical network, but only system B
will answer it, as shown in Figure 9.7b. System B sends an ARP reply packet that
includes its physical address. Now system A can send all the packets it has for this des-
tination using the physical address it received.
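The request and reply exchange of Figure 9.7 can be imitated with a small simulation. This is a sketch, not an ARP implementation: the `Node` class, the dictionary used for the reply, and the `resolve` function are hypothetical, and the broadcast is modelled simply as a loop over every other node on the LAN.

```python
from typing import Optional


class Node:
    def __init__(self, ip: str, link_address: str):
        self.ip = ip
        self.link_address = link_address

    def handle_arp_request(self, sender_ip: str, sender_link: str,
                           target_ip: str) -> Optional[dict]:
        """Every node processes the request; only the target replies."""
        if target_ip != self.ip:
            return None  # not the intended recipient: no reply
        return {
            "sender_ip": self.ip,
            "sender_link": self.link_address,  # the answer being sought
            "target_ip": sender_ip,
            "target_link": sender_link,        # reply is unicast to requester
        }


def resolve(requester: Node, target_ip: str, lan: list) -> Optional[str]:
    """Broadcast a request on the LAN and return the link-layer address
    carried in the reply, or None if no node owns target_ip."""
    for node in lan:
        if node is requester:
            continue
        reply = node.handle_arp_request(
            requester.ip, requester.link_address, target_ip
        )
        if reply is not None:
            return reply["sender_link"]
    return None


lan = [Node("N1", "L1"), Node("N2", "L2"), Node("N3", "L3"), Node("N4", "L4")]
system_a = lan[0]
print(resolve(system_a, "N2", lan))  # L2
```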
Caching
A question that is often asked is this: If system A can broadcast a frame to find the link-
layer address of system B, why can’t system A send the datagram for system B using a
broadcast frame? In other words, instead of sending one broadcast frame (ARP
request), one unicast frame (ARP response), and another unicast frame (for sending the
datagram), system A could encapsulate the datagram in a broadcast frame and send it to the
network. System B would receive it and keep it; other systems would discard it.
To answer the question, we need to think about the efficiency. It is probable that
system A has more than one datagram to send to system B in a short period of time. For
example, if system B is supposed to receive a long e-mail or a long file, the data do not
fit in one datagram.
Let us assume that there are 20 systems connected to the network (link): system A,
system B, and 18 other systems. We also assume that system A has 10 datagrams to
send to system B in one second.
a. Without using ARP, system A needs to send 10 broadcast frames. Each of the
18 other systems needs to receive the frames, decapsulate them, remove
the datagram, and pass it to its network layer, only to find out that the
datagrams do not belong to it. This means processing and discarding 180
broadcast frames.
b. Using ARP, system A needs to send only one broadcast frame. Each of the 18
other systems needs to receive the frame, decapsulate it, remove the
ARP message, and pass the message to its ARP protocol, only to find that the frame
must be discarded. This means processing and discarding only 18 (instead of
180) broadcast frames. After system B responds with its own data-link address,
system A can store the link-layer address in its cache memory. The 10
datagrams are then sent in unicast frames. Since processing broadcast frames is
expensive (time consuming), the second method is preferable.
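The arithmetic behind the two cases is simple enough to check directly. In the sketch below the function name is an assumption; the numbers (18 other systems, 10 datagrams) are the ones used in the text.

```python
def broadcast_frames_processed(other_systems: int, broadcasts_sent: int) -> int:
    """Every broadcast frame is received and processed by every other system."""
    return other_systems * broadcasts_sent


# a. No ARP: each of the 10 datagrams travels in its own broadcast frame.
without_arp = broadcast_frames_processed(other_systems=18, broadcasts_sent=10)

# b. With ARP: only the single ARP request is broadcast.
with_arp = broadcast_frames_processed(other_systems=18, broadcasts_sent=1)

print(without_arp, with_arp)  # 180 18
```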
Packet Format
Figure 9.8 shows the format of an ARP packet. The names of the fields are self-
explanatory. The hardware type field defines the type of the link-layer protocol; Ethernet
is given the type 1. The protocol type field defines the network-layer protocol: the IPv4
protocol is 0x0800. The source hardware and source protocol addresses are variable-length
fields defining the link-layer and network-layer addresses of the sender. The destination
hardware address and destination protocol address fields define the receiver link-layer
and network-layer addresses. An ARP packet is encapsulated directly into a data-link
frame. The frame needs to have a field to show that the payload belongs to the ARP and
not to the network-layer datagram.
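For the common case of Ethernet and IPv4, the fields listed above can be packed into bytes as follows. This is a sketch under stated assumptions: the function name and the sample addresses are hypothetical, the hardware and protocol address lengths are fixed at 6 and 4 bytes, and the unknown destination hardware address is filled with zeros.

```python
import struct


def build_arp_request(sender_mac: bytes, sender_ip: bytes,
                      target_ip: bytes) -> bytes:
    """Pack an ARP request for Ethernet (hardware type 1) and IPv4 (0x0800)."""
    if len(sender_mac) != 6 or len(sender_ip) != 4 or len(target_ip) != 4:
        raise ValueError("expected a 6-byte MAC and 4-byte IPv4 addresses")
    fixed_part = struct.pack(
        "!HHBBH",   # network byte order: two 16-bit, two 8-bit, one 16-bit
        0x0001,     # hardware type: Ethernet
        0x0800,     # protocol type: IPv4
        6,          # hardware address length in bytes
        4,          # protocol address length in bytes
        1,          # operation: 1 = request, 2 = reply
    )
    unknown_target_mac = bytes(6)  # all 0s: not yet known to the sender
    return fixed_part + sender_mac + sender_ip + unknown_target_mac + target_ip


# Sample addresses chosen only for illustration.
packet = build_arp_request(
    sender_mac=bytes.fromhex("a234451192f1"),
    sender_ip=bytes([192, 0, 2, 1]),
    target_ip=bytes([192, 0, 2, 2]),
)
print(len(packet), packet.hex())
```

With these lengths the packet is 28 bytes long: 8 bytes of fixed fields followed by the four addresses.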
Example 9.4
A host with IP address N1 and MAC address L1 has a packet to send to another host with IP address
N2 and physical address L2 (which is unknown to the first host). The two hosts are on the same net-
work. Figure 9.9 shows the ARP request and response messages.
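[Figure 9.8: ARP packet format, drawn in 32-bit rows (bit positions 0, 8, 16, 31).
Row 1: Hardware type | Protocol type
Row 2: Hardware length | Protocol length | Operation (request: 1, reply: 2)
Following rows: source hardware address, source protocol address, destination hardware address, destination protocol address.
Hardware: LAN or WAN protocol. Protocol: network-layer protocol.]
[Figure 9.9: Example 9.4. System A (N1, L1) sends an ARP request to System B (N2; L2 is not known by A).
ARP request fields: hardware type 0x0001, protocol type 0x0800, hardware length 0x06, protocol length 0x04, operation 0x0001, source hardware address L1, source protocol address N1, destination hardware address all 0s, destination protocol address N2.
Frame 1 (labelled a multicast frame in the figure), from A to B: destination M, source A, data.]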
Data Link Control (DLC)
start or end of a frame. Figure 11.1 shows the format of a frame in a character-oriented
protocol.
Character-oriented framing was popular when only text was exchanged by the
data-link layers. The flag could be selected to be any character not used for text com-
munication. Now, however, we send other types of information such as graphs, audio,
and video; any character used for the flag could also be part of the information. If this
happens, the receiver, when it encounters this pattern in the middle of the data, thinks it
has reached the end of the frame. To fix this problem, a byte-stuffing strategy was
added to character-oriented framing. In byte stuffing (or character stuffing), a special
byte is added to the data section of the frame when there is a character with the same
pattern as the flag. The data section is stuffed with an extra byte. This byte is usually
called the escape character (ESC) and has a predefined bit pattern. Whenever the
receiver encounters the ESC character, it removes it from the data section and treats the
next character as data, not as a delimiting flag. Figure 11.2 shows the situation.
[Figure 11.2: byte stuffing and unstuffing; each stuffed escape character is marked as an extra byte.]
Byte stuffing by the escape character allows the presence of the flag in the data
section of the frame, but it creates another problem. What happens if the text contains
one or more escape characters followed by a byte with the same pattern as the flag? The
receiver removes the escape character, but keeps the next byte, which is incorrectly
interpreted as the end of the frame. To solve this problem, the escape characters that are
part of the text must also be marked by another escape character. In other words, if the
escape character is part of the text, an extra one is added to show that the second one is
part of the text.
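Byte stuffing as described above can be sketched as follows. The byte values chosen for the flag and the escape character are assumptions (the text does not fix them), and the function names are hypothetical. The sketch follows the scheme in the text, where an ESC is inserted before any data byte equal to the flag or to ESC itself and the data byte is left unchanged.

```python
FLAG = 0x7E  # assumed flag value, for illustration only
ESC = 0x7D   # assumed escape value, for illustration only


def byte_stuff(data: bytes) -> bytes:
    """Sender: insert ESC before every data byte that equals FLAG or ESC."""
    stuffed = bytearray()
    for byte in data:
        if byte in (FLAG, ESC):
            stuffed.append(ESC)
        stuffed.append(byte)
    return bytes(stuffed)


def byte_unstuff(stuffed: bytes) -> bytes:
    """Receiver: drop each ESC and keep the byte after it as plain data."""
    data = bytearray()
    escaped = False
    for byte in stuffed:
        if escaped:
            data.append(byte)  # taken literally, even if it is FLAG or ESC
            escaped = False
        elif byte == ESC:
            escaped = True     # remove the ESC, remember to keep next byte
        else:
            data.append(byte)
    if escaped:
        raise ValueError("data section ends with a dangling ESC")
    return bytes(data)


payload = bytes([0x41, FLAG, ESC, 0x42])
stuffed = byte_stuff(payload)
frame = bytes([FLAG]) + stuffed + bytes([FLAG])  # flags delimit the frame
print(stuffed.hex(), byte_unstuff(stuffed) == payload)
```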
Character-oriented protocols present another problem in data communications.
The universal coding systems in use today, such as Unicode, have 16-bit and 32-bit
characters that conflict with 8-bit characters. We can say that, in general, the tendency
is moving toward the bit-oriented protocols that we discuss next.
Bit-Oriented Framing
In bit-oriented framing, the data section of a frame is a sequence of bits to be interpreted by
the upper layer as text, graphic, audio, video, and so on. However, in addition to headers
(and possible trailers), we still need a delimiter to separate one frame from the other. Most
protocols use a special 8-bit pattern flag, 01111110, as the delimiter to define the begin-
ning and the end of the frame, as shown in Figure 11.3.
This flag can create the same type of problem we saw in the character-oriented
protocols. That is, if the flag pattern appears in the data, we need to somehow inform
the receiver that this is not the end of the frame. We do this by stuffing 1 single bit
(instead of 1 byte) to prevent the pattern from looking like a flag. The strategy is called
bit stuffing. In bit stuffing, if a 0 and five consecutive 1 bits are encountered, an extra
0 is added. This extra stuffed bit is eventually removed from the data by the receiver.
Note that the extra bit is added after one 0 followed by five 1s regardless of the value of
the next bit. This guarantees that the flag field sequence does not inadvertently appear
in the frame.
Bit stuffing is the process of adding one extra 0 whenever five consecutive 1s follow a 0
in the data, so that the receiver does not mistake the pattern 01111110 for a flag.
Figure 11.4 shows bit stuffing at the sender and bit removal at the receiver. Note that
even if we have a 0 after five 1s, we still stuff a 0. The 0 will be removed by the receiver.
This means that if the flaglike pattern 01111110 appears in the data, it will change
to 011111010 (stuffed) and is not mistaken for a flag by the receiver. The real flag
01111110 is not stuffed by the sender and is recognized by the receiver.
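Bit stuffing and unstuffing can be sketched on strings of "0" and "1" characters. This is an illustration only: real implementations work on bit streams in hardware, the function names are hypothetical, and the sketch stuffs after any run of five 1s, which covers the case of a 0 followed by five 1s described in the text.

```python
def bit_stuff(bits: str) -> str:
    """Sender: insert a 0 after every run of five consecutive 1s."""
    out = []
    ones = 0
    for bit in bits:
        if bit not in "01":
            raise ValueError(f"not a bit: {bit!r}")
        out.append(bit)
        ones = ones + 1 if bit == "1" else 0
        if ones == 5:
            out.append("0")  # stuffed bit, added whatever the next bit is
            ones = 0
    return "".join(out)


def bit_unstuff(bits: str) -> str:
    """Receiver: remove the 0 that follows every run of five 1s."""
    out = []
    ones = 0
    drop_next = False
    for bit in bits:
        if drop_next:
            if bit != "0":
                raise ValueError("six consecutive 1s: not stuffed data")
            drop_next = False
            ones = 0
            continue
        out.append(bit)
        ones = ones + 1 if bit == "1" else 0
        if ones == 5:
            drop_next = True
    return "".join(out)


data = "0001111111001111101000"
sent = bit_stuff(data)
print(sent)                   # 000111110110011111001000
print(bit_stuff("01111110"))  # 011111010
```

The first value printed is the stuffed data section of Figure 11.4, with two extra bits; the second is the stuffed form of a flaglike pattern in the data.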
CHAPTER 11 DATA LINK CONTROL (DLC) 297
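[Figure 11.4: bit stuffing and unstuffing.
Data from upper layer: 0001111111001111101000
Frame sent (stuffed): Flag | Header | 000111110110011111001000 | Trailer | Flag (two extra bits)
Frame received: Flag | Header | 000111110110011111001000 | Trailer | Flag
Data to upper layer (unstuffed): 0001111111001111101000]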
[Figure: flow control at the data-link layer.]
The figure shows that the data-link layer at the sending node tries to push frames
toward the data-link layer at the receiving node. If the receiving node cannot process
and deliver the packet to its network layer at the same rate that the frames arrive, it becomes
overwhelmed with frames. Flow control in this case can be feedback from the receiving
node to the sending node to stop or slow down pushing frames.
Buffers
Although flow control can be implemented in several ways, one common solution is
to use two buffers, one at the sending data-link layer and the other at the receiving
data-link layer. A buffer is a set of memory locations that can hold packets at the
sender and receiver. The flow control communication can occur by sending signals
from the consumer to the producer. When the buffer of the receiving data-link layer is
full, it informs the sending data-link layer to stop pushing frames.
Example 11.1
The above discussion requires that the consumers communicate with the producers on two
occasions: when the buffer is full and when there are vacancies. If the two parties use a buffer
with only one slot, the communication can be easier. Assume that each data-link layer uses one
single memory slot to hold a frame. When this single slot in the receiving data-link layer is
empty, it sends a note to the sending data-link layer to send the next frame.
Error Control
Since the underlying technology at the physical layer is not fully reliable, we need to
implement error control at the data-link layer to prevent the receiving node from deliver-
ing corrupted packets to its network layer. Error control at the data-link layer is normally
very simple and implemented using one of the following two methods. In both methods, a
CRC is added to the frame by the sender and checked by the receiver.
❑ In the first method, if the frame is corrupted, it is silently discarded; if it is not cor-
rupted, the packet is delivered to the network layer. This method is used mostly in
wired LANs such as Ethernet.
❑ In the second method, if the frame is corrupted, it is silently discarded; if it is not
corrupted, an acknowledgment is sent (for the purpose of both flow and error con-
trol) to the sender.
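The first method (check the CRC and silently discard a corrupted frame) can be sketched with the CRC-32 function in Python's standard `zlib` module. The frame layout is an assumption made for this sketch: the 4-byte CRC is simply appended after the packet, and the function names are hypothetical.

```python
import zlib
from typing import Optional


def make_frame(packet: bytes) -> bytes:
    """Sender: append a 4-byte CRC-32 computed over the packet."""
    crc = zlib.crc32(packet)
    return packet + crc.to_bytes(4, "big")


def receive_frame(frame: bytes) -> Optional[bytes]:
    """Receiver: return the packet if the CRC matches, else None
    (the corrupted frame is silently discarded)."""
    if len(frame) < 4:
        return None
    packet, received_crc = frame[:-4], int.from_bytes(frame[-4:], "big")
    if zlib.crc32(packet) != received_crc:
        return None
    return packet


frame = make_frame(b"datagram")
corrupted = bytearray(frame)
corrupted[0] ^= 0x01  # flip one bit, as a transmission error might

print(receive_frame(frame))             # b'datagram'
print(receive_frame(bytes(corrupted)))  # None
```

The second method would differ only in what happens after a successful check: the receiver would also send an acknowledgment back to the sender.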
Combination of Flow and Error Control
Flow and error control can be combined. In a simple situation, the acknowledgment that
is sent for flow control can also be used for error control to tell the sender the packet has
arrived uncorrupted. The lack of acknowledgment means that there is a problem in the
sent frame. We show this situation when we discuss some simple protocols in the next
section. A frame that carries an acknowledgment is normally called an ACK to distin-
guish it from the data frame.
Connectionless Protocol
In a connectionless protocol, frames are sent from one node to the next without any
relationship between the frames; each frame is independent. Note that the term connec-
tionless here does not mean that there is no physical connection (transmission medium)
between the nodes; it means that there is no connection between frames. The frames are
not numbered and there is no sense of ordering. Most of the data-link protocols for
LANs are connectionless protocols.
Connection-Oriented Protocol
In a connection-oriented protocol, a logical connection should first be established
between the two nodes (setup phase). After all frames that are somehow related to each
other are transmitted (transfer phase), the logical connection is terminated (teardown
phase). In this type of communication, the frames are numbered and sent in order. If
they are not received in order, the receiver needs to wait until all frames belonging to the
same set are received and then deliver them in order to the network layer. Connection-
oriented protocols are rare in wired LANs, but we can see them in some point-to-point
protocols, some wireless LANs, and some WANs.
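[Figure: a finite state machine with two states, State I and State II, and three events (Event 1 with Actions 1 and 2, Event 2 with Action 3, and Event 3). Note: the colored arrow shows the starting state.]
[Figure: the simple protocol. The network and data-link layers of the sending node and the receiving node are shown; a frame travels over the logical link between the two data-link layers.]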
The data-link layer at the sender gets a packet from its network layer, makes a
frame out of it, and sends the frame. The data-link layer at the receiver receives a frame
from the link, extracts the packet from the frame, and delivers the packet to its network
layer. The data-link layers of the sender and receiver provide transmission services for
their network layers.
FSMs
The sender site should not send a frame until its network layer has a message to send.
The receiver site cannot deliver a message to its network layer until a frame arrives. We
can show these requirements using two FSMs. Each FSM has only one state, the ready
state. The sending machine remains in the ready state until a request comes from the
process in the network layer. When this event occurs, the sending machine encapsulates
the message in a frame and sends it to the receiving machine. The receiving machine
remains in the ready state until a frame arrives from the sending machine. When this
event occurs, the receiving machine decapsulates the message out of the frame and
delivers it to the process at the network layer. Figure 11.8 shows the FSMs for the sim-
ple protocol. We’ll see more in Chapter 23, which uses this protocol.
[Figure 11.8: FSMs for the simple protocol. The sending node and the receiving node each have a single state, ready, which is also the starting state.]
Example 11.2
Figure 11.9 shows an example of communication using this protocol. It is very simple. The
sender sends frames one after another without even thinking about the receiver.
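[Figure 11.9: flow diagram for Example 11.2. Each packet from the sender's network layer travels as a frame and is delivered as a packet at the receiver.]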
acknowledgment arrives, the sender discards the copy and sends the next frame if it is
ready. Figure 11.10 shows the outline for the Stop-and-Wait protocol. Note that only
one frame and one acknowledgment can be in the channels at any time.
[Figure 11.10: outline of the Stop-and-Wait protocol between the two data-link layers.]
FSMs
Figure 11.11 shows the FSMs for our primitive Stop-and-Wait protocol.
[Figure 11.11: FSMs for the Stop-and-Wait protocol.
Sending node (states: ready, blocking):
  Packet came from network layer: make a frame, save a copy, and send the frame; start the timer.
  Time-out: resend the saved frame; restart the timer.
Receiving node: single ready state.]
❑ Ready State. When the sender is in this state, it is only waiting for a packet from
the network layer. If a packet comes from the network layer, the sender creates a
frame, saves a copy of the frame, starts the only timer and sends the frame. The
sender then moves to the blocking state.
❑ Blocking State. When the sender is in this state, three events can occur:
a. If a time-out occurs, the sender resends the saved copy of the frame and restarts
the timer.
b. If a corrupted ACK arrives, it is discarded.
c. If an error-free ACK arrives, the sender stops the timer and discards the saved
copy of the frame. It then moves to the ready state.
Receiver
The receiver is always in the ready state. Two events may occur:
a. If an error-free frame arrives, the message in the frame is delivered to the net-
work layer and an ACK is sent.
b. If a corrupted frame arrives, the frame is discarded.
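The sender states described above can be sketched as a small state machine. This is only an illustration under our own assumptions: the class name, the `send` callback, and the tuple used as a frame are hypothetical, and the timer is reduced to a flag.

```python
from enum import Enum, auto


class State(Enum):
    READY = auto()
    BLOCKING = auto()


class StopAndWaitSender:
    """Sender FSM for the primitive Stop-and-Wait protocol (no sequence numbers)."""

    def __init__(self, send):
        self.state = State.READY
        self.saved = None            # copy of the outstanding frame
        self.timer_running = False   # the timer is modeled as a flag only
        self.send = send             # hypothetical callback that puts a frame on the link

    def packet_from_network_layer(self, packet):
        if self.state is not State.READY:
            raise RuntimeError("sender is blocking; the network layer must wait")
        self.saved = ("DATA", packet)    # make a frame and save a copy
        self.timer_running = True        # start the timer
        self.send(self.saved)
        self.state = State.BLOCKING

    def timeout(self):
        if self.state is State.BLOCKING:
            self.send(self.saved)        # resend the saved copy
            self.timer_running = True    # restart the timer

    def ack_arrived(self, corrupted=False):
        if self.state is not State.BLOCKING or corrupted:
            return                       # a corrupted ACK is discarded
        self.timer_running = False       # stop the timer
        self.saved = None                # discard the saved copy
        self.state = State.READY


sent = []
sender = StopAndWaitSender(sent.append)
sender.packet_from_network_layer("p1")
sender.timeout()                     # the frame was lost, so it is resent
sender.ack_arrived(corrupted=True)   # ignored; the sender keeps blocking
sender.ack_arrived()                 # error-free ACK; back to the ready state
```

After this exchange the link has carried the same frame twice and the sender is ready for the next packet.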
Example 11.3
Figure 11.12 shows an example. The first frame is sent and acknowledged. The second frame is
sent, but lost. After time-out, it is resent. The third frame is sent and acknowledged, but the
acknowledgment is lost. The frame is resent. However, there is a problem with this scheme. The
network layer at the receiver site receives two copies of the third packet, which is not right. In the
next section, we will see how we can correct this problem using sequence numbers and acknowl-
edgment numbers.
Sequence and Acknowledgment Numbers
We saw a problem in Example 11.3 that needs to be addressed and corrected. Duplicate packets,
as much as corrupted packets, need to be avoided. As an example, assume we are ordering some
item online. If each packet defines the specification of an item to be ordered, duplicate packets
mean ordering an item more than once. To correct the problem in Example 11.3, we need to add
sequence numbers to the data frames and acknowledgment numbers to the ACK frames. How-
ever, numbering in this case is very simple. Sequence numbers are 0, 1, 0, 1, 0, 1, . . . ; the
acknowledgment numbers can also be 1, 0, 1, 0, 1, 0, … In other words, the sequence numbers
start with 0, the acknowledgment numbers start with 1. An acknowledgment number always
defines the sequence number of the next frame to receive.
Example 11.4
Figure 11.13 shows how adding sequence numbers and acknowledgment numbers can prevent
duplicates. The first frame is sent and acknowledged. The second frame is sent, but lost. After
time-out, it is resent. The third frame is sent and acknowledged, but the acknowledgment is lost.
The frame is resent.
[Figure 11.13: Flow diagram for Example 11.4; the legend distinguishes packets, frames, ACKs, and timer events]
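To see why one-bit sequence numbers are enough to remove the duplicate, the receiver side can be sketched as follows. The `Receiver` class is hypothetical, and we assume that a duplicate frame is acknowledged again (so that the sender can leave the blocking state) but is not delivered a second time.

```python
class Receiver:
    """Stop-and-Wait receiver that uses sequence numbers 0, 1, 0, 1, ..."""

    def __init__(self):
        self.expected = 0      # sequence number of the next frame to receive
        self.delivered = []    # packets handed to the network layer

    def frame_arrived(self, seq_no, packet):
        """Handle an error-free frame and return the ackNo to send back."""
        if seq_no == self.expected:
            self.delivered.append(packet)
            self.expected = (self.expected + 1) % 2
        # Assumption: a duplicate is not delivered, but it is acknowledged again.
        return self.expected


rx = Receiver()
acks = [
    rx.frame_arrived(0, "packet 1"),  # first frame: delivered
    rx.frame_arrived(1, "packet 2"),  # resent copy of the lost frame: delivered
    rx.frame_arrived(0, "packet 3"),  # delivered, but this ACK is lost
    rx.frame_arrived(0, "packet 3"),  # resent after time-out: recognized as a duplicate
]
print(acks, rx.delivered)
```

The network layer receives each packet exactly once, even though the third frame arrives twice.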
11.2.3 Piggybacking
The two protocols we discussed in this section are designed for unidirectional commu-
nication, in which data is flowing only in one direction although the acknowledgment
may travel in the other direction. Protocols have been designed in the past to allow data
to flow in both directions. However, to make the communication more efficient, the
data in one direction is piggybacked with the acknowledgment in the other direction. In
other words, when node A is sending data to node B, node A also acknowledges the
data received from node B. Because piggybacking makes communication at the data-
link layer more complicated, it is not a common practice. We discuss two-way commu-
nication and piggybacking in more detail in Chapter 23.
11.3 HDLC
High-level Data Link Control (HDLC) is a bit-oriented protocol for communication
over point-to-point and multipoint links. It implements the Stop-and-Wait protocol we
discussed earlier. Although this protocol is more of theoretical than practical interest, most
of the concepts defined in it are the basis for other practical protocols such as
PPP, which we discuss next, and the Ethernet protocol, which we discuss under wired LANs
(Chapter 13) and wireless LANs (Chapter 15).
CHAPTER 12
Media Access
Control (MAC)
When nodes or stations are connected and use a common link, called a multipoint or
broadcast link, we need a multiple-access protocol to coordinate access to the link.
The problem of controlling the access to the medium is similar to the rules of speaking in
an assembly. The procedures guarantee that the right to speak is upheld and ensure that
two people do not speak at the same time, do not interrupt each other, do not monopolize
the discussion, and so on. Many protocols have been devised to handle access to a shared
link. All of these protocols belong to a sublayer in the data-link layer called media access
control (MAC). We categorize them into three groups, as shown in Figure 12.1.
[Figure 12.1: Taxonomy of multiple-access protocols]
[Figure 12.2: Frames in a pure ALOHA network; stations 1 to 4 on a common time axis, with the collision durations marked]
There are four stations (unrealistic assumption) that contend with one another for
access to the shared channel. The figure shows that each station sends two frames; there
are a total of eight frames on the shared medium. Some of these frames collide because
multiple frames are in contention for the shared channel. Figure 12.2 shows that only
two frames survive: one frame from station 1 and one frame from station 3. We need to
mention that even if one bit of a frame coexists on the channel with one bit from
another frame, there is a collision and both will be destroyed. It is obvious that we need
to resend the frames that have been destroyed during transmission.
The pure ALOHA protocol relies on acknowledgments from the receiver. When a
station sends a frame, it expects the receiver to send an acknowledgment. If the
acknowledgment does not arrive after a time-out period, the station assumes that the
frame (or the acknowledgment) has been destroyed and resends the frame.
A collision involves two or more stations. If all these stations try to resend their
frames after the time-out, the frames will collide again. Pure ALOHA dictates that
when the time-out period passes, each station waits a random amount of time before
resending its frame. The randomness will help avoid more collisions. We call this time
the backoff time TB.
Pure ALOHA has a second method to prevent congesting the channel with retrans-
mitted frames. After a maximum number of retransmission attempts Kmax , a station
must give up and try later. Figure 12.3 shows the procedure for pure ALOHA based on
the above strategy.
The time-out period is equal to the maximum possible round-trip propagation delay,
which is twice the amount of time required to send a frame between the two most widely
separated stations (2 × Tp). The backoff time TB is a random value that normally depends
on K (the number of attempted unsuccessful transmissions). The formula for TB depends
on the implementation. One common formula is the binary exponential backoff. In this
method, for each retransmission, a multiplier R = 0 to $2^K - 1$ is randomly chosen and
multiplied by Tp (maximum propagation time) or Tfr (the average time required to send out a
frame) to find TB. Note that in this procedure, the range of the random numbers increases
after each collision. The value of Kmax is usually chosen as 15.
[Figure 12.3: Procedure for pure ALOHA. Legend: K = number of attempts; Tp = maximum propagation time; Tfr = average transmission time; TB (backoff time) = R × Tp or R × Tfr; R (random number) = 0 to $2^K - 1$. Flow: start with K = 0 and send the frame; wait 2 × Tp for the ACK; if it is received, the transmission succeeds; otherwise set K = K + 1, abort if K > Kmax, else choose R, wait TB, and send again.]
Example 12.1
The stations on a wireless ALOHA network are a maximum of 600 km apart. If we assume that
signals propagate at $3 \times 10^8$ m/s, we find Tp = $(600 \times 10^3) / (3 \times 10^8)$ = 2 ms. For K = 2, the range
of R is {0, 1, 2, 3}. This means that TB can be 0, 2, 4, or 6 ms, based on the outcome of the ran-
dom variable R.
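The numbers in Example 12.1 can be reproduced with a short sketch of the binary exponential backoff. The function name and its parameters are our own, and the backoff is expressed as a multiple of a caller-supplied unit time (Tp here).

```python
import random


def backoff_time(k, unit_time, rng=random):
    """Binary exponential backoff: T_B = R * unit_time, with R drawn from 0..2**k - 1."""
    r = rng.randint(0, 2**k - 1)   # randint includes both end points
    return r * unit_time


distance_m = 600e3             # 600 km between the two farthest stations
speed_mps = 3e8                # assumed propagation speed
t_p = distance_m / speed_mps   # maximum propagation time in seconds

# All values T_B can take for K = 2, in milliseconds.
possible_tb_ms = [r * t_p * 1e3 for r in range(2**2)]
print(possible_tb_ms)
print(backoff_time(2, t_p) * 1e3, "ms chosen for this attempt")
```

Because the range of R doubles with every unsuccessful attempt, stations that keep colliding spread their retries over an ever longer period.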
Vulnerable time
Let us find the vulnerable time, the length of time in which there is a possibility of colli-
sion. We assume that the stations send fixed-length frames with each frame taking Tfr sec-
onds to send. Figure 12.4 shows the vulnerable time for station B.
[Figure 12.4: Vulnerable time for station B in pure ALOHA; stations A, B, and C on a time axis from t − Tfr to t + Tfr, vulnerable time = 2 × Tfr]
Station B starts to send a frame at time t. Now imagine station A has started to send
its frame after t − Tfr . This leads to a collision between the frames from station B and
station A. On the other hand, suppose that station C starts to send a frame before time
t + Tfr . Here, there is also a collision between frames from station B and station C.
Looking at Figure 12.4, we see that the vulnerable time during which a collision
may occur in pure ALOHA is 2 times the frame transmission time.
Pure ALOHA vulnerable time = 2 × Tfr
Example 12.2
A pure ALOHA network transmits 200-bit frames on a shared channel of 200 kbps. What is the
requirement to make this frame collision-free?
Solution
Average frame transmission time Tfr is 200 bits/200 kbps or 1 ms. The vulnerable time is 2 × 1 ms =
2 ms. This means no station should send later than 1 ms before this station starts transmission and
no station should start sending during the period (1 ms) that this station is sending.
Throughput
Let us call G the average number of frames generated by the system during one frame
transmission time. Then it can be proven that the average number of successfully transmitted
frames for pure ALOHA is $S = G \times e^{-2G}$. The maximum throughput Smax is 0.184,
for G = 1/2. (We can find it by setting the derivative of S with respect to G to 0; see Exer-
cises.) In other words, if one-half a frame is generated during one frame transmission
time (one frame during two frame transmission times), then 18.4 percent of these frames
reach their destination successfully. We expect G = 1/2 to produce the maximum through-
put because the vulnerable time is 2 times the frame transmission time. Therefore, if a
station generates only one frame in this vulnerable time (and no other stations generate a
frame during this time), the frame will reach its destination successfully.
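For readers who want to check the maximum without turning to the exercises, the derivative argument can be written out in a few steps, using only the throughput formula given above:

$$
\frac{dS}{dG} = \frac{d}{dG}\left(G\,e^{-2G}\right) = e^{-2G}\,(1 - 2G) = 0
\quad\Longrightarrow\quad G = \frac{1}{2},
\qquad
S_{max} = \frac{1}{2}\,e^{-1} = \frac{1}{2e} \approx 0.184
$$

The derivative is positive for G < 1/2 and negative for G > 1/2, so this stationary point is indeed the maximum.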
Example 12.3
A pure ALOHA network transmits 200-bit frames on a shared channel of 200 kbps. What is the
throughput if the system (all stations together) produces
a. 1000 frames per second?
b. 500 frames per second?
c. 250 frames per second?
Solution
The frame transmission time is 200 bits / 200 kbps, or 1 ms.
a. If the system creates 1000 frames per second, or 1 frame per millisecond, then G = 1. In
this case $S = G \times e^{-2G}$ = 0.135 (13.5 percent). This means that the throughput is 1000 ×
0.135 = 135 frames. Only 135 frames out of 1000 will probably survive.
b. If the system creates 500 frames per second, or 1/2 frames per millisecond, then G = 1/2.
In this case $S = G \times e^{-2G}$ = 0.184 (18.4 percent). This means that the throughput is 500 ×
0.184 = 92 and that only 92 frames out of 500 will probably survive. Note that this is the
maximum throughput case, percentagewise.
c. If the system creates 250 frames per second, or 1/4 frames per millisecond, then G = 1/4.
In this case $S = G \times e^{-2G}$ = 0.152 (15.2 percent). This means that the throughput is
250 × 0.152 = 38. Only 38 frames out of 250 will probably survive.
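The three cases of Example 12.3 can be recomputed with a few lines of code. This is a minimal sketch; the function name and its parameters are our own choices, not part of any library.

```python
import math


def pure_aloha_throughput(frames_per_second, frame_bits, rate_bps):
    """Return (G, S, successful frames per second) for pure ALOHA."""
    t_fr = frame_bits / rate_bps      # frame transmission time in seconds
    g = frames_per_second * t_fr      # frames generated per frame transmission time
    s = g * math.exp(-2 * g)          # S = G * e^(-2G)
    return g, s, frames_per_second * s


for load in (1000, 500, 250):
    g, s, ok = pure_aloha_throughput(load, frame_bits=200, rate_bps=200_000)
    print(f"{load} frames/s: G = {g:g}, S = {s:.3f}, about {round(ok)} frames/s survive")
```

Trying loads above and below 500 frames per second shows the throughput falling off on both sides of G = 1/2.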
Slotted ALOHA
Pure ALOHA has a vulnerable time of 2 × Tfr . This is so because there is no rule that
defines when the station can send. A station may send soon after another station has
started or just before another station has finished. Slotted ALOHA was invented to
improve the efficiency of pure ALOHA.
In slotted ALOHA we divide the time into slots of Tfr seconds and force the sta-
tion to send only at the beginning of the time slot. Figure 12.5 shows an example of
frame collisions in slotted ALOHA.
[Figure 12.5: Frames in a slotted ALOHA network; stations 1 to 4 over slots 1 to 6, with the collision durations marked]
Because a station is allowed to send only at the beginning of the synchronized time
slot, if a station misses this moment, it must wait until the beginning of the next time
slot. This means that the station which started at the beginning of this slot has already
finished sending its frame. Of course, there is still the possibility of collision if two
stations try to send at the beginning of the same time slot. However, the vulnerable time
is now reduced to one-half, equal to Tfr. Figure 12.6 shows the situation.
[Figure 12.6: Vulnerable time for slotted ALOHA; B collides with C, time axis from t − Tfr to t + Tfr, vulnerable time = Tfr]
Example 12.4
A slotted ALOHA network transmits 200-bit frames using a shared channel with a 200-kbps
bandwidth. Find the throughput if the system (all stations together) produces
a. 1000 frames per second.
b. 500 frames per second.
c. 250 frames per second.
Solution
This situation is similar to Example 12.3 except that the network is using slotted ALOHA
instead of pure ALOHA. The frame transmission time is 200 bits / 200 kbps, or 1 ms.
a. In this case G is 1. So $S = G \times e^{-G}$ = 0.368 (36.8 percent). This means that the throughput
is 1000 × 0.368 = 368 frames. Only 368 out of 1000 frames will probably survive. Note
that this is the maximum throughput case, percentagewise.
b. Here G is 1/2. In this case $S = G \times e^{-G}$ = 0.303 (30.3 percent). This means that the
throughput is 500 × 0.303 ≈ 151. Only 151 frames out of 500 will probably survive.
c. Now G is 1/4. In this case $S = G \times e^{-G}$ = 0.195 (19.5 percent). This means that the
throughput is 250 × 0.195 ≈ 49. Only 49 frames out of 250 will probably survive.
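A side-by-side computation makes the gain of slotted ALOHA over pure ALOHA easy to see at the same loads. The helper below is a hypothetical name of our own; it simply evaluates the two throughput formulas.

```python
import math


def aloha_throughput(g, slotted):
    """S = G * e^(-G) for slotted ALOHA, S = G * e^(-2G) for pure ALOHA."""
    return g * math.exp(-g if slotted else -2 * g)


t_fr = 200 / 200_000   # frame transmission time: 200 bits at 200 kbps
for load in (1000, 500, 250):
    g = load * t_fr
    print(f"G = {g:g}: slotted S = {aloha_throughput(g, True):.3f}, "
          f"pure S = {aloha_throughput(g, False):.3f}")
```

At every load the slotted value is the larger one, because halving the vulnerable time halves the exponent.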
12.1.2 CSMA
To minimize the chance of collision and, therefore, increase the performance, the
CSMA method was developed. The chance of collision can be reduced if a station
senses the medium before trying to use it. Carrier sense multiple access (CSMA)
requires that each station first listen to the medium (or check the state of the medium)
before sending. In other words, CSMA is based on the principle “sense before transmit”
or “listen before talk.”
CSMA can reduce the possibility of collision, but it cannot eliminate it. The reason
for this is shown in Figure 12.7, a space and time model of a CSMA network. Stations
are connected to a shared channel (usually a dedicated medium).
The possibility of collision still exists because of propagation delay; when a station
sends a frame, it still takes time (although very short) for the first bit to reach every station
and for every station to sense it. In other words, a station may sense the medium and find
it idle, only because the first bit sent by another station has not yet been received.
[Figure 12.7: Space/time model of a collision in CSMA; stations A, B, C, and D on the medium, B starts at time t1 and C starts at time t2, with the areas where B's signal, C's signal, and both signals exist]
At time t1, station B senses the medium and finds it idle, so it sends a frame. At
time t2 (t2 > t1), station C senses the medium and finds it idle because, at this time, the
first bits from station B have not reached station C. Station C also sends a frame. The
two signals collide and both frames are destroyed.
Vulnerable Time
The vulnerable time for CSMA is the propagation time Tp. This is the time needed for
a signal to propagate from one end of the medium to the other. When a station sends a
frame and any other station tries to send a frame during this time, a collision will result.
But if the first bit of the frame reaches the end of the medium, every station will already
have heard the bit and will refrain from sending. Figure 12.8 shows the worst case. The
leftmost station, A, sends a frame at time t1, which reaches the rightmost station, D, at
time t1 + Tp. The gray area shows the vulnerable area in time and space.
[Figure 12.8: Vulnerable time in CSMA; stations A, B, C, and D in space and time]
Persistence Methods
What should a station do if the channel is busy? What should a station do if the channel
is idle? Three methods have been devised to answer these questions: the 1-persistent
method, the nonpersistent method, and the p-persistent method. Figure 12.9 shows
the behavior of three persistence methods when a station finds a channel busy.
[Figure 12.9: Behavior of the three persistence methods when the channel is busy: a. 1-persistent (continuously sense, then transmit), b. nonpersistent (sense, wait, sense again, then transmit), c. p-persistent]
[Figure 12.10: Flow diagrams for the three persistence methods: a. 1-persistent, b. nonpersistent (wait randomly while the channel is busy), c. p-persistent (generate a random number R between 0 and 1 once the channel is idle)]
2. With probability q = 1 − p, the station waits for the beginning of the next time slot
and checks the line again.
a. If the line is idle, it goes to step 1.
b. If the line is busy, it acts as though a collision has occurred and uses the back-
off procedure.
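The p-persistent decision can be sketched as a short loop that starts once the station has found the line idle. Only step 2 appears in the text above; step 1 (send with probability p) is our assumption about the missing step, and `line_is_idle` is a hypothetical callback that is called at the beginning of each following time slot.

```python
import random


def p_persistent(p, line_is_idle, rng=random):
    """Return "transmit" or "backoff" for a station that has found the line idle."""
    while True:
        # Step 1 (assumed, not stated in the text above): send with probability p.
        if rng.random() < p:
            return "transmit"
        # Step 2: with probability q = 1 - p, wait for the next slot and check again.
        if not line_is_idle():
            return "backoff"   # 2b: a busy line is treated like a collision
        # 2a: the line is idle, so go back to step 1.


print(p_persistent(0.3, lambda: True))
```

With a small p the station defers often, which lowers the chance that two waiting stations start in the same slot.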
12.1.3 CSMA/CD
The CSMA method does not specify the procedure following a collision. Carrier sense
multiple access with collision detection (CSMA/CD) augments the algorithm to
handle the collision.
In this method, a station monitors the medium after it sends a frame to see if the
transmission was successful. If so, the station is finished. If, however, there is a colli-
sion, the frame is sent again.
To better understand CSMA/CD, let us look at the first bits transmitted by the two
stations involved in the collision. Although each station continues to send bits in the
frame until it detects the collision, we show what happens as the first bits collide. In
Figure 12.11, stations A and C are involved in the collision.
[Figure 12.11: Collision of the first bits in CSMA/CD; A starts at t1 and C at t2, C detects the collision and aborts at t3, A detects the collision and aborts at t4]
At time t1, station A has executed its persistence procedure and starts sending
the bits of its frame. At time t2, station C has not yet sensed the first bit sent by
A. Station C executes its persistence procedure and starts sending the bits in its
frame, which propagate both to the left and to the right. The collision occurs some-
time after time t2. Station C detects a collision at time t3 when it receives the first
bit of A’s frame. Station C immediately (or after a short time, but we assume imme-
diately) aborts transmission. Station A detects collision at time t4 when it receives
the first bit of C’s frame; it also immediately aborts transmission. Looking at the
figure, we see that A transmits for the duration t4 − t1; C transmits for the duration
t3 − t2.
Now that we know the time durations for the two transmissions, we can show a
more complete graph in Figure 12.12.
[Figure 12.12: Collision and abortion in CSMA/CD; part of A's frame is sent between t1 and t4 and part of C's frame between t2 and t3 before each station detects the collision and aborts]
the frame and does not monitor the line for collision detection. Therefore, the frame trans-
mission time Tfr must be at least two times the maximum propagation time Tp. To under-
stand the reason, let us think about the worst-case scenario. If the two stations involved in
a collision are the maximum distance apart, the signal from the first takes time Tp to reach
the second, and the effect of the collision takes another time Tp to reach the first. So the
requirement is that the first station must still be transmitting after 2Tp.
Example 12.5
A network using CSMA/CD has a bandwidth of 10 Mbps. If the maximum propagation time
(including the delays in the devices and ignoring the time needed to send a jamming signal, as we
see later) is 25.6 μs, what is the minimum size of the frame?
Solution
The minimum frame transmission time is Tfr = 2 × Tp = 51.2 μs. This means, in the worst case, a
station needs to transmit for a period of 51.2 μs to detect the collision. The minimum size of the
frame is 10 Mbps × 51.2 μs = 512 bits or 64 bytes. This is actually the minimum size of the frame
for Standard Ethernet, as we will see later in the chapter.
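The calculation in Example 12.5 generalizes to any data rate and propagation time. The function below is a small sketch with names of our own choosing.

```python
def min_frame_bits(rate_bps, t_p_seconds):
    """Smallest frame that is still being sent when a worst-case collision returns."""
    return rate_bps * 2 * t_p_seconds   # T_fr must be at least 2 * T_p


bits = round(min_frame_bits(10e6, 25.6e-6))
print(bits, "bits =", bits // 8, "bytes")
```

Raising the data rate while keeping the same minimum frame size therefore forces a shorter maximum propagation time, that is, a shorter network.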
Procedure
Now let us look at the flow diagram for CSMA/CD in Figure 12.13. It is similar to the
one for the ALOHA protocol, but there are differences.
[Figure 12.13: Flow diagram for CSMA/CD. Legend: Tfr = average frame transmission time; K = number of attempts; R (random number) = 0 to $2^K - 1$; TB (backoff time) = R × Tfr. Flow: start with K = 0; apply one of the persistence methods; transmit and receive until done or a collision occurs; if no collision is detected, the transmission succeeds; otherwise send a jamming signal, set K = K + 1, abort if K is no longer less than 15, else create a random number R, wait TB seconds, and try again.]
The first difference is the addition of the persistence process. We need to sense the
channel before we start sending the frame by using one of the persistence processes we
discussed previously (nonpersistent, 1-persistent, or p-persistent). The corresponding
box can be replaced by one of the persistence processes shown in Figure 12.10.
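The overall CSMA/CD procedure can be sketched as a retry loop. This is an illustration only: `transmit` and `wait` are hypothetical callbacks, where `transmit()` is assumed to apply a persistence method, send the frame while monitoring the line, send the jamming signal if needed, and return True when a collision was detected.

```python
import random


def csma_cd_send(transmit, wait, t_fr, k_max=15, rng=random):
    """Sense, transmit, and back off after each collision; give up after k_max attempts."""
    k = 0
    while True:
        collided = transmit()
        if not collided:
            return "success"
        k += 1
        if k >= k_max:                 # continue only while K < 15
            return "abort"
        r = rng.randint(0, 2**k - 1)   # R between 0 and 2^K - 1
        wait(r * t_fr)                 # T_B = R * T_fr


outcomes = iter([True, True, False])   # two collisions, then a clean transmission
waits = []
result = csma_cd_send(lambda: next(outcomes), waits.append, t_fr=51.2e-6)
print(result, len(waits))
```

Unlike the ALOHA procedure, there is no wait for an acknowledgment here; a transmission that finishes without a detected collision counts as a success.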
Throughput
The throughput of CSMA/CD is greater than that of pure or slotted ALOHA. The max-
imum throughput occurs at a different value of G and is based on the persistence
method and the value of p in the p-persistent approach. For the 1-persistent method, the
maximum throughput is around 50 percent when G = 1. For the nonpersistent method,
the maximum throughput can go up to 90 percent when G is between 3 and 8.
Traditional Ethernet
One of the LAN protocols that used CSMA/CD is the traditional Ethernet with the data
rate of 10 Mbps. We discuss the Ethernet LANs in Chapter 13, but it is good to know
that the traditional Ethernet was a broadcast LAN that used the 1-persistent method to
control access to the common medium. Later versions of Ethernet moved away from the
CSMA/CD access method for reasons that we discuss in Chapter 13.
12.1.4 CSMA/CA
Carrier sense multiple access with collision avoidance (CSMA/CA) was invented
for wireless networks. Collisions are avoided through the use of CSMA/CA’s three
strategies: the interframe space, the contention window, and acknowledgments, as
shown in Figure 12.15. We discuss RTS and CTS frames later.
[Figure 12.15: Flow diagram for CSMA/CA. Legend: K = number of attempts; TB = backoff time; IFS = interframe space; RTS = request to send; CTS = clear to send. Flow: start with K = 0; sense the carrier until the channel is free; wait IFS; send RTS and set a timer; if CTS is received before time-out, wait IFS, send the frame, and set a timer; if the ACK is received before time-out, the transmission succeeds; after either time-out set K = K + 1, abort if K is no longer below the limit, else wait TB seconds and try again.]
❑ Interframe Space (IFS). First, collisions are avoided by deferring transmission even
if the channel is found idle. When an idle channel is found, the station does not send
immediately. It waits for a period of time called the interframe space or IFS. Even
though the channel may appear idle when it is sensed, a distant station may have
already started transmitting. The distant station’s signal has not yet reached this
station. The IFS time allows the front of the signal transmitted by the distant station to
reach this station. After waiting an IFS time, if the channel is still idle, the station can
send, but it still needs to wait a time equal to the contention window (described next).
The IFS variable can also be used to prioritize stations or frame types. For example, a
station that is assigned a shorter IFS has a higher priority.
❑ Contention Window. The contention window is an amount of time divided into
slots. A station that is ready to send chooses a random number of slots as its wait
time. The number of slots in the window changes according to the binary exponen-
tial backoff strategy. This means that it is set to one slot the first time and then dou-
bles each time the station cannot detect an idle channel after the IFS time. This is
very similar to the p-persistent method except that a random outcome defines the
number of slots taken by the waiting station. One interesting point about the con-
tention window is that the station needs to sense the channel after each time slot.
However, if the station finds the channel busy, it does not restart the process; it just
stops the timer and restarts it when the channel is sensed as idle. This gives priority
to the station with the longest waiting time. See Figure 12.16.
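[Figure 12.16: Contention window; after the channel is found idle and the IFS has passed, the station continuously senses the channel during a window whose size follows binary exponential backoff]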
❑ Acknowledgment. With all these precautions, there still may be a collision resulting
in destroyed data. In addition, the data may be corrupted during the transmission.
The positive acknowledgment and the time-out timer can help guarantee that the
receiver has received the frame.
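The way the contention-window timer is frozen, rather than restarted, when the channel turns busy can be sketched as follows. The function and its arguments are hypothetical: the number of backoff slots is passed in directly, and the channel is represented by one boolean per elapsed slot (True for idle), which must be long enough for the countdown to finish.

```python
def slots_until_send(backoff_slots, channel_idle):
    """Count how many slots elapse before a station with this backoff may send."""
    remaining = backoff_slots
    elapsed = 0
    sensed = iter(channel_idle)   # the channel is sensed after each time slot
    while remaining > 0:
        elapsed += 1
        if next(sensed):
            remaining -= 1        # the timer runs only while the channel is idle
        # busy slot: the timer is stopped, not reset
    return elapsed


# A backoff of 3 slots, interrupted by two busy slots, takes 5 slots in total.
print(slots_until_send(3, [True, False, False, True, True]))
```

Because the remaining count survives the busy period, a station that has already waited keeps its advantage over stations that are just starting.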
Frame Exchange Time Line
Figure 12.17 shows the exchange of data and control frames in time.
1. Before sending a frame, the source station senses the medium by checking the
energy level at the carrier frequency.
a. The station uses a persistence strategy with backoff until the channel is idle.
b. After the channel is found to be idle, the station waits for a period of time called
the DCF interframe space (DIFS); then the station sends a control frame called
the request to send (RTS).
2. After receiving the RTS and waiting a period of time called the short interframe
space (SIFS), the destination station sends a control frame, called the clear to
send (CTS), to the source station. This control frame indicates that the destination
station is ready to receive data.
[Figure 12.17: CSMA/CA and NAV; time line of DIFS, RTS, SIFS, CTS, SIFS, data, SIFS, and ACK, with the NAV set by the other stations for the duration of the exchange]
3. The source station sends data after waiting an amount of time equal to SIFS.
4. The destination station, after waiting an amount of time equal to SIFS, sends an
acknowledgment to show that the frame has been received. Acknowledgment is
needed in this protocol because the station does not have any means to check for
the successful arrival of its data at the destination. On the other hand, the lack of
collision in CSMA/CD is a kind of indication to the source that data have
arrived.
Network Allocation Vector
How do other stations defer sending their data if one station acquires access? In other
words, how is the collision avoidance aspect of this protocol accomplished? The key is
a feature called NAV.
When a station sends an RTS frame, it includes the duration of time that it needs to
occupy the channel. The stations that are affected by this transmission create a timer
called a network allocation vector (NAV) that shows how much time must pass before
these stations are allowed to check the channel for idleness. Each time a station
accesses the system and sends an RTS frame, other stations start their NAV. In other
words, each station, before sensing the physical medium to see if it is idle, first checks
its NAV to see if it has expired. Figure 12.17 shows the idea of NAV.
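The NAV can be pictured as a per-station timer that is consulted before the physical medium is sensed. The sketch below uses a hypothetical `Station` class and arbitrary time units; keeping the later of two overlapping reservations is our own assumption.

```python
class Station:
    """Virtual carrier sensing with a network allocation vector (NAV)."""

    def __init__(self):
        self.nav_expires_at = 0.0   # time until which the channel counts as reserved

    def overheard(self, now, duration):
        """Start or extend the NAV from the duration announced in an overheard frame."""
        # Assumption: if two reservations overlap, the later expiry wins.
        self.nav_expires_at = max(self.nav_expires_at, now + duration)

    def may_sense_channel(self, now):
        """The physical medium is checked only after the NAV has expired."""
        return now >= self.nav_expires_at


neighbor = Station()
neighbor.overheard(now=0.0, duration=5.0)   # an RTS announces 5 time units
print(neighbor.may_sense_channel(3.0), neighbor.may_sense_channel(5.0))
```

The station defers without listening at all during the reserved period, which is what makes the avoidance work even when it cannot hear the data frame itself.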
Collision During Handshaking
What happens if there is a collision during the time when RTS or CTS control frames
are in transition, often called the handshaking period? Two or more stations may try to
send RTS frames at the same time. These control frames may collide. However,
because there is no mechanism for collision detection, the sender assumes there has
been a collision if it has not received a CTS frame from the receiver. The backoff strat-
egy is employed, and the sender tries again.
Hidden-Station Problem
The solution to the hidden station problem is the use of the handshake frames (RTS and
CTS). Figure 12.17 also shows that the RTS message from B reaches A, but not C.
However, because both B and C are within the range of A, the CTS message, which
contains the duration of data transmission from B to A, reaches C. Station C knows that
some hidden station is using the channel and refrains from transmitting until that dura-
tion is over.
CSMA/CA and Wireless Networks
CSMA/CA was mostly intended for use in wireless networks. The procedure described
above, however, is not sophisticated enough to handle some particular issues related to
wireless networks, such as hidden terminals or exposed terminals. We will see how
these issues are solved by augmenting the above protocol with handshaking features.
The use of CSMA/CA in wireless networks will be discussed in Chapter 15.
12.2.1 Reservation
In the reservation method, a station needs to make a reservation before sending data.
Time is divided into intervals. In each interval, a reservation frame precedes the data
frames sent in that interval.
If there are N stations in the system, there are exactly N reservation minislots in the
reservation frame. Each minislot belongs to a station. When a station needs to send a
data frame, it makes a reservation in its own minislot. The stations that have made res-
ervations can send their data frames after the reservation frame.
Figure 12.18 shows a situation with five stations and a five-minislot reservation
frame. In the first interval, only stations 1, 3, and 4 have made reservations. In the sec-
ond interval, only station 1 has made a reservation.
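The two intervals just described can be reproduced with a small sketch. The function name is hypothetical, and sending the data frames in minislot order is our assumption; the text only says that the reserving stations send after the reservation frame.

```python
def run_interval(n_stations, wants_to_send):
    """Return (reservation frame, transmission order) for one interval.

    wants_to_send is the set of station numbers (starting at 1) with data ready.
    """
    # One minislot per station: 1 if the station reserves, 0 otherwise.
    reservation_frame = [1 if s in wants_to_send else 0
                         for s in range(1, n_stations + 1)]
    # Assumption: reserved stations send in minislot order.
    order = [s for s, bit in enumerate(reservation_frame, start=1) if bit]
    return reservation_frame, order


print(run_interval(5, {1, 3, 4}))   # first interval: stations 1, 3, and 4
print(run_interval(5, {1}))         # second interval: only station 1
```

Because every station owns its minislot, the reservations themselves can never collide.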
After discussing the general issues related to the data-link layer in Chapters 9 to 12,
it is time in this chapter to discuss the wired LANs. Although over a few decades
many wired LAN protocols existed, only the Ethernet technology survives today. This
is the reason that we discuss only this technology and its evolution in this chapter.
This chapter is divided into five sections.
❑ The first section discusses the Ethernet protocol in general. It explains that IEEE
Project 802 defines the LLC and MAC sublayers for all LANs including Ethernet.
The section also lists the four generations of Ethernet.
❑ The second section discusses the Standard Ethernet. Although this generation is
rarely seen in practice, most of the characteristics have been inherited by the fol-
lowing three generations. The section first describes some characteristics of the
Standard Ethernet. It then discusses the addressing mechanism, which is the same
in all Ethernet generations. The section next discusses the access method, CSMA/
CD, which we discussed in Chapter 12. The section then reviews the efficiency of
the Standard Ethernet. It then shows the encoding and the implementation of this
generation. Before closing the section, the changes in this generation that resulted
in the move to the next generation are listed.
❑ The third section describes the Fast Ethernet, the second generation, which can still
be seen in many places. The section first describes the changes in the MAC sub-
layer. The section then discusses the physical layer and the implementation of this
generation.
❑ The fourth section discusses the Gigabit Ethernet, with the rate of 1 gigabit per
second. The section first describes the MAC sublayer. It then moves to the physical
layer and implementation.
❑ The fifth section touches on the 10 Gigabit Ethernet. This is a new technology that
can be used both as a backbone LAN and as a MAN (metropolitan area network).
The section briefly describes the rationale and the implementation.
[Figure: Data-link layer in the IEEE standard for LANs; a common LLC sublayer above separate MAC sublayers for Ethernet, Token Ring, Token Bus, and others]
part of the framing duties are collected into one sublayer called the logical link control
(LLC). Framing is handled in both the LLC sublayer and the MAC sublayer.
The LLC provides a single link-layer control protocol for all IEEE LANs. This
means LLC protocol can provide interconnectivity between different LANs because it
makes the MAC sublayer transparent.
Media Access Control (MAC)
Earlier we discussed multiple access methods including random access, controlled
access, and channelization. IEEE Project 802 has created a sublayer called media
access control that defines the specific access method for each LAN. For example, it
defines CSMA/CD as the media access method for Ethernet LANs and defines the
token-passing method for Token Ring and Token Bus LANs. As we mentioned in the
previous section, part of the framing function is also handled by the MAC layer.
[Figure: Ethernet evolution]
13.2.1 Characteristics
Let us first discuss some characteristics of the Standard Ethernet.
Connectionless and Unreliable Service
Ethernet provides a connectionless service, which means each frame sent is independent
of the previous or next frame. Ethernet has no connection establishment or connection
termination phases. The sender sends a frame whenever it has one; the receiver may or may
not be ready for it. The sender may overwhelm the receiver with frames, which may result
in dropping frames. If a frame drops, the sender will not know about it. Since IP, which is
using the service of Ethernet, is also connectionless, it will not know about it either. If the
transport layer is also a connectionless protocol, such as UDP, the frame is lost and
salvation may only come from the application layer. However, if the transport layer is
TCP, the sender TCP does not receive acknowledgment for its segment and sends it again.
Ethernet is also unreliable, like IP and UDP. If a frame is corrupted during
transmission and the receiver detects the corruption (which is highly probable
because of the CRC-32), the receiver drops the frame silently. It is
the duty of higher-level protocols to find out about it.
Frame Format
The Ethernet frame contains seven fields, as shown in Figure 13.3.
❑ Preamble. This field contains 7 bytes (56 bits) of alternating 0s and 1s that alert the
receiving system to the coming frame and enable it to synchronize its clock if it’s out
of synchronization. The pattern provides only an alert and a timing pulse. The 56-bit
pattern allows the stations to miss some bits at the beginning of the frame. The pream-
ble is actually added at the physical layer and is not (formally) part of the frame.
❑ Start frame delimiter (SFD). This field (1 byte: 10101011) signals the beginning
of the frame. The SFD warns the station or stations that this is the last chance for
synchronization. The last 2 bits are (11)₂ and alert the receiver that the next field is
the destination address. This field is actually a flag that defines the beginning of
the frame. We need to remember that an Ethernet frame is a variable-length frame.
It needs a flag to define the beginning of the frame. The SFD field is also added at
the physical layer.
❑ Destination address (DA). This field is six bytes (48 bits) and contains the link-
layer address of the destination station or stations to receive the packet. We will
discuss addressing shortly. When the receiver sees its own link-layer address, or a
multicast address for a group that the receiver is a member of, or a broadcast
address, it decapsulates the data from the frame and passes the data to the upper-
layer protocol defined by the value of the type field.
❑ Source address (SA). This field is also six bytes and contains the link-layer address
of the sender of the packet. We will discuss addressing shortly.
❑ Type. This field defines the upper-layer protocol whose packet is encapsulated in
the frame. This protocol can be IP, ARP, OSPF, and so on. In other words, it serves
the same purpose as the protocol field in a datagram and the port number in a seg-
ment or user datagram. It is used for multiplexing and demultiplexing.
❑ Data. This field carries data encapsulated from the upper-layer protocols. It is a
minimum of 46 and a maximum of 1500 bytes. We discuss the reason for these
minimum and maximum values shortly. If the data coming from the upper layer is
more than 1500 bytes, it should be fragmented and encapsulated in more than one
frame. If it is less than 46 bytes, it needs to be padded with extra 0s. A padded
data frame is delivered to the upper-layer protocol as it is (without removing the
padding), which means that it is the responsibility of the upper layer to remove
or, in the case of the sender, to add the padding. The upper-layer protocol needs
to know the length of its data. For example, a datagram has a field that defines the
length of the data.
❑ CRC. The last field contains error detection information, in this case a CRC-32. The
CRC is calculated over the address, type, and data fields. If the receiver calculates
the CRC and finds that it is not zero (corruption in transmission), it discards the frame.
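The field layout above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a reference implementation: the helper names (mac_bytes, build_frame, frame_is_intact) and the source address are hypothetical, the preamble and SFD are left out because they are added at the physical layer, and the trailer byte order is an assumption. The receiver here recomputes the CRC-32 and compares it with the trailer, which is one common way to express the check.

```python
import struct
import zlib

MIN_DATA = 46    # minimum data field size in bytes (from the text)
MAX_DATA = 1500  # maximum data field size in bytes (from the text)


def mac_bytes(address: str) -> bytes:
    """Convert colon-separated hexadecimal notation to 6 raw bytes."""
    return bytes(int(part, 16) for part in address.split(":"))


def build_frame(dst: str, src: str, ether_type: int, data: bytes) -> bytes:
    """Assemble DA, SA, Type, Data (padded with 0s) and a CRC-32 trailer."""
    if len(data) > MAX_DATA:
        raise ValueError("data longer than 1500 bytes must be fragmented")
    padded = data.ljust(MIN_DATA, b"\x00")  # pad short data up to 46 bytes
    body = mac_bytes(dst) + mac_bytes(src) + struct.pack("!H", ether_type) + padded
    crc = zlib.crc32(body) & 0xFFFFFFFF     # CRC over addresses, type, and data
    # Assumption: the 4-byte trailer is stored least significant byte first.
    return body + struct.pack("<I", crc)


def frame_is_intact(frame: bytes) -> bool:
    """Recompute the CRC-32 and compare it with the trailer."""
    body, trailer = frame[:-4], frame[-4:]
    return (zlib.crc32(body) & 0xFFFFFFFF) == struct.unpack("<I", trailer)[0]


# 0x0800 is the type value for IPv4; the source address is made up.
frame = build_frame("4A:30:10:21:10:1A", "02:00:00:00:00:01", 0x0800, b"hello")
corrupted = bytes([frame[0] ^ 0x01]) + frame[1:]  # flip one bit in transit
print(len(frame), frame_is_intact(frame), frame_is_intact(corrupted))
```

A 5-byte payload is padded to 46 bytes, so the frame comes out at the 64-byte minimum, and the single flipped bit makes the check fail.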
Frame Length
Ethernet has imposed restrictions on both the minimum and maximum lengths of a frame.
The minimum length restriction is required for the correct operation of CSMA/CD, as
we will see shortly. An Ethernet frame needs to have a minimum length of 512 bits or
64 bytes. Part of this length is the header and the trailer. If we count 18 bytes of header
and trailer (6 bytes of source address, 6 bytes of destination address, 2 bytes of length
or type, and 4 bytes of CRC), then the minimum length of data from the upper layer is
64 − 18 = 46 bytes. If the upper-layer packet is less than 46 bytes, padding is added to
make up the difference.
The standard defines the maximum length of a frame (without preamble and SFD
field) as 1518 bytes. If we subtract the 18 bytes of header and trailer, the maximum
length of the payload is 1500 bytes. The maximum length restriction has two historical
reasons. First, memory was very expensive when Ethernet was designed; a maximum
length restriction helped to reduce the size of the buffer. Second, the maximum length
restriction prevents one station from monopolizing the shared medium, blocking other
stations that have data to send.
13.2.2 Addressing
Each station on an Ethernet network (such as a PC, workstation, or printer) has its own
network interface card (NIC). The NIC fits inside the station and provides the station
with a link-layer address. The Ethernet address is 6 bytes (48 bits), normally written in
hexadecimal notation, with a colon between the bytes. For example, the following
shows an Ethernet MAC address:
4A:30:10:21:10:1A
Example 13.1
Show how the address 47:20:1B:2E:08:EE is sent out on the line.
Solution
The address is sent left to right, byte by byte; for each byte, it is sent right to left, bit by bit, as
shown below:
Hexadecimal     47       20       1B       2E       08       EE
Binary          01000111 00100000 00011011 00101110 00001000 11101110
Transmitted ←   11100010 00000100 11011000 01110100 00010000 01110111
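The bit ordering in this example can be reproduced with a short Python sketch. The function name transmitted_bits is hypothetical; it simply reverses the bits of each byte while keeping the bytes in left-to-right order.

```python
def transmitted_bits(address: str) -> list[str]:
    """Return each byte of the address as the bit string seen on the line.

    Bytes go out left to right; within a byte the least significant bit
    goes first, so each 8-bit pattern is reversed.
    """
    return [format(int(part, 16), "08b")[::-1] for part in address.split(":")]


print(" ".join(transmitted_bits("47:20:1B:2E:08:EE")))
```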
[Figure: unicast and multicast addresses across Byte 1, Byte 2, ..., Byte 6; the marked bit is 0 for unicast and 1 for multicast]
multicast address: the recipients are all the stations on the LAN. A broadcast destina-
tion address is forty-eight 1s.
Example 13.2
Define the type of the following destination addresses:
a. 4A:30:10:21:10:1A
b. 47:20:1B:2E:08:EE
c. FF:FF:FF:FF:FF:FF
Solution
To find the type of the address, we need to look at the second hexadecimal digit from the left. If it
is even, the address is unicast. If it is odd, the address is multicast. If all digits are Fs, the address
is broadcast. Therefore, we have the following:
a. This is a unicast address because A in binary is 1010 (even).
b. This is a multicast address because 7 in binary is 0111 (odd).
c. This is a broadcast address because all digits are Fs in hexadecimal.
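The rule used in this solution can be written as a small Python function. The name address_type is hypothetical. Testing whether the second hexadecimal digit is odd is the same as testing the least significant bit of the first byte, which is what the sketch does.

```python
def address_type(address: str) -> str:
    """Classify a destination address as unicast, multicast, or broadcast."""
    octets = [int(part, 16) for part in address.split(":")]
    if all(octet == 0xFF for octet in octets):
        return "broadcast"        # forty-eight 1s
    # Odd second hex digit <=> least significant bit of the first byte is 1.
    return "multicast" if octets[0] & 1 else "unicast"


for addr in ("4A:30:10:21:10:1A", "47:20:1B:2E:08:EE", "FF:FF:FF:FF:FF:FF"):
    print(addr, address_type(addr))
```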
Distinguish Between Unicast, Multicast, and Broadcast Transmission
Standard Ethernet uses a coaxial cable (bus topology) or a set of twisted-pair cables
with a hub (star topology) as shown in Figure 13.5.
We need to know that transmission in the standard Ethernet is always broadcast, no
matter if the intention is unicast, multicast, or broadcast. In the bus topology, when sta-
tion A sends a frame to station B, all stations will receive it. In the star topology, when
station A sends a frame to station B, the hub will receive it. Since the hub is a passive
element, it does not check the destination address of the frame; it regenerates the bits (if
they have been weakened) and sends them to all stations except station A. In fact, it
floods the network with the frame.
The question is, then, how the actual unicast, multicast, and broadcast transmis-
sions are distinguished from each other. The answer is in the way the frames are kept or
dropped.
❑ In a unicast transmission, all stations will receive the frame; the intended recipient
keeps and handles the frame, and the rest discard it.
❑ In a multicast transmission, all stations will receive the frame; the stations that are
members of the group keep and handle it, and the rest discard it.
[Figure 13.5: Standard Ethernet topologies: stations tapped onto a coaxial cable with cable ends (bus), and stations connected to a hub by twisted-pair cable (star)]
❑ In a broadcast transmission, all stations (except the sender) will receive the frame
and all stations (except the sender) keep and handle it.
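The keep-or-discard decision described in the three cases above can be sketched as a receive filter. This is a simplified illustration: the function name keeps_frame and the idea of passing group membership as a collection of addresses are assumptions made for the example.

```python
BROADCAST = "FF:FF:FF:FF:FF:FF"


def keeps_frame(destination: str, own_address: str, groups=()) -> bool:
    """Decide whether a station keeps a frame that reached it on the medium."""
    destination = destination.upper()
    if destination == BROADCAST:
        return True                                   # broadcast: everyone keeps it
    if destination == own_address.upper():
        return True                                   # unicast addressed to this station
    return destination in {g.upper() for g in groups}  # multicast group member


station = "4A:30:10:21:10:1A"
print(keeps_frame("4A:30:10:21:10:1A", station))                          # unicast to us
print(keeps_frame("47:20:1B:2E:08:EE", station, ["47:20:1B:2E:08:EE"]))   # our group
print(keeps_frame("47:20:1B:2E:08:EE", station))                          # not our group
```

Every station on the bus or hub runs the same check on every frame; only the outcome differs from station to station.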
a. Station A has sent 512 bits and no collision is sensed (the energy level did not
go above the regular energy level). The station is then sure that the frame will go
through and stops sensing the medium. Where does the number 512 bits come
from? If we consider the transmission rate of the Ethernet as 10 Mbps, this
means that it takes the station 512/(10 Mbps) = 51.2 μs to send out 512 bits.
With the speed of propagation in a cable (2 × 10⁸ m/s), the first bit could
have gone 10,240 meters one way in that time, or it could have traveled 5120
meters, collided with a bit from the last station on the cable, and traveled back
(a round trip over a 5120-meter cable). In other
words, if a collision were to occur, it should occur by the time the sender has
sent out 512 bits (worst case) and the first bit has made a round trip over 5120
meters of cable. We should know that if the collision happens in the middle of the cable,
not at the end, station A hears the collision earlier and aborts the transmission.
We also need to mention another issue. The above assumption is that the length
of the cable is 5120 meters. The designer of the standard Ethernet actually put a
restriction of 2500 meters because we need to consider the delays encountered
throughout the journey. It means that they considered the worst case. The whole
idea is that if station A does not sense the collision before sending 512 bits,
there must have been no collision, because during this time, the first bit has
reached the end of the line and all other stations know that a station is sending
and refrain from sending. In other words, the problem occurs when another sta-
tion (for example, the last station) starts sending before the first bit of station A
has reached it. The other station mistakenly thinks that the line is free because
the first bit has not yet reached it. The reader should notice that the restriction of
512 bits actually helps the sending station: The sending station is certain that no
collision will occur if it is not heard during the first 512 bits, so it can discard
the copy of the frame in its buffer.
b. Station A has sensed a collision before sending 512 bits. This means that one of
the previous bits has collided with a bit sent by another station. In this case both
stations should refrain from sending and keep the frame in their buffer for
resending when the line becomes available. However, to inform other stations
that there is a collision in the network, the station sends a 48-bit jam signal. The
jam signal is to create enough signal (even if the collision happens after a few
bits) to alert other stations about the collision. After sending the jam signal, the
stations need to increment the value of K (number of attempts). If after the
increment K = 15, experience has shown that the network is too busy; the station
needs to abort its effort and try again. If K < 15, the station can wait a backoff
time (TB in Figure 12.13) and restart the process. As Figure 12.13 shows, the
station creates a random number between 0 and 2^K − 1, which means each time
the collision occurs, the range of the random number increases exponentially.
After the first collision (K = 1) the random number is in the range (0, 1). After
the second collision (K = 2) it is in the range (0, 1, 2, 3). After the third collision
(K = 3) it is in the range (0, 1, 2, 3, 4, 5, 6, 7). So after each collision, the proba-
bility increases that the backoff time becomes longer. This is due to the fact that
if the collision happens even after the third or fourth attempt, it means that the
network is really busy; a longer backoff time is needed.
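The numbers in cases a and b can be checked with a short Python sketch. It is only an illustration of the arithmetic and of the random backoff range: the function names are hypothetical, the propagation speed of 2 × 10⁸ m/s is the value used in the text, and returning None to signal "abort" is a choice made for the example.

```python
import random

K_ABORT = 15  # per the text, the station aborts when K reaches 15


def collision_window(rate_bps=10e6, min_frame_bits=512, speed_mps=2e8):
    """Return (time to send 512 bits, one-way distance, longest cable length)."""
    slot_time = min_frame_bits / rate_bps   # 51.2 microseconds at 10 Mbps
    one_way = slot_time * speed_mps         # distance covered by the first bit
    return slot_time, one_way, one_way / 2  # round trip halves the cable length


def backoff_slots(k, rng=random):
    """Pick the random number R in 0 .. 2^K - 1, or None if the station aborts."""
    if k >= K_ABORT:
        return None
    return rng.randint(0, 2 ** k - 1)


slot_time, one_way, cable = collision_window()
print(f"{slot_time * 1e6:.1f} us, {one_way:.0f} m one way, {cable:.0f} m cable")
for k in (1, 2, 3):
    print(k, "attempts -> R in 0 ..", 2 ** k - 1, "picked", backoff_slots(k))
```

The upper end of the range doubles with every attempt, which is why repeated collisions tend to produce longer waits.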
Efficiency = 1 / (1 + 6.4 × a)
in which the parameter “a” is the number of frames that can fit on the medium. It can
be calculated as a = (propagation delay)/(transmission delay) because the transmission
delay is the time it takes a frame of average size to be sent out and the propagation delay
is the time it takes to reach the end of the medium. Note that as the value of parameter a
decreases, the efficiency increases. This means that if the length of the media is shorter
or the frame size longer, the efficiency increases. In the ideal case, a = 0 and the
efficiency is 1. Problems at the end of the chapter ask you to calculate this efficiency.
Example 13.3
In the Standard Ethernet with the transmission rate of 10 Mbps, we assume that the length of the
medium is 2500 m and the size of the frame is 512 bits. The propagation speed of a signal in a
cable is normally 2 × 10⁸ m/s.
The example shows that a = 0.24, which means only 0.24 of a frame occupies the whole
medium in this case. The efficiency is 39 percent, which is considered moderate; it means that
61 percent of the time the medium is occupied but not used by a station.
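The figures quoted in this example can be reproduced with the following Python sketch, which applies a = (propagation delay)/(transmission delay) and Efficiency = 1/(1 + 6.4 × a). The function name ethernet_efficiency and its parameter names are hypothetical.

```python
def ethernet_efficiency(rate_bps, length_m, frame_bits, speed_mps=2e8):
    """Return (a, efficiency) for a Standard Ethernet segment."""
    propagation_delay = length_m / speed_mps    # time to reach the end of the medium
    transmission_delay = frame_bits / rate_bps  # time to send one frame
    a = propagation_delay / transmission_delay
    return a, 1 / (1 + 6.4 * a)


a, efficiency = ethernet_efficiency(rate_bps=10e6, length_m=2500, frame_bits=512)
print(f"a = {a:.2f}, efficiency = {efficiency:.0%}")
```

Here the propagation delay is 12.5 μs and the transmission delay is 51.2 μs, which gives a ≈ 0.24 and an efficiency of about 39 percent.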
13.2.5 Implementation
The Standard Ethernet defined several implementations, but only four of them
became popular during the 1980s. Table 13.1 shows a summary of Standard Ether-
net implementations.
Table 13.1 Summary of Standard Ethernet implementations
Implementation   Medium       Medium Length   Encoding
10Base5          Thick coax   500 m           Manchester
10Base2          Thin coax    185 m           Manchester
10Base-T         2 UTP        100 m           Manchester
10Base-F         2 Fiber      2000 m          Manchester
In the nomenclature 10BaseX, the number defines the data rate (10 Mbps), the
term Base means baseband (digital) signal, and X approximately defines either the
maximum size of the cable in 100 meters (for example 5 for 500 or 2 for 185 meters) or
the type of cable, T for unshielded twisted pair cable (UTP) and F for fiber-optic. The
standard Ethernet uses a baseband signal, which means that the bits are changed to a
digital signal and directly sent on the line.
[Figure: encoding in a Standard Ethernet implementation; labels: station, Manchester encoder, media, Manchester decoder]
[Figure: 10Base5 implementation; 10 Mbps, baseband (digital), 500 m; transceiver cable maximum 50 m; thick coaxial cable maximum 500 m with a cable end on each side]
[Figure: 10Base2 implementation; 10 Mbps, baseband (digital), 185 m; thin coaxial cable maximum 185 m with cable ends]
Note that the collision here occurs in the thin coaxial cable. This implementation is
more cost effective than 10Base5 because thin coaxial cable is less expensive than thick
coaxial and the tee connections are much cheaper than taps. Installation is simpler
because the thin coaxial cable is very flexible. However, the length of each segment
cannot exceed 185 m (close to 200 m) due to the high level of attenuation in thin coaxial
cable.
10Base-T: Twisted-Pair Ethernet
The third implementation is called 10Base-T or twisted-pair Ethernet. 10Base-T uses a
physical star topology. The stations are connected to a hub via two pairs of twisted
cable, as shown in Figure 13.9.
[Figure 13.9: 10Base-T implementation; stations connected to a 10Base-T hub]
Note that two pairs of twisted cable create two paths (one for sending and one for
receiving) between the station and the hub. Any collision here happens in the hub.
Compared to 10Base5 or 10Base2, we can see that the hub actually replaces the coaxial
cable as far as a collision is concerned. The maximum length of the twisted cable here
is defined as 100 m, to minimize the effect of attenuation in the twisted cable.
10Base-F: Fiber Ethernet
Although there are several types of optical fiber 10-Mbps Ethernet, the most common
is called 10Base-F. 10Base-F uses a star topology to connect stations to a hub. The sta-
tions are connected to the hub using two fiber-optic cables, as shown in Figure 13.10.
[Figure 13.10: 10Base-F implementation; 10 Mbps, baseband (digital), fiber; stations connected to a 10Base-F hub by two fiber-optic cables]
Wireless LANs
We discussed wired LANs and wired WANs in the two previous chapters. We
concentrate on wireless LANs in this chapter and wireless WANs in the next.
In this chapter, we cover two types of wireless LANs. The first is the wireless LAN
defined by the IEEE 802.11 project (sometimes called wireless Ethernet); the second is
a personal wireless LAN, Bluetooth, that is sometimes called personal area network or
PAN.
This chapter is divided into three sections:
❑ The first section introduces the general issues behind wireless LANs and compares
wired and wireless networks. The section describes the characteristics of the wire-
less networks and the way access is controlled in these types of networks.
❑ The second section discusses a wireless LAN defined by the IEEE 802.11 Project,
which is sometimes called wireless Ethernet. This section defines the architecture
of this type of LAN and describes the MAC sublayer, which uses the CSMA/CA
access method discussed in Chapter 12. The section then shows the addressing
mechanism used in this network and gives the format of different packets used at
the data-link layer. Finally, the section discusses different physical-layer protocols
that are used by this type of network.
❑ The third section discusses the Bluetooth technology as a personal area network
(PAN). The section describes the architecture of the network, the addressing mech-
anism, and the packet format. Different layers used in this protocol are also briefly
described and compared with the ones in the other wired and wireless LANs.
15.1 INTRODUCTION
Wireless communication is one of the fastest-growing technologies. The demand for
connecting devices without the use of cables is increasing everywhere. Wireless LANs
can be found on college campuses, in office buildings, and in many public areas. Before
we discuss a specific protocol related to wireless LANs, let us talk about them in
general.
to a wireless infrastructure network, or to another wireless LAN. The first situation is the
one that we discuss in this section: connection of a wireless LAN to a wired infrastructure
network. Figure 15.2 shows the two environments.
Figure 15.2 Connection of a wired LAN and a wireless LAN to other networks
[Figure labels: wired internet, switch, infrastructure, access point]
In this case, the wireless LAN is referred to as an infrastructure network, and the
connection to the wired infrastructure, such as the Internet, is done via a device called
an access point (AP). Note that the role of the access point is completely different from
the role of a link-layer switch in the wired environment. An access point glues two
different environments together: one wired and one wireless. Communication between
the AP and the wireless host occurs in a wireless environment; communication between
the AP and the infrastructure occurs in a wired environment.
Moving between Environments
The discussion above confirms what we learned in Chapters 2 and 9: a wired LAN or a
wireless LAN operates only in the lower two layers of the TCP/IP protocol suite. This
means that if we have a wired LAN in a building that is connected via a router or a
modem to the Internet, all we need in order to move from the wired environment to a
wireless environment is to change the network interface cards designed for wired envi-
ronments to the ones designed for wireless environments and replace the link-layer
switch with an access point. In this change, the link-layer addresses will change
(because of changing NICs), but the network-layer addresses (IP addresses) will remain
the same; we are moving from wired links to wireless links.
15.1.2 Characteristics
Several characteristics of wireless LANs either do not apply to wired LANs or are
negligible in them and can be ignored. We discuss some of
these characteristics here to pave the way for discussing wireless LAN protocols.
Attenuation
The strength of electromagnetic signals decreases rapidly because the signal disperses
in all directions; only a small portion of it reaches the receiver. The situation becomes
worse with mobile senders that operate on batteries and normally have small power
supplies.
Interference
Another issue is that a receiver may receive signals not only from the intended sender,
but also from other senders if they are using the same frequency band.
Multipath Propagation
A receiver may receive more than one signal from the same sender because electromag-
netic waves can be reflected back from obstacles such as walls, the ground, or objects.
The result is that the receiver receives some signals at different phases (because they
travel different paths). This makes the signal less recognizable.
Error
With the above characteristics of a wireless network, we can expect that errors and
error detection are more serious issues in a wireless network than in a wired network. If
we think about the error level as the measurement of signal-to-noise ratio (SNR), we
can better understand why error detection and error correction and retransmission are
more important in a wireless network. We discussed SNR in more detail in Chapter 3,
but it is enough to say that it measures the ratio of good stuff to bad stuff (signal to
noise). If SNR is high, it means that the signal is stronger than the noise (unwanted sig-
nal), so we may be able to convert the signal to actual data. On the other hand, when
SNR is low, it means that the signal is corrupted by the noise and the data cannot be
recovered.
duplex mode. Wireless hosts do not have enough power to do so (the power is
supplied by batteries). They can only send or receive at one time.
2. Because of the hidden station problem, in which a station may not be aware of
another station’s transmission due to some obstacles or range problems, collision
may occur but not be detected. Figure 15.3 shows an example of the hidden station
problem. Station B has a transmission range shown by the left oval (sphere in
space); every station in this range can hear any signal transmitted by station B.
Station C has a transmission range shown by the right oval (sphere in space); every
station located in this range can hear any signal transmitted by C. Station C is
[Figure 15.3: hidden station problem; stations A, B, and C, with the range of B and the range of C]