
Channel Capacity and the Channel Coding Theorem, Part I
Information Theory 2013
Lecture 4

Michael Roth

April 24, 2013


Outline

This lecture will cover


• Fano’s inequality.
• channel capacity and some channel models.
• a preview of the channel coding theorem.
• the tools that are needed to establish the channel coding theorem.

All illustrations are borrowed from the book.


Fano’s inequality
Estimate X from Y. Relate the error in guessing X to H(X|Y).
We know that H(X|Y) = 0 if X = g(Y) (Problem 2.5) → can estimate X with zero error probability. Extension: H(X|Y) “small” → can estimate X with low error probability.
Formally: X has p(x), Y is related via p(y|x), estimate X̂ = g(Y) with alphabet X̂, error probability Pe = Pr{X̂ ≠ X}.

Fano’s inequality: For X → Y → X̂,

H(Pe) + Pe log|X| ≥ H(X|X̂) ≥ H(X|Y).

Weaker: 1 + Pe log|X| ≥ H(X|Y), or

Pe ≥ (H(X|Y) − 1) / log|X|.
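As a numerical aside (not from the slides), the weaker bound can be evaluated directly; the function name and the example numbers below are illustrative assumptions.

import math

def fano_lower_bound(h_x_given_y: float, alphabet_size: int) -> float:
    """Weaker Fano bound: Pe >= (H(X|Y) - 1) / log2|X|, clipped at 0."""
    return max(0.0, (h_x_given_y - 1.0) / math.log2(alphabet_size))

# Assumed example: |X| = 16 and H(X|Y) = 2.5 bits.
print(fano_lower_bound(2.5, 16))   # 0.375: any estimator errs at least 37.5% of the time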
Motivation and preview
A communicates with B: A induces a state in B. Physical process gives rise to noise.
Mathematical analog: source W, transmitted sequence X^n, etc.

[Block diagram: Message W → Encoder → X^n → Channel p(y|x) → Y^n → Decoder → Ŵ (estimate of message)]

Two X^n may give the same Y^n: inputs confusable.

Idea: use only a subset of all possible X^n such that there is, with high probability, only one likely X^n to result in each Y^n.

Map W into “widely spaced” X^n. Then Ŵ = W with high probability.

Channel capacity: maximum rate (source bits/channel use) at which we can carry out the above steps.
Channel capacity

Discrete channel: input alphabet X, output alphabet Y, probability transition matrix p(y|x).
Memoryless channel: the current output depends only on the current input, conditionally independent of previous inputs or outputs.
“Information” channel capacity of a discrete memoryless channel is

C = max_{p(x)} I(X; Y).

Shannon’s channel coding theorem: C is the highest rate (bits per channel use) at which information can be sent with arbitrarily low probability of error.
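The slides give no algorithm for computing C; a standard numerical route is the Blahut–Arimoto iteration. Below is a minimal sketch of it (my own code, not from the lecture); all function and variable names are mine, and the example reuses the 3×3 symmetric transition matrix that appears later in this lecture.

import numpy as np

def _rows_rel_entropy_bits(P, q):
    """D(p(y|x) || q(y)) for each row x of P, in bits (0 log 0 := 0)."""
    d = np.zeros(P.shape[0])
    for x in range(P.shape[0]):
        nz = P[x] > 0
        d[x] = np.sum(P[x, nz] * np.log2(P[x, nz] / q[nz]))
    return d

def blahut_arimoto(P, n_iter=200):
    """Approximate C = max_{p(x)} I(X;Y) in bits for a transition matrix P[x, y] = p(y|x)."""
    m = P.shape[0]
    p = np.full(m, 1.0 / m)              # start from the uniform input distribution
    for _ in range(n_iter):
        q = p @ P                        # induced output distribution p(y)
        d = _rows_rel_entropy_bits(P, q)
        w = p * np.exp2(d)               # Blahut-Arimoto reweighting of the inputs
        p = w / w.sum()
    q = p @ P
    return float(p @ _rows_rel_entropy_bits(P, q)), p

# Example: the 3x3 symmetric channel used later in the lecture.
P = np.array([[0.3, 0.2, 0.5],
              [0.5, 0.3, 0.2],
              [0.2, 0.5, 0.3]])
C, p_opt = blahut_arimoto(P)
print(C, p_opt)   # ~0.0995 bits (= log2(3) - H(0.3, 0.2, 0.5)), optimal p(x) uniform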
Some channels I
Noiseless binary channel
[Diagram: 0 → 0 and 1 → 1, no crossover]
• I(X; Y) = H(X) − H(X|Y) = H(X).
• C = 1, achieved for uniform X.

Noisy channel with nonoverlapping outputs
[Diagram: two inputs, four outputs; each input reaches its own pair of outputs, with probabilities 1/2, 1/2 and 1/3, 2/3]
• output random, but input uniquely determined.
• C = 1, achieved for uniform X.
Some channels II
[Figure: left panel “Noisy channel” (each input letter A, B, C, … maps to itself or to the next letter, w.p. 1/2 each); right panel “Noiseless subset of inputs”]

Noisy typewriter
• input either unchanged or shifted (both w.p. 1/2).
• use of every second input: log 13 bits per transmission without error.
• I(X; Y) = H(Y) − H(Y|X) = H(Y) − H(1/2, 1/2) = H(Y) − 1.
• C = max I(X; Y) = log 26 − 1 = log 13 (verified numerically below).
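As a usage example of the blahut_arimoto sketch above (again my own addition), the 26×26 noisy-typewriter transition matrix confirms C = log 13 ≈ 3.70 bits numerically.

import numpy as np

# Noisy typewriter: each of 26 inputs stays or shifts to the next letter, w.p. 1/2 each.
P = np.zeros((26, 26))
for x in range(26):
    P[x, x] = 0.5
    P[x, (x + 1) % 26] = 0.5

C, _ = blahut_arimoto(P)        # function from the sketch above
print(C, np.log2(13))           # both ~3.7004 bits per transmission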
Some channels III
Binary symmetric channel
[Diagram: 0 → 0 and 1 → 1 w.p. 1 − p; crossovers 0 → 1 and 1 → 0 w.p. p]
• simplest channel with errors.
• probability of switched input is p.
• “all received bits unreliable”.
• C = 1 − H(p), achieved for uniform X.

I(X; Y) = H(Y) − H(Y|X)
        = H(Y) − Σ_x p(x) H(Y|X = x)
        = H(Y) − Σ_x p(x) H(p)
        = H(Y) − H(p)
        ≤ 1 − H(p).

Reminder: H(p) = −p log p − (1 − p) log(1 − p).
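A small sketch of my own for the closed-form result: binary entropy and C = 1 − H(p) for a few crossover probabilities.

import math

def binary_entropy(p: float) -> float:
    """H(p) = -p log2 p - (1 - p) log2(1 - p), with H(0) = H(1) = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

for p in (0.0, 0.1, 0.5):
    print(p, 1 - binary_entropy(p))   # C = 1, ~0.531, and 0 bits respectively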


Some channels IV
Binary erasure channel
[Diagram: 0 → 0 and 1 → 1 w.p. 1 − α; 0 → e and 1 → e w.p. α]
• bits are lost rather than corrupted.
• a fraction α are erased.
• e: the receiver knows that it does not know.
• I(X; Y) = H(Y) − H(Y|X) = H(Y) − H(α).
• C = 1 − α.
• feedback discussion and surprising fact.

Introduce E with E = 1 if Y = e. Let π = Pr{X = 1}. Then

H(Y) = H(Y, E) = H(E) + H(Y|E)
     = H(α) + (1 − α)H(π)

and I(X; Y) = (1 − α)H(π) yields C = 1 − α for π = 1/2.
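To see the maximization step numerically (my own sketch, with α = 0.3 as an assumed erasure probability), scan I(X; Y) = (1 − α)H(π) over the input bias π:

import numpy as np

alpha = 0.3                               # assumed erasure probability
pi = np.linspace(0.001, 0.999, 999)       # candidate input biases Pr{X = 1}
H = -pi * np.log2(pi) - (1 - pi) * np.log2(1 - pi)
I = (1 - alpha) * H                       # mutual information for each bias
print(pi[np.argmax(I)], I.max())          # maximum ~0.7 = 1 - alpha, attained at pi = 0.5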


Symmetric channels I

Transmission matrix. Example for X = Y = {0, 1, 2}:

           [ 0.3  0.2  0.5 ]
p(y|x) =   [ 0.5  0.3  0.2 ]
           [ 0.2  0.5  0.3 ]

Pr{Y = 1|X = 0} = 0.2. Rows must add up to 1.

This is a symmetric channel: row 1 is a permutation of row 2. Other rows and columns are permutations too.
Let r be one row in p(y|x). Then

I(X; Y) = H(Y) − H(Y|X) = H(Y) − H(r) ≤ log|Y| − H(r).


Symmetric channels II
I(X; Y) maximized for uniform Y. Achieved by uniform X:

p(y) = Σ_{x∈X} p(y|x) p(x) = (1/|X|) Σ_{x∈X} p(y|x) = c/|X|,

with c the sum over one column.

Generalization: each row is a permutation of every other row, and all column sums are equal. Example:

           [ 1/3  1/6  1/2 ]
p(y|x) =   [ 1/3  1/2  1/6 ]

Channel capacity for weakly symmetric channels is

C = log|Y| − H(r).
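A short numerical check (not in the slides) of C = log|Y| − H(r) for the weakly symmetric example above; the ≈ 0.126-bit value is my own computation.

import numpy as np

P = np.array([[1/3, 1/6, 1/2],
              [1/3, 1/2, 1/6]])     # weakly symmetric example from the slide

r = P[0]                            # any row; rows are permutations of each other
H_r = -np.sum(r * np.log2(r))
C = np.log2(P.shape[1]) - H_r       # log|Y| - H(r)
print(C)                            # ~0.126 bits per channel use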


Properties of channel capacity

Properties:
• C ≥ 0, since I(X; Y) ≥ 0.
• C ≤ log|X| and C ≤ log|Y|.
• I(X; Y) is a continuous function of p(x).
• I(X; Y) is concave in p(x).
Consequences:
• maximum exists and is finite.
• convex optimization tools can be employed.
Preview of the channel coding theorem

[Figure: typical X^n sequences mapped into nearly disjoint sets of typical Y^n sequences]
Intuitive idea:
• for large block lengths every channel looks like the noisy typewriter.
• one (typical) input sequence gives ≈ 2^{nH(Y|X)} output sequences.
• the total number of (typical) output sequences ≈ 2^{nH(Y)} must be divided into sets of size 2^{nH(Y|X)}.
• total number of disjoint sets ≤ 2^{n(H(Y)−H(Y|X))} = 2^{nI(X;Y)}.
• can send at most 2^{nI(X;Y)} distinguishable sequences of length n.
• channel capacity as the log of the maximum number of distinguishable sequences (a numeric illustration follows below).
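To make the counting concrete (my own numbers, assuming a binary symmetric channel with p = 0.1 and block length n = 100): about 2^{nI} ≈ 2^{53} inputs can be kept distinguishable.

import math

def binary_entropy(p):
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

n, p = 100, 0.1                     # assumed block length and crossover probability
I = 1 - binary_entropy(p)           # I(X;Y) for a uniform input on the BSC
print(n * I)                        # exponent ~53.1, i.e. roughly 2**53 distinguishable inputs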
Definitions I
[Block diagram: Message W → Encoder → X^n → Channel p(y|x) → Y^n → Decoder → Ŵ (estimate of message)]

• discrete channel: (X, p(y|x), Y).
• nth extension of the discrete memoryless channel: (X^n, p(y^n|x^n), Y^n) with p(y_k|x^k, y^{k−1}) = p(y_k|x_k).
• no feedback: p(y^n|x^n) = Π_{i=1}^{n} p(y_i|x_i) (default case in the book).
• (M, n) code for (X, p(y|x), Y):
  1. index set {1, 2, . . . , M}.
  2. encoding function X^n: {1, 2, . . . , M} → X^n with codewords x^n(1), . . . , x^n(M). All codewords form the codebook.
  3. decoding function g: Y^n → {1, 2, . . . , M}.
Definitions II

• conditional prob. of error: λ_i = Pr{g(Y^n) ≠ i | X^n = x^n(i)}.
• maximal prob. of error: λ^(n) = max_{i∈{1,...,M}} λ_i.
• average prob. of error for an (M, n) code: P_e^(n) = (1/M) Σ_{i=1}^{M} λ_i.
• rate of an (M, n) code: R = log(M)/n bits per transmission.
• rate R is achievable if there exists a sequence of (⌈2^{nR}⌉, n) codes such that λ^(n) → 0 as n → ∞.
• capacity is the supremum of all achievable rates.
(A toy (M, n) code illustrating these definitions is sketched below.)
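A minimal sketch, of my own making, of the objects just defined: the (M, n) = (2, 3) repetition code over a binary symmetric channel, its rate, and a Monte Carlo estimate of the average probability of error. All names and the value p = 0.1 are illustrative assumptions.

import math, random

codebook = {1: (0, 0, 0), 2: (1, 1, 1)}   # encoding function x^n(i) for M = 2, n = 3
M, n = len(codebook), 3
R = math.log2(M) / n                      # rate: 1/3 bit per transmission

def channel(xn, p=0.1):
    """BSC: flip each bit independently with probability p (assumed value)."""
    return tuple(x ^ (random.random() < p) for x in xn)

def decode(yn):
    """Majority vote = the decoding function g(y^n) for this codebook."""
    return 2 if sum(yn) >= 2 else 1

trials, errors = 100_000, 0
for _ in range(trials):
    w = random.randint(1, 2)              # uniformly chosen message index
    if decode(channel(codebook[w])) != w:
        errors += 1
print(R, errors / trials)                 # rate 1/3; error prob ~0.028 = 3p^2 - 2p^3 for p = 0.1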
Jointly typical sequences I

Idea: decode Y^n as index i if X^n(i) is jointly typical with Y^n.

The set A_ε^(n) of jointly typical sequences {(x^n, y^n)} w.r.t. p(x, y) is given by

A_ε^(n) = { (x^n, y^n) ∈ X^n × Y^n :
    |−(1/n) log p(x^n) − H(X)| < ε,
    |−(1/n) log p(y^n) − H(Y)| < ε,
    |−(1/n) log p(x^n, y^n) − H(X, Y)| < ε },

where p(x^n, y^n) = Π_{i=1}^{n} p(x_i, y_i).
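A sketch (not from the slides) of the three membership tests above for finite alphabets; the joint pmf p_xy and all names are assumed inputs, and the code assumes strictly positive probabilities so the logarithms stay finite.

import numpy as np

def in_jointly_typical_set(xn, yn, p_xy, eps):
    """Check the three conditions defining A_eps^(n) for integer-valued sequences."""
    n = len(xn)
    p_x, p_y = p_xy.sum(axis=1), p_xy.sum(axis=0)      # marginals
    H_X = -np.sum(p_x * np.log2(p_x))
    H_Y = -np.sum(p_y * np.log2(p_y))
    H_XY = -np.sum(p_xy * np.log2(p_xy))
    lp_x = np.sum(np.log2(p_x[list(xn)]))              # log p(x^n)
    lp_y = np.sum(np.log2(p_y[list(yn)]))              # log p(y^n)
    lp_xy = np.sum(np.log2(p_xy[list(xn), list(yn)]))  # log p(x^n, y^n)
    return (abs(-lp_x / n - H_X) < eps and
            abs(-lp_y / n - H_Y) < eps and
            abs(-lp_xy / n - H_XY) < eps)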
Jointly typical sequences II
Joint AEP: Let (X^n, Y^n) be sequences of length n drawn i.i.d. from p(x^n, y^n). Then:

1. Pr{(X^n, Y^n) ∈ A_ε^(n)} → 1 as n → ∞.
2. |A_ε^(n)| ≤ 2^{n(H(X,Y)+ε)}.
3. Pr{(X̃^n, Ỹ^n) ∈ A_ε^(n)} ≤ 2^{−n(I(X;Y)−3ε)} for (X̃^n, Ỹ^n) ∼ p(x^n)p(y^n).

[Figure: grid of x^n (rows) versus y^n (columns) with the jointly typical pairs marked as dots]
• 2^{nH(X)} typical X sequences.
• 2^{nH(Y)} typical Y sequences.
• only 2^{nH(X,Y)} jointly typical sequences.
• one in 2^{nI(X;Y)} pairs is jointly typical (illustrated numerically below).
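As a final illustrative sketch of my own (reusing in_jointly_typical_set from above with an assumed joint pmf), jointly drawn pairs land in A_ε^(n) with probability close to 1, while independently drawn pairs essentially never do:

import numpy as np

rng = np.random.default_rng(0)
p_xy = np.array([[0.45, 0.05],            # assumed joint pmf: uniform input through a BSC(0.1)
                 [0.05, 0.45]])
p_x, p_y = p_xy.sum(axis=1), p_xy.sum(axis=0)
n, eps, trials = 200, 0.2, 2000

joint_hits = indep_hits = 0
for _ in range(trials):
    idx = rng.choice(4, size=n, p=p_xy.flatten())      # draw (X^n, Y^n) jointly
    xj, yj = idx // 2, idx % 2
    joint_hits += in_jointly_typical_set(xj, yj, p_xy, eps)
    xi = rng.choice(2, size=n, p=p_x)                  # draw X~^n and Y~^n independently
    yi = rng.choice(2, size=n, p=p_y)
    indep_hits += in_jointly_typical_set(xi, yi, p_xy, eps)
print(joint_hits / trials, indep_hits / trials)        # close to 1 vs. close to 0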
