Recurrence and Ergodicity
Recurrence and Ergodicity
§5.1 Introduction
The aim of this lecture is to state Birkhoff’s Ergodic Theorem and to study
some of its applications in the context of ergodic transformations of a prob-
ability space. In particular we will apply it to the doubling map and to
the continued fraction map and deduce some results of a number-theoretic
nature. Birkhoff’s Ergodic Theorem can be viewed as giving us information
about rates of recurrence. Indeed, one application that we shall study is
Kac’s Lemma: given a set A of measure µ(A), the orbit of almost every point
of A eventually returns to A and the expected time of the first recurrence
is 1/µ(A). We begin by discussing Poincaré’s theorem: this gives us infor-
mation about the recurrence properties of an arbitrary measure-preserving
transformation.
1
MAGIC010 Ergodic Theory Lecture 5
∞ ∞
!
X [
µ(T −k F ) = µ T −k F ≤ µ(X) = 1.
k=0 k=0
Since the terms in the summation have the constant value µ(F ), we must
have µ(F ) = 0. 2
2
MAGIC010 Ergodic Theory Lecture 5
The following relations are all easy to prove, and indicate why the notation
was chosen in this way.
(ii) If ν1 , ν2 µ then
d(ν1 + ν2 ) dν1 dν2
= + .
dµ dµ dµ
(iii) If λ ν µ then
dλ dλ dν
= .
dµ dν dµ
3
MAGIC010 Ergodic Theory Lecture 5
I = {B ∈ B | T −1 B = B a.e.}.
4
MAGIC010 Ergodic Theory Lecture 5
in L2 .
Corollary 5.4
Let (X, B, µ) be a probability space and let T : X → X be an ergodic
measure-preserving transformation. Let f ∈ L2 (X, B, µ). Then
n−1 Z
1X j
fT → f dµ, as n → ∞,
n
j=0
in L2 .
5
MAGIC010 Ergodic Theory Lecture 5
µ̂{(xj )∞
j=0 | x0 ∈ B0 , . . . , xr ∈ Br } = µ(T
−r
B0 ∩ · · · ∩ Br )
6
MAGIC010 Ergodic Theory Lecture 5
Hence von Neumann’s Ergodic Theorem for T follows from von Neumann’s
Ergodic Theorem for T̂ .
for µ-a.e. x ∈ X.
Corollary 5.7
Let (X, B, µ) be a probability space and let T : X → X be an ergodic
measure-preserving transformation. Let f ∈ L1 (X, B, µ). Then
n−1 Z
1X
f (T j x) → f dµ, as n → ∞,
n
j=0
for µ-a.e. x ∈ X.
7
MAGIC010 Ergodic Theory Lecture 5
are small, and that return times to sets of small measure are large. This is
indeed the case, and forms the content of Kac’s Lemma.
Let T : X → X be a measure-preserving transformation of a probability
space (X, B, µ) and let A ⊂ X be a measurable subset with µ(A) > 0. By
Poincaré’s Recurrence Theorem, the integer
Proof. Let
An = A ∩ T −1 Ac ∩ · · · ∩ T −(n−1) Ac ∩ T −n A.
T T
T T T
A
A1 A2 A3 An
1 = µ(X)
8
MAGIC010 Ergodic Theory Lecture 5
∞ X
X
= k = 1n µ(An,k )
n=1
X∞
= nµ(An )
n=1
X∞ Z
= nA dµ
n=1 An
Z
= nA dµ.
A
Σ = {{(xj )∞
j=0 | xj ∈ {0, . . . , 100}, |xj − xj+1 | = 1 for j = 0, 1, 2, . . .}
9
MAGIC010 Ergodic Theory Lecture 5
If we have i balls in the first urn then at the next stage we must have either
i − 1 or i + 1 balls in the first urn. The number of balls becomes i − 1 if
the random number chosen is equal to the number of one of the balls in
the first urn. As there are currently i such balls, the probability of this
happening is i/100. Hence the conditional probability Pi,i−1 that there are
i − 1 balls remaining given that we started with i balls in the first urn is
i/100. Similarly, the conditional probability Pi,i+1 that there are i + 1 balls
in the first urn given that we started with i balls is (100 − i)/100. This
defines a stochastic matrix:
0 1 0 0 0 ···
1 99
0 ···
100 02 100 0
98
P =
0
100 0 100 0 · · ·
3 97
0
0 100 0 100 · · ·
.. .. .. .. .. ..
. . . . . .
Note that Pi,j 6= 0 if and only if Aij = 1 and so P and A are compatible. It is
straightforward to check that pP = p. Hence we have a Markov probability
measure µP defined on Σ. The matrix A is irreducible (but is not aperiodic);
this ensures that µP is ergodic.
Consider the cylinder A = [100] of length 1. The represents there being
100 balls in the first urn. By Poincaré’s Recurrence Theorem, if we start in
A then we return to A infinitely often. By Kac’s lemma, the expected first
return time to A is
1
= 2100 seconds,
µP (A)
which is about 4 × 1022 years, or about 3 × 1012 times the length of time
that the Universe has so far existed!
Remark This suggests that returning to a small set is a ‘rare’ event, and
as such the return times are likely to have a Poisson distribution. This
is normally formalised as follows. Let T be an ergodic measure-preserving
transformation of a probability space (X, B, µ) and let x ∈ X. Let An be a
decreasing sequence of subsets that decrease
R to x. Define the return time as
τ (x) = limn→∞ µ(An )τAn (x). Then τ dµ = 1. One would normally expect
τ to have a Poisson distribution, but this is only known in particular cases;
usually one needs some hyperbolicity of the dynamics T (such as being a
shift of finite type). Additionally, there are normally restrictions on the sets
An (such as requiring them to be cylinders).
10
MAGIC010 Ergodic Theory Lecture 5
This r-adic expansion is unique, unless the sequence (xj ) ends in either
infinitely repeated 0s or infinitely repeated (r − 1)s.
Proposition 5.9
Lebesgue almost every number in [0, 1] is normal.
Proof. Fix r ≥ 2. Then clearly all but a countable set of points has a
unique r-adic expansion. Fix k ∈ {0, 1, . . . , r − 1}. Then it is easy to see
that xj = k if and only if T j−1 x ∈ [k/r, (k + 1)/r). Thus
n−1
1 1X
card{1 ≤ j ≤ n | xj = k} = χ[k/r,(k+1)/r) (T j x).
n n
j=0
11
MAGIC010 Ergodic Theory Lecture 5
Proposition 5.10
For Lebesgue-almost every point x ∈ [0, 1], the arithmetic mean of the digits
occurring in the base r expansion of x is (r − 1)/2.
We leave this as an exercise.
Proposition 5.11
For Lebesgue-almost every x ∈ [0, 1], the frequency with which the natural
number k occurs in the continued fraction expansion of x is
(k + 1)2
1
log .
log 2 k(k + 2)
Proof. Let λ denote Lebesgue measure and let µ denote Gauss’ measure.
Then λ-a.e. and µ-a.e. x ∈ (0, 1) is irrational and has an infinite continued
fraction expansion
1
x= .
x0 + x + 1 1
1 1
x2 + x +···
3
12
MAGIC010 Ergodic Theory Lecture 5
Proposition 5.12
(i) For Lebesgue-almost every x ∈ [0, 1], the arithmetic mean of the digits
in the continued fraction expansion of x is infinite.
(ii) For Lebesgue-almost every x ∈ [0, 1], the geometric mean of the digits
in the continued fraction expansion of x is
∞ log k/ log 2
Y 1
1+ .
k 2 + 2k
k=1
Proof. Writing
1
x= 1 .
x0 + x1 + 1
1
x2 + x +···
3
almost everywhere.
We leave (5.3) as an exercise.
We prove (5.4). Define f (x) = log k for x ∈ (1/(k + 1), 1/k]. Then
n−1
1 1X
(log a0 + log a1 + · · · + an−1 ) = f (T j x)
n n
j=0
Z 1
1 f (x)
→ dx
log 2 0 1 + x
∞ Z
1 X 1/k log k
= dx
log 2 1/(k+1) 1+x
k=1
∞
X log k 1
= log 1 + 2
,
log 2 k + 2k
k=1
13
MAGIC010 Ergodic Theory Lecture 5
so we obtain that
f ≥ Fn − Fn ◦ T
on the set A = {x | Fn (x) > 0}.
Hence
Z Z Z
f dµ ≥ Fn dµ − Fn ◦ T dµ
A Z A ZA
= Fn dµ − Fn ◦ T dµ as Fn = 0 on X \ A
X A
Z Z
≥ Fn dµ − Fn ◦ T dµ as Fn ◦ T ≥ 0
X X
= 0 as µ is T -invariant.
2
Corollary 5.14
Let g ∈ L1 (X, B, µ) and let
n−1
1X
Mα = x ∈ X | sup g(T j x) > α .
n≥1 n j=0
14
MAGIC010 Ergodic Theory Lecture 5
(since fn (x) > 0 ⇒ Fn (x) > 0 and Fn (x) > 0 ⇒ fj (x) > 0 for some 1 ≤ j ≤
n). Write Cn = {x | Fn (x) > 0} and observe that Cn ⊂ Cn+1 . Thus χCn
converges to χBα and so f χCn converges to f χMα , as n → ∞. Furthermore,
|f χCn | ≤ |f |. Hence, by the Dominated Convergence Theorem,
Z Z Z Z
f dµ = f χCn dµ → f χMα dµ = f dµ, as n → ∞.
Cn X X Mα
Applying
R the Maximal RInequality, we have,R for all n ≥ 1 we have that
Cn f dµ ≥ 0. Therefore Mα f dµ ≥ 0, i.e., Bα g dµ ≥ αµ(Bα ).
For the general case, we work with the restriction of T to B, T : B → B,
and apply the Maximal Inequality on this subset to get
Z
g dµ ≥ αµ(Mα ∩ B),
Mα ∩B
as required. 2
Observe that
n+1 1
an+1 (x) = an (T x) + f (x).
n n
15
MAGIC010 Ergodic Theory Lecture 5
(i) f ∗ = f∗ µ-a.e
(ii) f ∗ ∈ L1 (X, B, µ)
(iii) f ∗ dµ = f dµ.
R R
Note that [
{x ∈ X | f∗ (x) < f ∗ (x)} = Eα,β
β<α, α,β∈Q
Therefore
αµ(Eα,β ) ≤ βµ(Eα,β )
and since β < α this shows that µ(Eα,β ) = 0. Thus f ∗ = f∗ µ-a.e. and
n−1
1X
lim f (T j x) = f ∗ (x) µ-a.e.
n→∞ n
j=0
16
MAGIC010 Ergodic Theory Lecture 5
Then gn ≥ 0 and Z Z
gn dµ ≤ |f | dµ
Dkn ∩ M k −ε = Dkn .
n
Thus Z Z
k+1 1
f ∗ dµ ≤ µ(Dkn ) ≤ µ(Dkn ) + f dµ
Dkn n n Dkn
(where the first inequality follows from the definition of Dkn ). Since
[
X= Dkn
k∈Z
17
MAGIC010 Ergodic Theory Lecture 5
so that Z Z Z
∗
f dµ = f∗ dµ ≥ f dµ.
Therefore Z Z
∗
f dµ = f dµ,
as required.
Finally, we prove that f ∗ = E(f | I). First note that as f ∗ is T -invariant,
it is measurable with respect to I. Moreover, if I is any T -invariant set then
Z Z
f dµ = f ∗ dµ.
I I
§5.9 References
Most of the material in this lecture is standard in ergodic theory. The presen-
tation of Ehrenfests’ example (originally an example in statistical mechanics)
is taken from
§5.10 Exercises
Exercise 5.1
Construct an example to show that Poincaré’s recurrence theorem does not
hold on infinite measure spaces. (Recall that a measure space (X, B, µ) is
infinite if µ(X) = ∞.)
Exercise 5.2
Prove Proposition 5.10: For Lebesgue-almost every point x ∈ [0, 1], the
arithmetic mean of the digits occurring in the base r expansion of x is
(r − 1)/2.
18
MAGIC010 Ergodic Theory Lecture 5
Exercise 5.3
(i) Let T : X → X be an ergodic measure-preserving transformation of
a probability Rspace (X, B, µ). Let f : X → R be measurable, and
suppose that f dµ = ∞. Prove that
n−1
1X
lim f (T j x) = ∞
n→∞ n
j=0
Exercise 5.4
Let B be a Banach space and U a bounded linear operator of B such that
supk kU k k < ∞. Prove that the following are equivalent:
n−1
1X j
(i) U v converges in norm;
n
j=0
n−1
1X j
(ii) U v has a limit point in the weak topology,
n
j=0
(iii) U has a fixed point in the weakly closed convex hull of {U n v} (i.e. the
smallest weakly closed convex set that contains all the U n v).
in Lp .
19