Ujhandout
Ujhandout
Ujhandout
András Gyárfás
2011 FALL
INTRODUCTION
This handout is based on the Advanced Combinatorics course I have taught through
many years at Budapest Semesters. It contains a basic material covered in each semester
(sections 1-5) and two further parts (fruit salad and advanced menu) from which I have
selected material according to the taste and appetite of the students.
The objective of the course is to introduce hypergraphs together with widely appli-
cable proof methods of combinatorial mathematics.
The material is self contained, apart from basic facts of linear algebra. However,
at least one elementary course - for example counting methods, graph theory, discrete
structures - is strongly recommended.
Each section is supplemented with a set of easy questions (self-test) to check the
understanding of the material and with an exercise set.
At certain places the presentation of the material follows closely the following sources:
Babai, Frankl: Linear Algebra Methods in Combinatorics.
Graham, Rothchild, Spencer: Ramsey Theory.
Spencer: Ten Lectures on the Probability Method.
Van Lint, Wilson: A course in Combinatorics.
Acknowledgement. First of all I express my thanks to the active students of this
course through the years from 1988 to the present time. They understood my English,
read my handwritten notes, found misprints, caught errors, invented new proofs, some
of them even found new results that eventually deserve to be published (or already have
been published). The support of the organizers of the Budapest Semesters (in America
and in Budapest) is also acknowledged together with the help of my home institute,
SZTAKI (Computer and Automation Research Institute of the Hungarian Academy of
Sciences). Thanks to Péter Juhász for implementing the figures.
1
CONTENT
2.2. Graphs from the Hall of Fame. (Graphs of Zykov, Mycielski, Tutte, Shift
graph, Kneser graph.)
2
4. COUNTING AND PROBABILITY
4.2. Probability method. (The Erdős lower bound on R(n), Lower bound for W (k),
Tournaments, Large transitive subtournaments, Existence versus construction, Paradox-
ical tournaments, Hamiltonian paths.)
4.3. Local Lemma. (Erdős-Lovász theorem, Improved lower bound on W (k), Even
cycles in regular digraphs.)
5.1. The dimension bound. (Oddtown, Fisher inequality, A cubic lower bound for
R(n), Two-distance sets, Cross-intersecting hypergraphs.)
6. FRUIT SALAD
Distinct representatives, symmetric chain decomposition, properties of hypergraphs with
n vertices and n, n+1, n+2 edges, critical not 2-colorable hypergraphs, sunflower theorem,
sum-free subsets of numbers.
7. ADVANCED MENU
Baranyai theorem, normal hypergraphs and perfect graphs, constructive super polynomial
lower bound for Ramsey numbers, Bursuk conjecture.
3
1 BASIC NOTIONS AND EXAMPLES
1.1 DEFINITIONS
Hypergraph: H = (V, E) where V is a set and E is a collection of subsets of V . The
set V is called the vertex set and E is called the edge set of the hypergraph H . The
word “collection” is used to allow selection of the same subset of V more than once, if
this happens we have multiedges.
Simple hypergraph: no multiedges.
What can be empty? To be precise, there are some details to discuss. Do we allow
that V = ∅ ? Frank Harary has a paper with the pun title: “Is the null-graph a pointless
concept?” Do we allow E = ∅ ? What if V, E are nonempty but e = ∅ for some e ∈ E
? We shall assume that V and E are nonempty but allow empty edges for technical
reasons.
Singletons, isolated vertices. An edge e ∈ E is called singleton if |e| = 1. A vertex
is isolated if no edge contains it. We allow singleton edges and isolated vertices (unless
stated otherwise).
Finite or infinite? In most cases we shall work with finite hypergraphs. However,
sometimes it is natural to consider hypergraphs with |V | = ∞, for example V = Rd , but
usually we shall still assume |E| < ∞.
Degree of v ∈ V : The number of edges containing vertex v (in a hypergraph H = (V, E)
). Notation for the degree: d(v) or dH (v) if not clear from the context.
Maximum and Minimum degree: ∆(H) denotes the largest and δ(H) denotes the
smallest among the degrees of the vertices of H.
Regular hypergraph: d(x) = d(y) for all x, y ∈ V .
t-regular hypergraph: regular hypergraph with common degree t.
Uniform hypergraph: |e| = |f | for all e, f ∈ E.
t-uniform hypergraph: uniform hypergraph with edges of t vertices.
Graphs can be defined as 2-uniform hypergraphs, simple graphs are the simple 2-uniform
hypergraphs. (A singleton edge is similar to the notion of the loop in Graph theory but
a loop contributes 2 and a singleton contributes 1 to the degree of a vertex.) There are
4
some terms used only for graphs. The most important is the adjacency: two vertices
x, y of a graph are called adjacent if {x, y} is an edge of the graph. The complementary
graph (or complement) of a simple graph G = (V, E) is the graph G = (V, E) where E
is the set of pairs of V not in E. Complements of simple uniform hypergraphs can be
defined in a similar way.
Isomorphic hypergraphs: Two hypergraphs are isomorphic if there exists an edge
preserving bijection between their vertex sets.
How to draw a hypergraph? One possibility to represent a hypergraph is to draw
it. In case of graphs this is very usual. It is more complicated to draw a hypergraph.
On Figure 1.1 the “potato” represents an edge with four vertices, the other nine
edges have two vertices. Figure 1.2 is a drawing of the so called Fano plane, a famous
3-regular, 3-uniform hypergraph on seven vertices. In that case the edges are represented
with lines with one exception, the circle also represents an edge.
The incidence matrix. A hypergraph H = (V, E) can be represented by its incidence
matrix, defined as follows. Assume that V = {v1 , . . . vn } and E = {e1 , . . . em }. The
incidence matrix of H is the m × n matrix [aij ] where
(
1 if vj ∈ ei
aij =
0 if vj ∈
/ ei
Remember that rows correspond to edges and columns correspond to vertices in the
incidence matrix of the hypergraph. (Of course, this is merely a convention.)
Hypergraphs are rather general structures but there is a theorem valid for all hyper-
graphs.
5
Theorem 1.1 For any hypergraph H = (V, E)
X X
d(x) = |e|
x∈V e∈E
H H∗
Theorem 1.2 Every simple graph is the intersection graph of some hypergraph.
6
ei -s are distinct edges and xi , xi+1 ∈ ei for 1 ≤ i ≤ k − 1. For convenience, a vertex is
considered also as a path (this corresponds to the case k = 1). The length of the path is
k − 1, the number of edges in it.
The definition of a cycle is very similar, it is more symmetric: add ek to the end of
a sequence forming a path, requiring x1 , xk ∈ ek . More precisely, for k ≥ 2, a cycle is
a sequence x1 , e1 , x2 , e2 , . . . , xk−1 , ek−1 , xk , ek where the xi -s are distinct vertices and the
ei -s are distinct edges and xi , xi+1 ∈ ei for 1 ≤ i ≤ k − 1 and x1 , xk ∈ ek . The length
of the cycle is k, the number of vertices (or the number of edges). An important special
case is the cycle of length two (the first object on Figure 1.3).
Paths and cycles can form rather complicated structures in hypergraphs. In the
special case of graphs, paths and cycles are determined by just giving the sequence of
their vertices. For graphs, paths ( and cycles) with k vertices are isomorphic, a rather
standard notation is to use Pk for paths and Ck for cycles (with k vertices). Figure 1.3
gives the drawing of some paths and cycles.
The complete uniform hypergraph Knr is the hypergraph with n vertices and with
¡ ¢
all distinct r-element subsets as edges. Therefore Knr has nr edges. We always assume
1 ≤ r ≤ n and (as usual) abbreviate Kn2 as Kn .
A k-partite hypergraph is a hypergraph H = (V, E) such that V is partitioned into k
non-empty sets, called partite classes and every edge e ∈ E intersects each partite class
in at most one vertex. Notice that every hypergraph is k-partite for some k since the
vertex set can be partitioned into one element partite classes. The 2-partite graphs are
called bipartite graphs.
7
Intersecting hypergraphs are hypergraphs in which any pair of edges have a non-
empty intersection. For example, the Fano plane (Figure 1.2) is an intersecting hyper-
graph.
Steiner systems. Let t, k, n be integers satisfying 2 ≤ t ≤ k < n. A Steiner system
S(t, k, n) is a k-uniform hypergraph H = (V, E) with n vertices such that for each t-
element set T ⊂ V there is exacly one edge e ∈ E satisfying T ⊆ e. The complete graph
Kn is the same as a S(2, 2, n) Steiner system. The Steiner systems S(2, 3, n) are called
Steiner triple systems . The Fano plane is an example of a Steiner triple system on 7
vertices. A trivial example of a Steiner system with t = 3 is K43 , the smallest interesting
one is S(3, 4, 8) (Exercise 1.4).
Linear spaces. These are hypergraphs in which each pair of distinct vertices is con-
tained in precisely one edge. To exclude trivial cases, it is always assumed that there
are no empty or singleton edges.
The rest of the section gives examples and results about linear spaces. An obvious
example is a hypergraph with only one edge which contains all vertices, this is called a
trivial linear space . Another example is the so called near pencil with one ’big’ edge
which contains all but one vertices (the other edges have two vertices, see Figure 1.4).
Another straightforward example is the complete graph Kn . A more complicated one,
the affine plane of order 3 is shown in Figure 1.5. This example (like the Fano plane)
is a special case of Steiner triple systems S(2, 3, n). Observe that uniform linear spaces
are precisely the Steiner systems S(2, k, n).
8
Figures 1.4. and 1.5. The near pencil and the affine plane of order 3
Drawings of linear spaces. One can try to represent a linear space by drawing its
vertices as points of the plane so that edges are precisely the lines determined by these
points. If there are at most seven vertices then such a drawing is possible apart from
one exception, the Fano plane. Such a drawing is more clear if only the lines having at
least three points are shown. Figure 1.6 gives the drawings of linear spaces with at most
five vertices according to this convention.
After observing the examples, it seems that (apart from the trivial one) a linear space
must have at least as many edges as vertices.
Theorem 1.3 If a non-trivial linear space has n vertices and m edges then m ≥ n (de
Bruijn - Erdős).
9
Proof. (Tricky proof, due to Conway.) Let H = (V, E) be a linear space, |V | = n,
|E| = m and suppose 1 < m ≤ n. Select v ∈ V , e ∈ E such that v ∈
/ e. Observe that
this implies d(v) ≥ |e| which (together with m ≤ n ) gives
1 1
≥
n(m − d(v)) m(n − |e|)
Adding these inequalities for all pairs v ∈
/ e one gets
XX 1 XX 1
1= ≥ =1
v∈V e63v
n(m − d(v)) e∈E
m(n − |e|)
v ∈e
/
because the inner sums are never empty (1 < m) and the terms in them do not
depend on e and v (and have m − d(v) resp. n − |e| terms). Therefore all inequalities
must be equalities, in particular m = n. ¤
The next result shows that intersecting linear spaces are rather restrictive.
Proof. (Outline.) If the union of at most two edges cover the vertex set then H is
trivial or a near pencil. Otherwise for e, f ∈ E select p ∈ V \ (e ∪ f ). For x ∈ e, let g(x)
be a vertex of f such that x, p, g(x) are in an edge of H. Then g is a bijection therefore
|e| = |f | := k + 1. This shows that H is (k + 1)-uniform. Then (k + 1)-regularity,
|V | = k 2 + k + 1 and |E| = k 2 + k + 1 follows easily. ¤
Intersecting linear spaces with property (1) are called finite planes of order k . Their
existence is known only if k is a prime power. The following important result of Bruck
and Ryser rules out the existence of finite planes for certain values of k: if a finite plane
of order k exists for k ≡ 1 or k ≡ 2 (mod 4) then k = x2 + y 2 has integer solution.
Apart from these results, only the nonexistence of a finite plane of order 10 is known.
The finite planes of order at most eight are unique but this is not true for order nine.
10
Self-test 1.
[T1.1] Show that all 0 − 1 matrices are incidence matrices of hypergraphs. What about
zero rows or columns?
[T1.2] Find the smallest k for which the Fano plane is a k-partite hypergraph. Answer
the same question for the affine plane of order 3.
[T1.3] Is it obvious that the dual of a finite plane is a finite plane?
[T1.4] Is it obvious that the dual of a finite plane is isomorphic to itself?
[T1.5] Formulate Theorem 1.3 in dual form!
[T1.6] What is the intersection graph of the Fano plane?
[T1.7] What is the intersection graph of the affine plane of order 3 ?
[T1.8] Formulate Theorem 1.1 for graphs!
[T1.9] Find how to get the finite plane of order 3 from the affine plane of order 3 (Figure
1.5).
Exercise set 1.
[1.1] Prove by induction that every simple graph is the intersection graph of some hy-
pergraph.
[1.2] Prove the statement in [1.1] by using the notion of duality!
[1.3] Work out the details of the proof of Theorem 1.4
[1.4] Construct S(3, 4, 8) starting from the Fano plane. Explain why does it work!
[1.5] The Fano plane has cyclic representation: shift the set {1, 2, 4} (by adding 1 to
its elements) six times, using arithmetic (mod 7). Find similar representation for the
finite plane of order 3 and order 4. What property of q + 1 positive integers ensures that
they give a cyclic representation of a finite plane of order q?
[1.6] Show that if a Steiner triple system S(2, 3, n) exists then n ≡ 1 or n ≡ 3 (mod 6).
[1.7] Find two divisibility conditions for the existence of S(2, k, n).
[1.8] Find t divisibility conditions for the existence of S(t, k, n).
[1.9] Prove or give counterexample for the following two statements. Regular linear
spaces are uniform. Uniform linear spaces are regular.
[1.10] Give a catalogue of linear spaces with six vertices. (Follow the convention of Figure
1.6.)
11
2 CHROMATIC NUMBER AND GIRTH
χ(H) ≤ ∆(H) + 1
The girth of a hypergraph H = (V, E) , g(H), is the minimum among all cycle lengths
taken over all possible cycles of H. If H has no cycles, we define g(H) = ∞. Observe
that g(H) ≥ 2 and g(H) = 2 if and only if E has two edges intersecting in at least two
vertices.
12
We show later in this section that there are hypergraphs with large girth and large
chromatic number. The existence of such hypergraphs have been proved first by P.Erdős
and A.Hajnal with the probability method, the first explicit construction was given by
L.Lovász. First we look at the case of graphs.
Theorem 2.2 The Zykov graph Zi has no triangle and its chromatic number is i.
Proof. First we prove by induction that Zi has no triangle. This is obvious for Z2 ,
assume that true for Zi . Select three vertices from Zi+1 and assume they form a triangle
T . Then, from the construction, no two vertices of T are in Ai+1 and no two vertices of
T are from different copies of Zi and if two vertices of T are in the same copy of Zi then
the third is not from Ai+1 . Therefore the only possibility is that all three vertices of T
are from the same copy of Zi which contradicts the inductive hypothesis.
13
Next we prove that χ(Zi ) = i, it is clear for i = 2. If true for some i, then a proper
coloring of Zi+1 with i + 1 colors can be obtained by taking the same proper i-coloring
on all copies of Zi and coloring A with a new color. But perhaps there is a tricky proper
coloring of Zi+1 with i colors...
Assume there is such a tricky proper coloring of Zi+1 with colors 1, 2, . . . , i. Since
χ(Zi ) = i (by induction), the tricky coloring must use all colors in each copy of Zi . In
particular, there is a vector [x1 , x2 , . . . , xi ] such that for all j, xj is a vertex of the j-th
copy of Zi and xj is colored with color j. There exists a vertex a ∈ Ai adjacent with the
vertices of this vector! (Contradiction). ¤
Mycielski graphs.
This construction is more economic than the Zykov construction. Set M1 = K1 ,
M2 = K2 . To define Mi+1 , let Gi be a copy of Mi . For each vertex v of Gi define a twin
vertex v ∗ which is adjacent to the same set of vertices as v in Gi . (The set of twins are
distinct and disjoint from the vertex set of Gi .) Finally, add a new vertex wi adjacent
to all twin vertices.
14
Tutte graphs.
The Tutte graph Ti will be a graph of chromatic number i with girth 6, so Ti is
not only without triangles but without cycles of length four and five. Like the previous
constructions, we start with T1 = K1 and proceed recursively. However, the number of
copies of Ti needed to construct Ti+1 will be enormous...
¡(ni −1)i+1¢
Assume that Ti is already constructed and has ni vertices. Take mi = ni
disjoint copies of Ti , the reason will be explained soon. These copies will be glued
ni
together along the hyperedges of a complete ni -uniform hypergraph Hi = K(n i −1)i+1
whose vertices are disjoint from all copies of Ti . The gluing is done as follows. Due to
the definitions, we can make a one-to-one onto correspondence between the copies of Ti
and the hyperedges of Hi . Assume that the hyperedge ej corresponds to the j-th copy
of Ti (j = 1, 2, . . . , mi ). For every fixed j, define an injection fj from the vertices of ej
to the vertices of the j-th copy of Ti . The new graph edges (in addition to the edges
inside the copies of Ti ) are the pairs {x, fj (x)} where x runs over the vertices of ej and
j runs over {1, 2, . . . , mi }.
Proof. Since the vertices of the gluing hypergraph Hi can be colored with a new color
and all copies of Ti can be colored the same way by i colors, χ(Ti ) ≤ i comes easily
by induction. The problem is to show χ(Ti ) ≥ i. This is also done by induction, of
course there is no difficulty to launch it. Assume that we already know this for some i.
Assume indirectly that Ti+1 has a proper coloring with i colors. Then, by the inductive
hypothesis, all copies of Ti in Ti+1 are colored using all the i colors.
15
The vertices of the gluing hypergraph Hi are also colored and by the pigeonhole
principle, there exists a hyperedge, say ej , whose vertices are all colored with the same
color, say color 1! (Check this carefully to enjoy the beauty of Tutte’s idea.) Now we
have a contradiction because the j-th copy of Ti has a vertex y colored with color 1 so
both vertices of the edge {fj−1 (y), y} are colored with color 1.
There is still a nice part ahead: imagine that you walk on a cycle of Ti+1 . Look around
carefully while you complete your tour on the cycle. It can certainly happen that your
tour is completely inside of a copy of Ti . Then, by induction you are convinced that
you visited at least six vertices. Watch your step when you leave a copy, you are in the
gluing hypergraph. Where to go? You are forced to go into a new copy of Ti from where
you can not escape immediately. Therefore you visit at least two vertices in at least two
copies and at least two vertices of the gluing hypergraph, a total of at least six vertices.¤
The Shift graph (of Erdős and Hajnal).
The new feature of this graph is that it is defined without recursion. The vertex set
¡ ¢
of the shift graph SHn is the set of pairs (i, j), 1 ≤ i < j ≤ n so it has n2 vertices.
What about the edges? The vertices (p, q), (r, s) are adjacent if and only if q = r or
p = s.
Theorem 2.5 The Shift graph SHn has no triangles and χ(SHn ) = dlog2 (n)e.
Proof. The triangle free property is left as an (easy) exercise. To see that χ(SHn ) ≤
n
dlog2 (n)e, color the vertex (p, q) with color 1 if 1 ≤ p ≤ 2
< q ≤ n. The vertices of SHn
are partitioned into A1 , A2 as follows.
n n
A1 = {(i, j) : 1 ≤ i < j ≤ }, A2 = {(i, j) : < i < j ≤ n}
2 2
16
Observe that x ∈ A1 and y ∈ A2 are not adjacent and A1 , A2 define shift graphs,
isomorphic to SHb n2 c and to SHd n2 e , respectively. Continuing this process, a proper
coloring of SHn is obtained with dlog2 (n)e colors.
To prove the reverse inequality, χ(SHn ) ≥ dlog2 (n)e, assume that the vertices of
SHn are properly colored with t colors. For each i < n, let Si denote the set of vertices
{(i, j) : j = i + 1, i + 2, . . . , n}. We claim that the sets Sk and Sl can not be colored
with the same color set if k 6= l. Indeed, without loss of generality, k < l and the color
of (k, l) is red. Then red is a color used on Sk but red can not be used on Sl because
all vertices of Sl are adjacent to the vertex (k, l). Now the claim is proved and it gives
that n − 1, the number of sets Si is not larger than the number of nonempty subsets of
t colors. It follows that n − 1 ≤ 2t − 1 i.e. n ≤ 2t which implies dlog2 (n)e ≤ t. ¤
Kneser graphs. In 1955 Kneser posed the following problem. For fixed positive
n
integers n, k, consider the hypergraph K2n+k , i.e. all n element subsets of a ground set
of 2n + k elements. Kneser was interested in partitioning the hyperedges into as few
classes as possible so that hyperedges in the same partition class are pairwise intersecting
and conjectured that this minimum is k + 2. One can reformulate this as a problem
to determine the chromatic number (of a very symmetric) graph, the Kneser graph,
n
KN (n, k) whose vertices correspond to the edges of K2n+k , two vertices being adjacent
if and only if the corresponding hyperedges are disjoint. Then the conjecture is that the
chromatic number of this graph is k + 2. It is easy to see that k + 2 is an upper bound
(Exercise 2.10), the problem was whether one can do better. Kneser’s conjecture was
proved by Lovász in 1977 and Bárány simplified it very soon thereafter (but Lovász’s
proof gives more). The main surprise (of both proofs) is the method applied.
Proof. Bárány’s short proof uses two ingredients (the first was used in Lovász’s proof
as well):
Borsuk theorem (1933). Assume that surface of the unit sphere, S k = {x ∈ Rk+1 , ||x|| =
1} is covered by the union of k + 1 open sets. Then some of these sets has two antipodal
points.
17
Gale theorem (1956). For positive integers k, n it is possible to place 2n + k points on S k
so that each open hemisphere contains at least n of them. (An open hemisphere H(a)
with center a ∈ S k is the set {x ∈ S k } for which the inner product xa is positive.)
This is the way to put together the two ingredients. Assume that the n-element
subsets of a 2n + k-element are partitioned into k + 1 classes so that each class contains
pairwise intersecting sets. Place the vertices of the ground set on S k as described by
Gale theorem. Define the sets U1 , U2 , . . . , Uk , Uk+1 as follows: let Ui be the set of points
x ∈ S k for which the open hemisphere H(x) contains an n-element set from the i-th
partition class. By Gale theorem, the union of the sets Ui cover S k . The sets Ui are all
open sets, Borsuk theorem says that in some of them, say in Uj , there are two antipodal
points. These points are centers of disjoint open hemispheres, by definition both contain
an n-element set from the j-th partition class - contradiction. ¤
Josh Green, a former BSM student simplified the proof even further by eliminating
Gale’s theorem from the proof! His modification (appeared in 2002 in the American
Mathematical Monthly) is as follows. Instead of placing the vertices of the ground set
on S k by Gale’s theorem, he just placed them on S k+1 as points in general position!
General position here means no k + 2 points are on a great k-sphere, S k ⊂ S k+1 . The
sets U1 , U2 , . . . , Uk , Uk+1 are defined similarly: Ui is the set of points x ∈ S k+1 for which
the open hemisphere H(x) contains an n-element set from the i-th partition class. The
k + 2 sets
k+1
[
k+1
U1 , . . . , Uk+1 , F = S \ Ui
i=1
cover S k+1 . Borsuk’s theorem remains true if open and closed sets are mixed in the
cover (one of the sets, F , is closed). Thus some of these sets contain an antipodal pair
a, −a. This pair can not be in F because the great circle separating the hemispheres
H(a), H(−a) contains at most k+1 points so one of H(a), H(−a) contain at least n
points so a or −a would be in some Ui . Thus - as in the original proof - we have a pair
of antipodal points in some Ui and this contradiction finishes the proof.
18
2.3 HOW TO GLUE HYPERGRAPHS TO GET GRAPHS?
Recall that Tutte graphs have arbitrary large chromatic number and girth six. No simple
construction is known if graphs of girth seven are required with large chromatic number,
although their existence was proved by Erdős in 1961 using his probability method.
The first construction of such graphs was found by Lovász in 1968. Interestingly, his
construction is based on hypergraphs. The following construction is somewhat simpler
(?!). It is also based on hypergraphs.
The Nesetril-Rödl construction.
Assume that three positive integers, p, k, n are given, k, n ≥ 2. Our objective is to
construct a hypergraph H with the following properties: 1. H has no cycles of length at
most p (the girth of H is at least p+1) . 2. H is k-uniform . 3. χ(H) ≥ n. A hypergraph
H with properties 1,2,3 is called a [p, k, n]-hypergraph. Observe that we have already
seen [3, 2, n]-hypergraphs, i.e. graphs without triangles and chromatic number at least
n (Zykov graphs, Mycielski graphs) and [5, 2, n]-hypergraphs, i.e. graphs with girth six
and chromatic number at least n.
The construction of [p, k, n]-hypergraphs is recursive. The first step is to construct a
[1, k, n]-hypergraph. Note that in this case we have no restrictions about cycles because,
by the definition of cycles, every cycle has length at least two. We define (in terms of k
and n ) a number s as follows.
s := (k − 1)(n − 1) + 1.
19
determined by those k vertices is monochromatic. This contradicts the definition of a
proper coloring.
Recursive step. Assume that [p − 1, K, n]-hypergraphs are already constructed. Unfor-
tunately, there is no misprint here! Although we wish to construct an s-partite [p, k, n]-
hypergraph, we shall use [p − 1, K, n]-hypergraphs with values of K much larger than
k. (This means that graphs (k = 2) can not be constructed without using hyper-
graphs already constructed.) Notice also that s depends on k therefore the [p − 1, K, n]-
hypergraphs we shall use are S-partite where S is not the same as s. It might be a
consolation than n never changes.
In fact, we shall build hypergraphs H1 , . . . , Hs+1 , each of them will be s-partite and
k-uniform. The last one, Hs+1 will be the final aim, the s-partite [p, k, n]-hypergraph.
The definition of H1 . It will be refreshing to see that H1 is not defined recursively.
Moreover, H1 will have pairwise disjoint edges. After these promises, look at the defini-
tion.
The vertex set V of H1 is partitioned into sets X1 , X2 , . . . , Xs where each set Xi
¡s−1¢
has the same number of vertices, namely k−1 . This allows to define pairwise disjoint
¡¢
k-element edges so that for all choices of k partite sets (there are ks choices) there is
exactly one edge which has vertices from the chosen partite classes (one vertex from
each). Figure 2.5 illustrates H1 for k = 3, n = 3 in that case s = (3 − 1)(3 − 1) + 1 = 5,
¡¢
|Xi | = 42 = 6.
20
Figure 2.5. H1
The definition of Hj+1 (1 ≤ j ≤ s). Assume that Hj is already constructed for some
j, 1 ≤ j ≤ s with partite classes X1 , . . . , Xs .
Set K = |Xj |.
Let Fj be a [p − 1, K, n]-hypergraph with vertex set Vj and with edge set Ej . The set
Vj is called the gluing base and the edges in Ej are called the gluing edges because they
serve to keep together many copies of Hj . The definition of Hj+1 is as follows. Take
as many copies of Hj as the number of gluing edges and, for each gluing edge, identify
the j-th partite class of a copy with the gluing edge. For distinct gluing edges distinct
copies of Hj are used. Apart from their j-th classes, all partite classes of the copies are
kept disjoint. The result is an s-partite hypergraph whose t-th partite class is the union
of the t-th partite classes of the copies used, except for t = j when the t-th partite class
is Vj . This finishes the (recursive) definition of Hj which gives Hs+1 in the last step.
21
Figure 2.6. To get Hj+1 from copies of Hj
Proof. The problem is to understand the construction, after that the proof is not
difficult. Only two statements need a proof. The first is that there are no cycles up to
length p. This comes from an argument similar to the one used for Tutte graphs to show
that their smallest cycle is of length at least six. Of course, it is a bit more difficult to
travel through a cycle of a hypergraph than through a cycle of a graph... Prove first for
H1 (trivial) then use induction to prove it for Hs+1 . (Exercise 2.11 asks to elaborate the
details.)
The second statement is that it is impossible to color properly the vertices of Hs+1
with n − 1 colors. Assume there is such a coloring C. Then C colors the vertices of Xs
which is the gluing base of Fs which is at least n-chromatic so there exists a gluing edge in
it monochromatic under C. This gluing edge serves as Xs , the last partite class in a copy
of Hs . Therefore we can select a copy of Hs whose last partite class is monochromatic.
Repeat the argument, C colors the (s − 1)-th partite class, Xs−1 of the selected copy
of Hs which is the gluing base of Fs−1 ... Going back like this, we find a copy of H1 in
which all partite classes are monochromatic under C. And now the choice of s (and the
contradiction) becomes (hopefully) clear (Exercise 2.12). ¤
22
Self-test 2.
[T2.1] What is the chromatic number of Kn ?
[T2.2] What is the chromatic number of Knr ?
[T2.3] Show that the Fano plane and the affine plane of order 3 are 3-chromatic hyper-
graphs.
[T2.4] Find a 3-chromatic graph with girth g.
[T2.5] Give some hypergraphs (graphs) with equality in Theorem 2.1.
[T2.6] Show that (with suitably chosen parameters) the Kneser graph has no triangle
and its chromatic number is arbitrary large.
Exercise set 2.
[2.1] Show that a graph is bipartite if and only if it does not have cycles of odd length.
[2.2] Prove that the Greedy algorithm colors every hypergraph H = (V, E) properly with
at most ∆(H) + 1 colors!
[2.3] Give an example of a bipartite graph G such that for a certain ordering of its
vertices the Greedy algorithm uses 2000 colors for the proper coloring of G.
[2.4] Use the greedy algorithm to prove: if H = (V, E) is a hypergraph (no singleton
edges) such that for all pairs of distinct edges e, f ∈ E we have |e∩f | =
6 1 then χ(H) = 2.
[2.5] Prove that a finite plane of order at least 3 is 2-colorable!
[2.6] Prove that Steiner triple systems are not 2-colorable!
[2.7] Modify the definition of the Zykov graph Zn+1 so that instead of n copies of Zn one
copy of Zi is used for 1 ≤ i ≤ n. Show that it works (no triangles, and it has chromatic
number n + 1).
[2.8] Prove that the Mycielski graph Mn has no triangle and has chromatic number n.
Hint: one possible proof uses the following lemma. Assume that G is an n-chromatic
graph and consider a proper coloring of G with n colors, 1, 2, . . . , n. Then, for each color
i, there exists a vertex v colored with color i such that v is adjacent to at least one vertex
of color j for all j 6= i.
[2.9] Prove that there are no triangles in the Shift graphs.
[2.10] Prove that the chromatic number of the Kneser graph KN (n, k) is at most k + 2.
[2.11] Show that Hs+1 has no cycles of length 2, 3, . . . , p (hint is in the text).
[2.12] Show that the chromatic number of Hs+1 is at least n (outline is in the text).
23
3 A LOOK AT RAMSEY THEORY
Theorem 3.1 If six persons are present at a party then either three of them pairwise
know each other or three of them pairwise do not know each other. This is not necessarily
true for five persons.
Proof. Select an arbitrary person P . Then there are three other persons P1 , P2 , P3 such
that P either knows all of them or none of them. In the former case, either P, Pi , Pj
pairwise know each other or P1 , P2 , P3 pairwise do not know each other. The latter case
is similar.
The graph C5 (vertices represent persons and edges represent the pairs who know
each other) show the second part of the theorem. ¤
24
Theorem 3.2 Assume that n, t are integers satisfying n ≥ t. There exists an integer m
t
(depending on n and t) with the following property: if the edges of Km are colored with
red and blue in any fashion then there exists a monochromatic Knt , i.e. n vertices with
¡ ¢
all the nt edges determined by them having the same color.
The smallest m in Theorem 3.2 is called the Ramsey number and denoted by Rt (n).
Then, clearly R1 (n) = 2n − 1 (as mentioned above discussing the Pigeonhole Principle)
and R2 (n) is the same as R(n) introduced above.
Proof. We prove by induction on t. The case t = 1 is the pigeonhole principle (R1 (n) =
2n − 1).
Consider t fixed and assume that Rt (k) exists for any k ≥ t. We are going to prove
that Rt+1 (n) exists.
The heart of the proof is the definition of the “shadow coloring”. If the edges of
t+1
H = Km are 2-colored and a vertex x is removed from H then H − x can be considered
t
as a 2-colored Km−1 , namely: a t-element subset T of H − x is considered to be colored
with the same color as T ∪ x in H. We call briefly this coloring as the shadow coloring
of H − x . Observe that if m is large (depending on k) then the induction hypothesis
guarantees a monochromatic Kkt in the shadow coloring of H − x. In other words, there
exists a set K with k vertices such that the color of each t + 1-element subset containing
x in the set x ∪ K is the same.
Using that k can be chosen arbitrarily in terms of n, one can select a huge m which
allows to find a set S of 2n − 1 vertices, S = {x1 , x2 , . . . x2n−1 }, so that the color of any
t + 1-element subset of S depends only on the smallest index of its elements. By the
pigeonhole principle, there exist n elements of S determining a monochromatic Knt+1 . ¤
Proof. Set m = 4n = 22n , consider a 2-coloring of the edges of the complete graph Km .
The shadow coloring at any vertex x1 is a vertex coloring. By the pigeonhole principle,
25
one can find 22n−1 vertices all adjacent to x1 in the same color. Select x2 arbitrarily
among those vertices and continue... One can select S = {x1 , x2 , . . . , x2n−1 } so that the
color of any edge {xi , xj } within S depends only on the minimum of i and j. This gives
a monochromatic Kn in S (as in the proof of Theorem 3.2). ¤
Many colors and non-diagonal Ramsey numbers. There are natural generaliza-
tions of the numbers Rt (n). The Ramsey number
Rt (n1 , n2 , . . . , nr )
t
is the smallest integer m with the following property: if the edges of Km are colored
with colors 1, 2, . . . , r in any fashion then, for some i, there exists a Knt i , monochromatic
¡ ¢
in color i (i.e. all the nti edges within some set of ni vertices are colored with color i).
The number Rt (n1 , n2 , . . . , nr ) is sometimes called non-diagonal Ramsey number (for
r colors) emphasizing that the numbers ni can be different. In the same spirit, if all ni
are equal than the term diagonal Ramsey number is used (for example, R(n) = R2 (n, n)
is a diagonal Ramsey number). The existence of non-diagonal Ramsey numbers follows
immediately from the existence of diagonal ones. The order of magnitude of R(3, n)
2n
( log(n) ) was clarified in 1995 (Kim) after a a long process of advances. The role of non-
diagonal Ramsey numbers can be illustrated by the following basic inequality of Erdős
and Szekeres.
¡p+q−2¢
Corollary 3.5 R(p, q) ≤ p−1
.
26
The only nontrivial exact value known for three colors is R(3, 3, 3) = 17 (see exercise
[3.4]). The only known value for hypergraphs is R3 (4, 4) = 13.
Convex n-gons. The following surprising application of Ramsey numbers is a cele-
brated theorem of Erdős and Szekeres.
Theorem 3.6 There exists a function f (n) with the following property: if f (n) points
of the plane are in general position (no three on a line) then there exist n of these points
forming the vertices of a convex n-gon.
Proof. We show that R4 (5, n) is a good choice for f (n). Consider a set P of R4 (5, n)
points in general position. A four-element subset of P is called convex if the convex hull
of the four points is a quadrangle, otherwise it is called concave. This is a partition of all
four-element subsets of P therefore, from the definition of the Ramsey number R4 (5, n),
either there exist five points of P whose four-element subsets are concave or there exist
n points of P whose four-element subsets are convex. However, the former possibility is
impossible ([T3.2]) and the latter implies that we have n points which are the vertices
of a convex n-gon ([T3.3]). ¤
27
Proposition 3.7 A 2-colored base block contains an almost monochromatic 3-term A.P.
Consider a 2-coloring C of [325] and partition [325] into 65 base blocks, B1 , B2 , . . . B65 ,
starting with B1 = [5]. Since there are only 25 ways to color a base block, the Pigeonhole
Principle guarantees the existence of 1 ≤ i < j ≤ 33 such that C colors Bi and Bj
precisely the same way. Now we can select l ≤ 65 so that the three base blocks Bi , Bj , Bl
form a 3-term A.P. of blocks which means that the first elements of the blocks form a 3-
term A.P. Applying Proposition 3.7 for Bi , we can find an almost monochromatic 3-term
A.P. in Bi . If this is monochromatic, the claim follows. Otherwise, w.l.o.g., C colors
it as RRB (red,red,blue). Since Bj is colored precisely like Bi , one can find two 3-term
A.P.-s with identical third term such that one of them is colored RR? and the other is
colored BB?. This finishes the proof of the claim (see Figure 3.1).
Of course, 325 is just an upper bound on W (3). It is not very difficult to prove that
W (3) = 9, but that proof is not suitable to generalize. The next step shows how to prove
the existence of W (4) from the existence of W (3, r), which is the smallest m with the
property: if [m] is r-colored in any fashion then there exists a monochromatic 3-term
A.P. It is strange that a theorem about 2-colorings can not be proved without proving
it for r-colorings... (This interesting phenomena occurred earlier. The construction of
[p, 2, n]-hypergraphs (graphs!) relied on constructions of [p − 1, K, n]-hypergraphs.)
W (4) < ∞ if W (3, r) < ∞. A base block of length t = 2W (3) − 1 ensures that Propo-
sition 3.7 can be generalized: a 2-colored base block contains an almost monochromatic
4-term A.P. (first three terms of the same color). But now we need three identically
colored base blocks forming a 3-term A.P. of blocks! Since a base block of length t can
be 2-colored in 2t ways we need W (3, 2t ) base blocks to ensure that. To extend it with
28
a fourth base block is just a modest doubling and then we have the possibility to find
two 4-term A.P.-s colored as RRR? and BBB? such that their fourth terms are identical.
In fact, t = 13 works here, (using W (3) = 9 instead of the upper bound 325 from the
proof) we have
W (4) ≤ t(2W (3, 2t ) − 1) < 13W (3, 213 ).
Now the existence of W (4) follows from the existence of W (3, r) where r = 213 . One
can guess that this is not very small, especially not from our proof which will be roughly
2
..
2.
2
where the hight of the tower is r. Concerning the existence of W (3, r) only the first step
of the iteration process is shown.
W (3, 3) < ∞ . Blocks of length 7 are the ’small base blocks’, one can find almost
monochromatic 3-term A.P.-s in them. Blocks of length 2 × 7 × 37 are the ’medium base
blocks’, one can find in them the pattern shown in Figure 3.2. The final step is to find
two identically colored medium blocks and extend them to a 3-term A.P. of medium
blocks (see Figure 3.2). The argument gives
7
W (3, 3) < 2(2 × 7 × 37 )(32×7×3 )
Theorem 3.8 (Van der Waerden) For every k, r there exists m (depending on k, r) such
that any r-coloring of [m] contains a monochromatic k-term arithmetic progression.
29
The proof outlined above gives a bound for W (k, 2) which grows as the Ackermann
function. In 1989 Shelah found a better bound, W (k, 2) ≤ wow(k + 1), where “wow” is
the iterated tower function
22
wow(2) = 2, wow(3) = 22 , wow(4) = 22 = 65536
and wow(5) is a tower of 2’s with height 65536 - wow! For comparison, note that
W (4) = 35 and W (3, 3) = 27. It is also known that W (5) = 178. The best lower bound
of W (k, 2) is exponential (see Section 4).
Theorem 3.9 (Hales-Jewett) For arbitrary positive integers t, r there exists a positive
integer n (depending on t, r) with the following property: for every r-coloring of the
points of the cube Ctn there exists a monochromatic line (i.e. a line whose points are
colored with the same color).
30
Figure 3.3. The cubes C32 and C23 and their lines.
Tic-tac-toe. In the r-player Tic-tac-toe game the players color the points of Ctn (players
use distinct colors) and their objective is to create a monochromatic line (using some
definition of a line). Since the lines defined above are the most restrictive, Theorem 3.9
implies that there is no draw if r persons play with fixed board size t in sufficiently high
dimenson n. In case of two persons, the first player always wins (again, if n is large
enough).
Self-test 3.
[T3.1] Show that R(n) > (n − 1)2 .
[T3.2] Assume that five points of the plane are in general position (no three on a line).
Show that four of them are the vertices of a convex quadrangle.
[T3.3] Assume that n points of the plane are in general position and all subsets of four
points determine a convex quadrangle. Prove that the n points are the vertices of a
convex n-gon.
[T3.4] Consider the k-uniform hypergraph A(k, n) whose vertex set is [n] and whose
edges are the k-term A.P.-s. Formulate Van der Waerden’s theorem as a statement
about the chromatic number of this hypergraph.
P ¡ ¢
[T3.5] Show that Ctn has ni=1 ni tn−i lines.
Exercise set 3.
31
[3.1] Prove that R(3, 4) ≤ 9.
[3.2] Prove that R(3, 4) ≥ 9.
[3.3] Prove that R(4, 4) ≤ 18. (The reverse inequality comes from the following famous
coloring of the edges of K17 : the vertex set is [17] and the edge (i, j) is red if i − j is a
square (mod 17) otherwise it is blue.)
[3.4] Prove that R(3, 3, 3) ≤ 17.
[3.5] Show that R(3, 3, . . . , 3) ≤ 3r! (assuming r arguments). Extra part: improve the
bound to ber!c + 1 (this is the current record set eighty years ago).
[3.6] Show that R(3, 3, . . . , 3) > 2r (assuming r arguments). Extra: Ideas for improve-
ment?
[3.7] Use the result of exercise [3.5] to prove the following: if n ≥ 3r! then no matter
how [n] is partitioned into r classes, there is a solution of the equation x + y = z with
all three numbers from the same class (x = y is permitted in the solution).
[3.8] Show that if the edges of a countably infinite complete graph are colored red or
blue in any fashion then there is a infinite monochromatic complete subgraph.
[3.9] (Kiran Kedlaya, former BSM student) Show that R(n) ≤ R3 (6, n).
[3.10] Prove Theorem 3.4!
[3.11] Prove the corollary of Theorem 3.4!
[3.12] Prove that R3 (n, n) is also a good choice for f (n) in Theorem 3.6 . Hint: color
triples according to the parity of the number of points inside the triangle spanned by
them.
[3.13] Assume (a real life situation) that ’knowing each other’ is not necessarily sym-
metric. Prove that at a party with nine persons there are either three persons among
which nobody knows any other or there are three persons A, B, C such that A knows B,
B knows C and A knows C (a transitive triple). Show that the same statement is not
necessarily true for a party of eight persons.
[3.14] Show that
7
W (3, 3) < 2(2 × 7 × 37 )(32×7×3 )
[3.15] Show that Hales-Jewett theorem implies Van der Waerden theorem.
32
[3.16] Show that
33
4 COUNTING AND PROBABILITY
Theorem 4.1 (Sperner) Assume that V = [n] and H = (V, E) is an antichain. Then
µ ¶
n
|E| ≤ .
b n2 c
µ ¶−1
n
|E| n ≤1
b2c
and the proof is finished. ¤
34
otherwise the question is foolish. With this assumption, the answer is not difficult, guess
it before you look at exercise [4.2]. What happens if we ask the same question for simple
t-uniform hypergraphs? If n ≤ 2t − 1 then the answer is immediate ([T4.2]). For larger
values of n, the answer is provided by
Proof. (Katona)We say that a cyclic permutation of [n] extends an edge e ∈ E if the
vertices of e appear in consecutive positions of the cyclic permutation. There are t!(n−t)!
cyclic permutations extending e. On the other hand, a cyclic permutation can be the
extension of at most t edges. This is not so obvious, exercise [4.3] asks for the proof.
The number of cyclic permutations of [n] is (n − 1)! therefore
Theorem 4.3 (Erdős - Selfridge) A t-uniform hypergraph with less than 2t−1 edges is
2-chromatic.
Proof. Assume that H = (V, E) is a t-uniform hypergraph with vertex set V = [n].
For each edge e ∈ E there are 2 × 2n−t = 2n−t+1 distinct 2-colorings of V which makes e
monochromatic. This implies that at most |E|2n−t+1 2-colorings of V are monochromatic
on some edges of H. However, from the assumption |E| < 2t−1 , that number is smaller
35
than 2n , the number of 2-colorings of V . This means that there exists a 2-coloring of V
under which no edge of H is monochromatic! ¤
Proof. The probability space is the 2-colorings of the edges of Km . This space has N =
2( 2 ) elements. A subset of n vertices in K spans a monochromatic complete subgraph
m
m
1
with probability p = . Therefore the probability of having a monochromatic Kn is
2( 2 )
n −1
36
¡m¢
at most p n
< 1 from the assumption of the theorem. This means that the probability
of having no monochromatic Kn is positive. Therefore there exists a 2-coloring of the
edges of Km such that there is no monochromatic Kn . This clearly implies that m is a
lower bound for R(n). ¤
The lower bound m for R(n) was stated in a strange way in Theorem 4.4 to show
the idea of the proof clearly. To get an actual bound, one has to calculate the largest m
(in terms of n) satisfying the condition
µ ¶
m
< 2( 2 )−1 .
n
n
Such calculations often require upper bound on binomial coefficients. The better the
estimate, the better the bound. If we start with the trivial upper bound
µ ¶
m
< mn
n
then our lower bound is
n 1 1
m = 2 2 × 2− n × 2− 2
¡m¢
([T4.3]). If Sterling’s formula is used for the asymptotic of n
then
n n
m = 22 × √
e 2
is obtained so there is no earthshaking difference between the trivial and the best
estimate of m.
n
Corollary 4.5 (Erdős) For n ≥ 3, 2 2 < R(n).
Proof. Outline: ¡ n¢
2
< 1.
2k−1
Tournaments. A tournament Tn is an oriented Kn , which means that each edge of the
complete graph Kn gets a one-way orientation. The edge from vertex v to vertex w is
37
denoted by (v, w). We can interpret a tournament Tn as vertices represent players and
(v, w) means that v beats w in the game.
There are two non-isomorphic T3 -s, one is the cyclic triangle and the other is the
transitive triangle. ([T4.4]: define the isomorphism of tournaments) A tournament is
called transitive if the presence of the edges (i, j) and (j, k) imply that (i, k) is also
an edge. A tournament Tn is transitive if and only if its vertices can be labeled with
1, 2, . . . , n so that all edges point from smaller label to larger label (exercise [4.4]). This
shows that transitive tournaments on the same number of vertices are isomorphic.
Large transitive subtournaments.
Let f (k) be the smallest n such that any Tn contains a transitive Tk . It is obvious
that f (2) = 2, it is easy that f (3) = 4 ([T4.5]). Then f (4) = 8 and f (5) = 14.
k−1
Theorem 4.7 2 2 ≤ f (k) ≤ 2k−1 .
Proof. The lower bound comes from the probability method. The probability space is
the set of all tournaments on [n], there are N = 2( 2 ) elements in the probability space.
n
If ¡n¢
k
k!
<1
2( )
k
2
then there exists a Tn which does not contain a transitive subtournament on k vertices.
Work out the details! (Exercise [4.5])
The upper bound comes easily by induction (Exercise [4.6]). ¤
Existence versus construction. Theorem 4.7 shows that there exists a tournament
T127 which does not contain a transitive T15 because the probability of having a transitive
T15 in every T127 is smaller than one. In a “real world” problem, it would be important to
know how much smaller? The following situation is from the book of Erdős and Spencer:
Probabilistic methods in Combinatorics.
“A computer scientist is told to construct , on a computer, the matrix of a tournament
on 127 players that does not contain a transitive subtornament on 15 players. Imagine
that if his matrix does contain a transitive subtournament on 15 players, his company
(or country or mutual fund) will suffer grave financial consequences. He learns from this
book that such a tournament exists. But this is not sufficient - his boss wants a specific
38
matrix stored in the computer. Our scientist calculates that it may cost millions of
dollars (or tens of millions of forints ) to find such a T by exhaustive methods. However,
like many of us, he is on a limited budget. What is he going to do? If his moral character
is sufficiently low he would construct T at random, pack his bags to prepare for a hasty
departure, and hope for the best. Given our experiences with computer scientists, we
surmise that this would be the “real world” solution. If, on the other hand, he is of
outstanding moral character his only recourse is to ask for a refund on this book. We
would, of course, refuse.”
This was written more than twenty years ago so certain statements (definitely the
conversion rate of dollars to forints) may not be valid any more. The quoted paragraph
continues:
“The “real world” situation is different if it is required that the tournament does
not contain a transitive subtournament on 16 players. Then a “random” tournament
will have the desired property with probability ≥ .998. This probability of success is
sufficient in many practical situation - our computer scientist can randomize and relax.”
Paradoxical tournaments. Who are the best k players in a tournament? This can
not be answered easily because, for every k, there are tournaments in which every set of
k players is beaten by somebody. Tournaments of this property are called k-paradoxical
.
Theorem 4.8 For every positive integer k, there exist k-paradoxical tournaments.
Proof. At this point, the reader can easily prove it (exercise [4.7]) by interpreting the
inequality µ ¶
n 1
(1 − k )n−k < 1
k 2
¤
39
Hamiltonian Paths in Tournaments.
A famous theorem of Rédei (1937) says that the players of any tournament can be
placed in a line so that each player won the match against her left hand neighbor. More
formally, every tournament has a (directed) Hamiltonian path. The simple proof is
left as an exercise ([4.8]). (A more difficult result is that the number of Hamiltonian
paths is odd in every tournament.) How many Hamiltonian paths can we found in a
tournament? If the tournament is transitive then there is only one. On the other hand,
there are tournaments with many Hamiltonian paths.
n!
Theorem 4.9 For every n there is a tournament Tn with at least 2n−1
Hamiltonian
paths.
40
the lemma was applied to a specific problem (to prove Theorem 4.11 below) but later
became an important tool applicable in many situations.
Dependency bound. The set of events {A1 , A2 , . . . , An } has dependency bound D if,
for each i, Ai is dependent on at most D other events Aj . More precisely, this means
that for each i, there is a set Si ⊆ [n] such that |Si | ≤ D, i ∈
/ Si and Ai is independent of
any boolean combination of the events {Aj : j ∈
/ Si }. (Two events A, B are independent
if prob(A ∩ B) = prob(A)prob(B).)
Theorem 4.10 (Local Lemma) Assume that A1 , A2 , . . . , An are events with dependency
1
bound D and prob(Ai ) ≤ 4D
for 1 ≤ i ≤ n. Then
Ãn !
\
prob Ai > 0
i=1
Proof. We shall use occasionally AB for A ∩ B, pr for prob. It is convenient to use the
notion of conditional probability. For non-empty B define probability of A with condition
pr(AB)
B as pr(A|B) := pr(B)
. Then A, B are independent if and only if pr(A|B) = pr(A).
Claim: for arbitrary S ⊆ [n] and i ∈
/S
à !
\ 1
P = pr Ai | Aj ≤
j∈S
2D
1
The claim is proved by induction on |S|, for S = ∅ it is true because pr(Ai ) ≤ 4D
from the
condition of the theorem. From the assumptions, S can be partitioned into S1 and S2 so
that |S1 | ≤ D and Ai is independent of any boolean combination of {Aj : j ∈ S2 , j 6= i}.
pr(AB|C)
Using the identity pr(A|BC) = pr(B|C)
write P as
³ T T ´
pr Ai j∈S1 Aj | j∈S2 Aj
P = ³T T ´ (2)
pr j∈S1 Aj | j∈S2 Aj
41
and the denominator in (2) is bounded from below by
à ! à !
[ \ X \ X 1 1 1
1 − pr Aj | Aj ≥ 1 − pr Aj | Aj ≥ 1 − ≥1− =
j∈S j∈S j∈S j∈S j∈S
2D 2 2
1 2 1 2 1
Theorem 4.11 Assume that H = (V, E) is a t-uniform hypergraph in which each edge
intersects at most 2t−3 other edges (t ≥ 3). Then H is 2-chromatic.
Proof. Assume that the vertex set of H is [n]. Consider the probability space of all
2-colorings of V . For each edge ei ∈ E, let Ai be the event that ei is monochromatic.
1
Clearly prob(Ai ) = 2t−1
and D = 2t−3 is a good dependency bound for the events
because Ai is independent from any Boolean combinations of those Aj -s for which ej
does not intersect ei . Thus the condition for the Local Lemma is satisfied with equality
1 1
( 2t−1 = 4D
). Then the Local Lemma says
à !
\
prob Ai > 0
ei ∈E
2k
Theorem 4.12 W (k) > 8k
.
Proof. The probability space is all 2-colorings of [n]. For each k-term A.P. S, AS is the
1
event that S is monochromatic. Clearly, prob(AS ) = 2k−1
. Since a fixed S can intersect
42
at most nk other S’s (Ex.[4.9]), D = nk is a good dependency bound. The condition of
2k
the Local Lemma is satisfied with equality if n = 8k
. ¤
Even cycles in regular digraphs.
A directed graph is r-regular if the indegree and the outdegree of each vertex is r. It
was a long-standing conjecture that for sufficiently large r any r-regular digraph contains
an even (directed) cycle. A surprising proof was found by Alon.
Proof. The probability space is the 2-colorings of the vertex set of the given r-regular
digraph G. A coloring is ’good’ if for each vertex x of G there exists an edge (x, y) such
that the color of x and y are different. A coloring is ’bad’ at x if there is no vertex y such
that the color of x and y are different. Let Ax be the event that a coloring is bad at x.
1
Then prob(Ax ) = 2r
. On the other hand, D = r2 is good dependency bound for the set
of events {Ax : x ∈ V } (check! Ex.[4.10] ). To apply the Local Lemma, the condition
1 1
2r
≤ 4r2
is needed which is valid since r ≥ 8. Now the Local Lemma says that there
exists a good coloring which immediately gives an even cycle ([T4.6]). ¤
4.4 JOKES.
The triangle is 2-chromatic!
Proof. Let V = [3] be the vertex set of the triangle T . The probability space is the 2-
colorings of V (N = 8 elements). Let Aij be the event that the edge ij is monochromatic.
1
Clearly, prob(Aij ) = 2
for all the three choices of index pairs. Any pair of events is
independent, for example
1
prob (A12 A13 ) = prob (A12 ) prob (A13 ) =
4
1
therefore D = 2
is a good dependency bound. The condition for the application of the
Local Lemma holds with equality because
1 1
prob (Aij ) = =
2 4D
43
Therefore there exists a coloring of V so that no edge is monochromatic. ¤
Spencer’s injections.
A function f : S → T is called injective if its domain is S and it is one-to-one.
¡|S|¢
Theorem 4.15 Joke 2 (Spencer) Assume that S is a finite set. If |T | > 2
then there
exists an injective function f : S → T .
Proof. The probability method is used with probability space of all f : S → T functions.
The event Axy is that f (x) = f (y). Clearly
1
prob (Axy ) =
|T |
it follows that à !
\
prob Axy >0
x,y∈S
so an injective f exists. ¤
One can try to improve the theorem above by the Local Lemma. Indeed, an order of
magnitude better result comes from it.
Theorem 4.16 Joke 3 (Spencer) Assume that S is a finite set. If |T | ≥ 8|S| then there
exists an injective function f : S → T .
Proof. Follow the previous proof, but apply the Local Lemma. Clearly, D = 2(|S| − 2)
is a good dependency bound for the events Axy . The condition |T | ≥ 8|S| makes the
Local Lemma applicable. ¤
44
Self-test 4.
[T4.1] Is it true that an antichain is a simple hypergraph?
[T4.2] What is the maximum number of edges of a t-uniform intersecting simple hyper-
graph on vertex set [n], if n ≤ 2t − 1 ?
[T4.3] Check the calculation of the lower bound in Theorem 4.4 if the upper bound mn
¡ ¢
is used for m
n
.
[T4.4] Define isomorphism for tournaments.
[T4.5] Show that any T4 contains a transitive T3 .
[T4.6] Show that a digraph with a good coloring (defined in the proof of Theorem 4.13
has an even cycle!
[T4.7] Find the mistake in the proof of Joke 1.
[T4.8] Find the strongest theorem generalizing the results of Jokes 2 and 3.
Exercise set 4.
[4.1] Let H be an antichain with fk edges of k elements. Prove the so called LYM
inequality:
n
X µ ¶−1
n
fk ≤1
k=0
k
[4.2] Prove that an intersecting simple hypergraph on vertex set [n] has at most 2n−1
edges!
[4.3] Prove that any cyclic permutation extends at most t edges! (See the proof of
Theorem 4.2.)
[4.4] Prove that if Tn is transitive then its vertices can be labeled with 1, 2, . . . , n so that
all edges point from smaller label to larger label.
[4.5] Work out the details for the lower bound in Theorem 4.7
[4.6] Work out the inductive proof for the upper bound in Theorem 4.7.
[4.7] Prove Theorem 4.8 by interpreting the stated inequality!
[4.8] Prove that every tournament has a directed Hamiltonian path.
[*4.9] Prove that a k-term A.P. in [n] can intersect at most kn other k-term A.P.-s (in
[n]).
[4.10] Show that D = r2 is a good dependency bound in the proof of Theorem 4.12.
45
5 LINEAR ALGEBRA METHOD
Theorem 5.1 Oddtown (under the stated rules) has no more clubs than the number of
inhabitants.
Proof. Assume that the inhabitants form the vertices and the clubs form the edges of
the hypergraph H = (V, E) . Set V = [n] and E = {e1 , e2 , . . . , em }. Consider the linear
space of n-component vectors over the two-element field F2 = {0, 1}. The edges of H are
viewed as elements of this linear space by looking at ei as a row vector in the incidence
matrix of H. The two rules of Oddtown can be expressed nicely with the standard inner
product:
(
1 if i = j
ei ej =
0 if i 6= j
Claim: e1 , e2 , . . . , em are linearly independent.
To see this, assume that
m
X
λi ei = 0 (λi ∈ F2 )
i=1
and take the inner product of both sides by ej . Using the property of the inner product
displayed above, we get λj = 0 and this is valid for any j proving the claim.
Therefore m ≤ the dimension of the linear space = n. ¤
The main motive of this cute proof is that the dimension is an upper bound for
the number of linearly independent elements in any linear space. One can easily find a
similar theorem and proof for “Modprimetown” (exercise [5.1]). A further trick is needed
46
for the proof of the “Modprimepowertown” theorem (exercise [5.2]). Surprisingly, it is
unknown whether a similar “Modsixtown” theorem is true or not. But presently there is
the strong belief in Modsixtown that they will never have more clubs than the number
of inhabitants.
Fisher inequality.
Theorem 5.2 Assume that H = (V, E) is a simple hypergraph without empty edges and
each pair of distinct edges intersect in precisely λ elements. Then |E| ≤ |V |.
Proof. There are some easy special cases treated separately. For λ = 0 the edges
are pairwise disjoint, so the theorem follows immediately. If there is an edge e ∈ E
with |e| = λ then the theorem follows easily ([T5.1]). Assume these cases are excluded,
|V | = n, |E| = m.
The edges of H are considered again as row vectors from the incidence matrix. The
intersection property of the edges can be expressed with the inner product as follows
(with suitable positive integers γi ):
(
λ + γi if i = j
ei ej =
λ if i 6= j
Claim: The vectors ei are linearly independent in the linear space of n-component
P
vectors of real numbers (i.e. in Rn ). Assume, on the contrary, that mi=1 αi ei = 0 and
compute the inner product of both sides by ej . We get that for all j (j ∈ [m])
λβ + αj γj = 0
Pm
where β = i=1 αi . Expressing αj (γj 6= 0), we get that for all j ∈ [m]
λ
αj = − β (3)
γj
proving the claim if β = 0. If β 6= 0 then adding (5) for all j ∈ [m] we get
à m !
X 1
β = −λ β
j=1
γ j
47
which is contradiction (λ > 0, β 6= 0). This proves the claim and the theorem follows
from the dimension argument. ¤
Proof. Here is the tricky coloring of the edges of the complete graph K = K(n−1)
3
with red and blue. Associate the vertices of K with the edges (triples) of the complete
3
hypergraph T = Kn−1 . Color an edge of K red if the corresponding two triples of T
intersect in precisely one vertex. Otherwise color the edge blue.
A red complete subgraph of K corresponds to triples of T pairwise intersecting in
precisely one element. From Theorem 5.2, there are at most n − 1 such triples. A blue
complete subgraph of K corresponds to triples of T pairwise intersecting in zero or two
elements. Since three is an odd number, we have an oddtown situation, theorem 5.1
shows that there are at most n − 1 triples again. Therefore in our coloring of the edges
of K there is no monochromatic Kn . ¤
We note here that the above construction can be generalized to give superpolynomial
lower bound for R(n). For a fixed prime p define a red-blue coloring by associating ver-
tices to (p2 − 1)-element subsets of an m-element set and coloring the edge between two
vertices red if the corresponding subsets has intersection size congruent to −1 (mod p).
Otherwise the edge is blue. Observe that proposition 5.4 gives this coloring for p = 2.
However, it is not as easy to show that under this coloring there are no large monochro-
¡m¢
matic complete subgraphs [not larger than 2 p−1 ].
Two-distance sets. A one-distance set in Rn is a set of points (vectors) such that
any two of them have the same distance. It is easy to see that a one-distance set has at
most n + 1 points and equality is possible (exercise [5.3]). How many points can form a
two-distance set in Rn , i.e. a set such that the distances between the points take only
¡ ¢
two values? It is easy to find a two-distance set with n2 points ([T5.3]). The next
theorem gives an upper bound not far from this lower bound.
48
(n+1)(n+4)
Theorem 5.5 A two-distance set of Rn has no more than 2
points.
the distance of x, y is ||x − y||. Define the polynomials (in n variables) fi (x) for i ∈ [m]
as
fi (x) = (||x − ai ||2 − δ12 )(||x − ai ||2 − δ22 )
and substitute aj for x! All but the j-th term vanishes and the j-th term is λj (δ1 δ2 )2
which can be zero only if λj = 0. Since j was arbitrary, the claim is proved.
On the other hand, looking at the definition of fi , all fi can be written as a linear
combination of polynomials of five types:
à n !2 à n !
X X
x2k ; x2k xj ; xi xj ; xi ; 1
k=1 k=1
thus (adding the number of these polynomials type-to-type) all fi belong to a linear
space of dimension at most
n(n + 1) (n + 1)(n + 4)
1+n+ +n+1=
2 2
so their number, m, can not exceed the claimed value. ¤
49
¡p+q¢
Theorem 5.6 If (E, F) is cross intersecting then m ≤ p
.
Proof. We say that an ordering of [n] is compatible with the pair (ei , fi ) if all elements
of ei precede all elements of fi in the ordering. There are
µ ¶
n n!
p!q!(n − p − q)! = ¡p+q¢
p+q p
orderings compatible with a pair (ei , fi ) and the cross-intersecting property implies that
each ordering of [n] is compatible with at most one pair (ei , fi ). Therefore
n!
m ¡p+q¢ ≤ n!
p
50
and fi have non-empty intersection. Since we have cross-intersecting hypergraphs, this
means
(
0 if i 6= j
gi (aj ) =
6= 0 if i = j
and this condition implies that the polynomials gi are linearly independent (exercise
[5.6]). Therefore their number, m, is at most the dimension d, proving theorem 5.4. In
fact, the linear independence of the gi -s follow from the weaker assumption
(
0 if i < j
gi (aj ) =
6= 0 if i = j
(exercise [5.7]) and it gives a generalization of Theorem 5.4. for so called skew cross-
intersecting hypergraphs.
Proof. (Tverberg) Assume that [n] is the vertex set of Kn and we have a partition into
m complete bipartite graphs in the form [Ai , Bi ], where i ∈ [m] and the edges of the
51
complete bipartite graphs are between Ai and Bi . Consider n variables xj (j ∈ [n]) and
define 2m linear forms as follows:
X X
Li = xj and Mi = xj
j∈Ai j∈Bi
The condition that the complete bipartite graphs form edge disjoint partition is trans-
lated as
X
xi xj = L1 M1 + L2 M2 + . . . + Lm Mm (4)
1≤i<j≤n
and
x1 + x2 + . . . + xn = 0
Discrepancy of hypergraphs.
The discrepancy of a hypergraph is a measure of the most balanced bipartition of
the vertex set. Let Ψ denote a function from V to {−1, +1} and let H = (V, E) be a
hypergraph. Then the discrepancy of H is defined as follows.
X
disc(H) = minΨ maxe∈E | Ψ(x)|
x∈e
Proof. Assume V = [n], E = {e1 , e2 , . . . , em }. We allow Ψ to take real values from the
interval [−1, +1]. A vertex x is called bad for Ψ if Ψ(x) ∈ (−1, +1). An edge ej is called
52
bad for Ψ if it has more than t bad vertices. We shall define a procedure to get a Ψ
with no bad edges and at each step the following condition will be preserved for all bad
edges:
X
Ψ(x) = 0 (5)
x∈ej
The procedure starts with Ψ ≡ 0 which clearly satisfies (5). For the general step,
assume that there exist bad edges e1 , e2 , . . . , er for Ψ and let 1, 2, . . . , s be the bad vertices
of H for Ψ.
Claim: r < s. (Exercise [5.9])
Consider the unknowns y1 , y2 , . . . , ys . Using the claim, the system of r homogeneous
linear equations
X
yi = 0
i∈ej
5.3 EIGENVALUES
Regular graphs of girth five.
Consider an r-regular (simple) graph of girth five. At least how many vertices do we
need in such a graph? An arbitrary vertex is adjacent to r further vertices and all of
these are adjacent to r − 1 further vertices. Could it be possible that we double-counted
some vertices so far? The answer is no, thus we have at least r2 + 1 vertices in such a
graph. (Check this, [T5.4]). But is it possible to find r-regular graphs with girth five
53
which have only r2 + 1 vertices? For r = 2 the five-cycle, C5 , is a trivial example. For
r = 3 the famous Petersen graph is such an example.
Unfortunately such beautiful graphs are rare as the next theorem shows.
Theorem 5.9 (Hoffman-Singleton) If G is an r-regular graph with girth five and has
r2 + 1 vertices then r ∈ {2, 3, 7, 57}.
Proof. Set n = r2 + 1, V (G) = [n] and consider the adjacency matrix A = [aij ] of G
defined as the n × n matrix
(
1 if ij is an edge of G
aij =
0 if ij is not an edge of G
Claim. A2 + A − (r − 1)In = Jn where In is the n × n identity matrix and Jn is the
n × n all-ones matrix.
To prove the claim, notice that the diagonal entries of A2 are r so the equation is
valid for the diagonal entries (the diagonal of A is zero). Assume i 6= j. If (ij) is not an
edge of G then there is precisely one vertex of G is adjacent to both i and j (no C4 in G)
therefore the ij entry in A2 is 1. If (ij) is an edge of G then no vertex of G is adjacent
to both i and j (no C3 in G) therefore the ij entry in A2 is 0. In both cases the claimed
matrix equation is correct.
54
By the principal axis theorem the symmetric matrix A has an orthogonal basis from
eigenvectors. Let f be the all-ones column vector with n elements. Then Af = rf hence
f is an eigenvector with eigenvalue r. Consider an eigenvector e orthogonal to f , by
definition, Ae = λe, fe = 0 (with inner product notation). Multiply the equation of the
claim by e from the right which gives
λ2 e + λe − (r − 1)e = 0
r + m1 λ1 + m2 λ2 = 0. (7)
2r − r2 + (m1 − m2 )s = 0. (8)
s4 − 2s2 − 16(m1 − m2 )s − 15 = 0.
This equation implies that s is a divisor of 15, its possible values are 1, 3, 5, 15. Therefore
s2 +3
r= 4
∈ {1, 3, 7, 57} but r = 1 is impossible (girth is ∞). ¤
55
Self-test 5.
[T5.1] Prove Fisher inequality if some edge has λ vertices.
[T5.2] How do you get de Bruijn-Erdős theorem from Fisher inequality?
¡ ¢
[T5.3] Construct a two-distance set in R2 with n2 points.
[T5.4] Show that an r-regular graph of girth five has at least r2 + 1 vertices.
Exercise set 5.
[5.1] Assume that p is a prime and H = (V, E) is a hypergraph in which the edge sizes
are not divisible by p but each pair of edges has intersection size divisible by p. Prove
that |E| ≤ |V |.
[5.2] The same as exercise [5.1] but replace the prime by a power of prime. Hint: show
that the edges as row vectors are linearly independent in the linear space of n-component
vectors over the field of rational numbers .
[5.3] Show that a one-distance set of Rn has at most n + 1 points.
[5.4] A transversal of a hypergraph H = (V, E) is a set T ⊆ V such that T ∩ e 6= ∅ for
all e ∈ E. The transversal number of H is defined as
56
[5.8] Assume that we want to cover the edge set of the complete graph Kn by the edges of
complete bipartite graphs (allowing to cover an edge many times). How many bipartite
graphs do we need?
[5.9] Show that r < s at an arbitrary step in the procedure of the proof of Theorem 5.8.
[5.10] Prove that λ can be defined as required in the proof of Theorem 5.8 and condition
(*) is preserved for the new Ψ.
P
[5.11] Show that the final Ψ in the proof of Theorem 5.8 satisfies | x∈ej Ψ(x)| ≤ 2t − 1
for all j, 1 ≤ j ≤ m.
57
6 FRUIT SALAD
Theorem 6.1 A hypergraph has an SDR if and only if | ∪i∈S ei | ≥ |S| for all S ⊆ [m].
Theorem 6.2 A bipartite graph [A, B] has a matching from A to B if and only if
|Γ(S)| ≥ |S| for every S ⊆ A. (Here Γ(S) denotes the set of vertices adjacent to some
vertex of S.)
Outline of a short proof by induction on |A|: if strict inequality holds for all nonempty
proper subsets S, delete an edge and apply induction; if there is equality for some S
then apply induction for the bipartite graphs [S, Γ(S)] and [A \ S, B \ Γ(S)]. ¤
The proof above does not provide a fast algorithm to find a matching or a subset
S ⊆ A violating the condition. Such a proof can be obtained by using the alternating
path method .
Theorem 6.3 The subsets of [n] can be partitioned into symmetric chains.
58
6.3 A property of n sets on n elements.
A vertex deletion of a hypergraph means that we remove a vertex x from the vertex set
and each edge e is replaced by the edge e \ x. In other words, column x is removed from
the incidence matrix of the hypergraph.
Theorem 6.4 Asssume that a simple hypergraph has n vertices and n edges. Then there
is a vertex deletion which gives a simple hypergraph.
Proof. Assume that the edges are e1 , . . . , en and define a graph G with vertex set [n]
and with edges ij if and only if the symmetric difference of the edges ei and ej is a single
vertex x. If Theorem 6.4 is not true then we have n edges in G. However, G can not
contain any cycle - prove it! - thus we have a contradiction. ¤
∪i∈I1 ei = ∪i∈I2 ei
Proof. Consider the edges as 0 − 1 vectors in Rn . They are linearly dependent. This
gives easily the proof. ¤
∪i∈I1 ei = ∪i∈I2 ei
moreover
∩i∈I1 ei = ∩i∈I2 ei
59
Proof. Consider the vectors in R2n whose first n coordinates are determined by ei and
the last n by the complement of ei . These vectors are in an n + 1-dimensional subspace
of R2n therefore they are linearly dependent. Finish the proof. ¤
No proofs are known without linear algebra.
Proof. Assign real variables xj to vertices and consider the system of homogeneous
P
linear equations j∈ei xj = 0. ¤
The critical hypergraphs have a surprising connection with digraphs without even
cycles. Assume that D is a digraph, define the hypergraph H(D) as follows. The
vertices of H are the vertices of D and the edges of H are the sets x ∪ Γ+ (x) where x is
a vertex of D and Γ+ (x) is the ’outset’ of x.
Theorem 6.8 For a strong digraph D, H(D) is critical if and only if D has no even
directed cycles.
Theorem 6.9 An r-uniform hypergraph with more than r!(s − 1)r+1 edges contains a
sunflower with s petals.
| ∪ki=1 ei | ≤ r(s − 1)
60
and there exists a vertex of degree more than
r!(s − 1)r+1
= (r − 1)!(s − 1)r
r(s − 1)
Since Sbi covers [3k + 1], their union for bi ∈ B covers C n(k + 1) times. Therefore for
(k+1)n
some x ∈ [p − 1] at least 3k+1
> n/3 bi satisfy xbi ∈ C (mod p) and these bi -s form a
sum-free subset A in B. ¤
Remarks. Theorem 6.1 is Hall’s theorem (1935). The SCD theorem (Theorem 6.3) is
from de Bruijn, Tengbergen and Kruyswijk (1951). Theorem 6.4 is from Bondy (1972),
Theorem 6.5 is from Tverberg (1971), Theorem 6.6 is from Lindstrom (1993), Theorems
6.7 and 6.8 are from Seymour (1974). The sunflower theorem (Theorem 6.9) is from
Erdős and Rado (1960) - the bound given is not sharp. In fact, it seems to be a very
difficult problem to decide whether for s=3 the bound can be improved to cr . The
complete r-partite hypergraph with two vertices in each partite class and with edge
multiplicity r − 1 shows that 2r (r − 1) edges do not give a sunflower with three petals.
Theorem 10 is from Erdős (1965). It is not known whether n/3 is the best bound, Alon
and Kleitman constructed a set B of n positive integers with no larger sum-free subset
than 12n/29.
61
7 ADVANCED MENU
Lemma 7.2 Let A be a nonnegative real matrix with integral row and column sums.
Then the elements of A can be replaced by their floor or ceiling so that all row and
column sums retain their values.
Proof. A non-integral entry (NIE) of A gives another NIE in the same row which gives
another NIE in the same column which gives another NIE in the same row, etc. This
procedure gives an alternating cycle of NIE-s along which we can replace the elements
to get at least one integral entry and keep all row and column sums. Then repeat the
procedure. ¤
¡n−1¢
Set M = r−1
(the number of factors) and m = n/r (the number of edges in each
factor). Define an m-partition of a set X as a partition of X into m parts, such that ∅
is allowed with multiplicities. Example: ∅, ∅, 1 is a 3-partition of [1].
Lemma 7.3 For every integer k ∈ [0, n] there exist m-partitions of [k], A1 , A2 , . . . , AM
¡ n−k ¢
with the following property (*): each subset S ⊆ [k] occurs in exactly r−|S| partitions.
62
Observe that Lemma 7.3 gives the theorem: for k = n it says that S ⊆ [n] occurs in
¡ n−n ¢
exactly r−|S| partitions - this number is zero except for |S| = r when it is one!!
Lemma 7.3 is proved by induction on k. The base is k = 0 when all partitions
consist of m empty sets. Assume that for some k < n the m-partitions A1 , A2 , . . . , AM
satisfy (*). Define matrix A as an M × 2k matrix where the rows are associated with
the partitions the columns are associated with the 2k subsets of [k]. The entry of A
associated to row Ai and column S is 0 if S ∈
/ Ai , otherwise it is
r − |S|
n−k
r−|S|
except when S = ∅: in that case the entry n−k
is taken with the multiplicity of S in Ai .
In the example above, A is a 5 × 4 matrix, the columns are associated to ∅, 1, 2, {1, 2}.
The first four rows are [1/2, 1/4, 1/4, 0] and the fifth row is [1, 0, 0, 0] because ∅ has
multiplicity two in A5 .
The row sums of A:
X r − |S| 1 X
= (mr − |S|) = 1
S∈A
n − k n − k S∈A
i i
Using Lemma 7.2, round A to integral B keeping row and column sums. Since the row
sums are all ones, there is precisely one 1 in each row of B, this corresponds to a block
Si ∈ Ai .
Change A1 , A2 , . . . , AM by replacing Si with Si ∪ {k + 1} (and keep all other blocks
unchanged). This defines A∗1 , A∗2 , . . . , A∗M and we show that they satisfy (*) for k + 1.
Assume S ∗ ⊆ [k + 1].
Case 1. k + 1 ∈ S ∗ . Now S ∗ appeared 0 times in the Ai -s and appears in the A∗i -s as
many times as the column sum of B which is equal to the column sum of A thus
µ ¶ µ ¶
n−k−1 n−k−1
=
r − (|S ∗ | − 1) − 1 r − |S ∗ |
63
showing property (*) for k + 1.
¡ n−k
¢
/ S ∗ . Now S ∗ occurred
Case 2. k + 1 ∈ r−|S ∗ |
times in the Ai -s (inductive hypothesis)
but disappeared as many times as extended by k + 1 i.e. by column sum of B which is
equal to column sum of A thus
µ ¶ µ ¶ µ ¶
n−k n−k−1 n−k−1
− =
r − |S ∗ | r − |S ∗ | − 1 r − |S ∗ |
Lemma 7.4 Assume that e is an edge of a normal hypergraph H. Form a new hyper-
graph H+ by adding a new copy of e. Then H+ is also a normal hypergraph.
and the proof is done. Otherwise consider a proper coloring of H with ∆(H) = t colors,
assume that e is red. Remove all red edges and add e+ . This is a partial hypergraph of
H with maximum degree t − 1 so there is a proper coloring with t − 1 colors. Adding
64
back the red edges, we have a proper coloring of H+ with ∆(H+ ) = t colors and the
lemma is proved.
The next lemma shows that normal hypergraphs behave nicely for another pair of
parameters. Let ν(H) denote the maximum number of pairwise disjoint edges in a
hypergraph H (matching number) and let τ (H) denote the transversal number, the
minimum number of vertices needed to intersect all edges of H (see Exercise 5.4 in
handout). The obvious inequality τ ≥ ν becomes equality for normal hypergraphs.
65
construction. Assume that on the vertex set of G we have a hypergraph H whose hyper-
edges span complete subgraphs in G and each edge of G is covered by some hyperedge
of H. It is easy to see that the dual of H is a suitable hypergraph, i.e. the intersection
graph of H ∗ is G. The cited special case is when the edge set of G defines H. Another
(not very economic) choice of H is to select all complete subgraphs of G as hyperedges.
The best choice is to select the minimum number of complete subgraphs needed to cover
the edge set of G. However, as we shall see, there is another useful choice for H called
the clique-hypergraph of G: the edges of H are the complete subgraphs of G maximal
for inclusion.
Example. Consider the graph G obtained from a triangle T = K3 by adding three
new vertices, each adjacent to a different pair of vertices of T . (G has six vertices and
nine edges.) The dual of G has 9 vertices, the dual of all complete subgraphs of G
has 19 vertices, the dual of a minimum cover of G has 3 vertices and the dual of the
clique-hypergraph of G has 4 vertices.
The advantage of the clique-hypergraph is that its dual has the Helly-property: pair-
wise intersecting edges have a nonempty intersection.
The perfect graph theorem. Let ω(G) denote the number of vertices in the largest
complete subgraph of G. A graph is called perfect if χ = ω for all induced subgraphs.
The perfect graph theorem (PGT) is the following result of Lovász:
Proof. Assume that G is perfect, i.e. ω = χ for all induced subgraphs. This implies
that ∆ = q for all partial hypergraphs of H ∗ , the dual of the clique hypergraph H of G -
in other words, H ∗ is normal (Helly property is needed!). Lemma 15 gives that ν = τ for
all partials of H ∗ which gives that ω = χ for all induced subgraphs of the complement
of G. ¤
Remark. Lovász’s proof of PGT is from 1972. It was a conjecture of Berge (1960)
who also had a stronger conjecture, the perfect graph conjecture (PGC) claiming that a
graph is perfect if and only if it does not contain odd cycles of length at least five and
66
their complements (as induced subgraphs!). The PGC is recently proved by Chudnovsky,
Robertson, Seymour and Thomas. However, the proof is very long structural analysis
and definitely does not make the PGT obsolete. An introduction to the subject is
Golumbic: Algorithmic Graph Theory and Perfect graphs (1980).
Theorem 7.7 Assume that p is a prime, L is a set of s integers, and we have a hyper-
graph with n vertices and with m edges e1 , e2 , . . . , em satisfying the following conditions:
for each i, |ei | ∈
/L (mod p); for each i, j, i 6= j, |ei ∩ ej | ∈ L (mod p). Then
µ ¶ µ ¶ µ ¶
n n n
m≤ + + ... + .
0 1 s
Proof. The claimed upper bound is the dimension of polynomials in n variables with
degree at most s which are linear in all variables (products of at most s distinct variables
form a basis). The proof idea is to define m linearly independent polynomials of that
type. Apart from a tiny technical problem, the following definition works:
Y
fi (x) := (xei − l)
l∈L
where ei is the 0 − 1 incidence vector of the edge ei , xei is the inner product (mod p).
Linear independence follows by substituting ej to fi and applying the diagonal criterion
(exercise 5.6). The tiny problem is that to get linear polynomials, the domain of fi must
be restricted to {0, 1}n . ¤
Theorem 17 gives a super-polynomial lower bound for the Ramsey number as follows.
Let p be a prime and n > 2p2 . Associate the vertices of the complete graph K with the
67
p2 − 1-element subsets of an n-element set. Color an edge of K red if the corresponding
subsets has intersection size not congruent to −1 (mod p) otherwise color it blue. Apply
Theorem 17 with L = {0, 1, 2, . . . , p − 2} for the red color. Then apply Theorem 17 with
a prime q > p2 − 1 and with L = {p − 1, 2p − 1, . . . , p2 − p − 1} for the blue color.
(the last step uses Sterling formula). The inequality for the chromatic number follows
|V (Gp )| 24p−1
from χ(Gp ) ≥ α(Gp )
≥ 1.7548p−1
. ¤
Borsuk conjectured (in 1933) that every set of Rd with diameter one can be parti-
tioned into d + 1 sets of diameter less than one. The conjecture was proved for d ≤ 3
and for all d if the set is special (centrally symmetric, have smooth boundary). Let f (d)
be the minimum for which every set of diameter one can be partitioned into f (d) sets
of diameter smaller than one. Borsuk’s conjecture was f (d) ≤ d + 1. This fails badly as
proved by Kahn and Kalai in 1992:
√
2d
Theorem 7.9 f (d) > 1.1397
¡4p−1¢
Proof. Let p be a prime, d = 2
. Let H be a set of 4p−1 elements. The hypergraph
KK(d) is defined as follows.
68
The vertex set of KK(d) is formed by the d pairs of elements of H. Formally
V (KK) = {axy : x, y ∈ H}
eZ = {axy : x ∈ Z, y ∈
/ Z}
therefore the maximum distance, i.e. the diameter of KK(d) in Rd is realized if and
only if
|eZ1 ∩ eZ2 | = min{|eZ ∩ eZ 0 | : Z, Z 0 ⊆ H} = λ
69