Buehler - Classical Metalogic PDF
Buehler - Classical Metalogic PDF
Buehler - Classical Metalogic PDF
January 5, 2015
2
Preface
What follows are my personal notes created in preparation for the UC-Berkeley Group in Logic preliminary exam. I
am not a computability theorist, nor a model theorist; I am a graduate student with some knowledge who isalasquite
fallible. Accordingly, this text is made available as a convenient reference, set of notes, and summary, but without even
the slight hint of a guarantee that everything contained within is factual and correct (indeed, some areas are entirely
unchanged from the moment I copied them off the blackboard). This said, if you find a serious error, I would greatly
appreciate it if you would let me know so that it can be corrected. The material for these notes derives from a wide
variety of sources:
Lectures by Wes Holliday
Lectures by Antonio Montalban
Lectures by John Steel
To those Group in Logic students who may be using these notes to help themselves prepare for their preliminary
exam, chapters 1-5, 7, and 9-18 contain relevant material, as well as chapter 8, section 3.
i
ii PREFACE
Contents
Preface i
1 First-Order Logic 1
1.1 Signatures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Languages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2.1 Terms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2.2 Atomic Formulas and Sentences . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2.3 Formal Languages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2.4 Standard Abbreviations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Normal Forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
5 Definability 25
5.1 Definability of Relations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
5.2 Definable Classes of Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5.3 The Skolem Theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5.3.1 Skolemization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
5.3.2 The Downward Lwenheim-Skolem Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 27
iii
iv CONTENTS
6 Amalgamation 31
6.1 Elementary Amalgamation Property . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
6.2 Strong Elementary Amalgamation Property . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
6.3 The Strong Amalgam Property and Algebraicity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
6.4 Interpolation and Definability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
II Computability Theory 61
9 Of Algorithms 63
9.1 Turing Machines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
9.1.1 The Church-Turing Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
9.2 Functions and Recursion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
12 Non-Computable Relations 81
12.1 A Three-Fold Distinction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
12.1.1 Closure Properties and Connections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
12.2 Computably Enumerable Relations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
12.3 Reduction and Computable Separability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
12.4 Many-One Reducibility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
12.4.1 The m-Degrees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
12.4.2 Characterizing the 01 /01 -Complete Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
First-Order Logic
Although our aim is, quite clearly, to speak eventually of full first-order logic, we begin much more simplywith the
basic building blocks thereof. The first of these basic building blocks is the notion of a signature or vocabulary.
1.1 Signatures
A signature is simply a listing of the symbols with which we will be workingour vocabulary, so to speak. This
vocabulary of symbols is divided into three sets based on what kind of thing a given symbol is intended to represent.
Signature
A signature is a (possibly empty) collection of the following sets:
i. The set of constant symbols; symbols which represent a particular entity
ii. For every n N, a set of n-ary relation symbols; symbols which represent a relation which either holds or
does not hold between tuples of n entities.
iii. For every n N, a set of n-ary function symbols; symbols which represent functions mapping n tuples of
entities to entities.
A signature L with no constants or function symbols is, for obvious reasons, called a relational signature; a
signature with no relation symbols is, for less obvious reasons, sometimes called an algebraic signature.
1.2 Languages
In addition to the symbols provided by the signature, every language also has a stock of variables. Any symbol can be
used as a variable provided that it is not already being used. The actual choice of variables is never important, and so
its common to restrict them to x0 , x1 , . . . with natural number or cardinal subscripts.
1.2.1 Terms
The terms of a signature will eventually play a critical role in our work; intuitively, a term is a combination of function
symbols, constants, and variablessomething which would denote an individual if all the variables were assigned.
1
2 CHAPTER 1. FIRST-ORDER LOGIC
Term
The terms of a signature L are the strings of symbols defined as follows:
If a term t is introduced as t( x) this means that x is a sequence (x0 , x1 , . . . , xn ) of variables, and every variable which
occurs in t is among the variables in x.
Atomic Formula
The atomic formulas of a signature L are the strings of symbols generated by:
If s and t are terms of L, then s = t is an atomic formula of L.
For n > 0, if R is an n-ary relation symbol of L, and t1 , . . . , tn are terms of L, then the expression
R(t1 , . . . , tn ) is an atomic formula of L.
Atomic Sentence
An atomic formula in which there are no variables orequivalentlyall of whose terms are closed.
Just as with terms, if we introduce an atomic formula as ( x), then ( s) means the atomic formula got from by
putting terms from the sequence s in place of all occurrences of the corresponding variables in x.
At times, it will be useful to refer to not only that atomic formulas of a signature, but also negated atomic formulas:
Literal
An atomic formula or negated atomic formula; paralleling earlier use, closed literals are those literals without
variables.
not
and
or
= equals
for all
there exists
1.2. LANGUAGES 3
The terms, closed terms, atomic formulas, atomic sentences, etc. of the formal language L are defined to be the same
as those of the signature L. Finally, we define L itself as the class of all its well-formed formulas. The well-formed
formulas of L include, for any atomic formula A of L and any well-formed formula , both
A and
which is quite standard. In addition to these, however, we also allow possibly infinitary constructions with a given
cardinal upperbound on conjunction/disjunction and on quantification:
( x) [] L and ( x) [] L
In addition to the well-formed formulas of the language, we also distinguish a particular class of these formulas:
sentences.
Sentence
A formula of a language L with no free variables.
Some thought quickly shows that every language has a unique signature; since our primary topic will be languages,
we will often conflate the two. In addition, although only occasionally observed elsewhere, given a language L
generated from a signature L with conjunction/disjunction and quantification upperbounds / respectively, we will
use L as the name of this language.
Example 1.
Although the term first-order technically refers to logics quantifying only over objectsa criterion which all of
our languages meet, regardless of the particular , chosenwe will specifically use the term for the language L0 0
which is our primary concern. If an infinitary language is being considered, it will be made explicit. In the L0 0 case,
the more complex definition of language given above simplifies to the one typically given in undergraduate courses:
(
R(t) for t terms
Wffs =
| ( ) | ( ) | x | x for , Wffs
We add the following definitions to our repertoire in order to mirror our natural intuitions on the similarity of
various formulas:
Variant
For a language L and corresponding signature L, we say that two n-ary formulas of L, ( x) and (y), are variants
of one another if ( x) may be obtained from (y) via a uniform replacement of variables (that is, ( x) = ( x)) and
vice-versa.
Note that the relation of variance over the formulas of a language is an equivalence relation; the cardinality of a first-
order language L is actually defined as the number of variant equivalence classes of its formulas. In simpler terms, the
actual choice of variables is immaterial; for our purposes R(x1 , x2 ) may as well be R(x3 , x9 ).
4 CHAPTER 1. FIRST-ORDER LOGIC
x,y for x = y
(0 n ) for {0 , . . . , n }
(0 n ) for {0 , . . . , n }
iI i for {i |i I}
iI i for {i |i I}
( ) for ()
( ) for ( ) ( )
x1 x2 . . . xn or x for x1 x2 . . . xn
x1 x2 . . . xn or x for x1 x2 . . . xn
for
Normal conventions for parenthesizing are also in effect; binds tighter than all the other logical symbols while
and bind tighter than and in turn.
[0 ] [1 ] ... [n ]
[0 ] [1 ] ... [n ]
x0 x1 . . . xn ( x)
where is quantifier-free.
Part I
5
Chapter 2
The most basic example of a model theory is that of propositional logic: Emile Posts much vaunted truth table.
Example 2.
A simple truth table.
p q q p q
T T F F
T F T T
F T F F
F F T F
This simple model theory may be taken as our paradigm; it elegantly represents all facts necessary to determining the
truth or falsity of a given sentence in a formal and concise manner. If more generality were desired (as it may very
well be), imagine assigning truth values to all the atomic propositions available for use. Each of these assignments
gives a different model, a different situation.
2.1 Models
So-called models, interpretations, or structures are the primary topic of model theory; a model will play the role of a
particular instantiation of our signature, one way the constant symbols could have been assigned, one definition each
of the relation symbols could have been given, and one function each of our function symbols could have named. All
of these could, of course, have been different; a model simply gives one way the world could have been.
7
8 CHAPTER 2. THE BASICS OF MODELS
Model
For a given signature L, a model A is an object made up of the following four elements:
i. A set called the domain of A, written dom(A) or dom A; occasionally also referred to as the universe or
carrier of A. The elements of dom(A) are called the elements or entities of the model A. The cardinality of A,
symbolized |A|, is defined to be the cardinality of dom(A) or |dom(A)| .
ii. A referent for each of the constant symbols of the signature L. If c is a constant symbol of the signature, we
write cA for the element named by the constant symbol c in the model A.
iii. For each positive integer n and each n-ary relation symbol of the signature, an extension for this relation
symbol; that is, an n-ary relation on dom(A) (subsets of (dom(A))n ) which the n-ary relation symbol from the
signature names. If R is a relation symbol, we write RA for the relation named by the symbol R in the model
A.
iv. For each positive integer n and n-ary function symbol of the signature, an n-ary function on dom(A) (i.e. a
map from (dom(A))n to dom(A)) which this symbol names. If f is a function symbol, we write f A for the
function named by f in the model A.
As the definition implies, a model is always associated with a signature; to make this connection explicit, given
a signature L, a model A of that signature is called an L-model. Crucially, the domain of a first-order model is not
allowed to be empty, but the extensions of relations are. Finally, we adopt the following notation:
For better or worse, there is no standard way of representing a model; it often falls to the reader to make educated
guesses as to what a writer intended.
From a given signature L, we may construct a number of formal languages, most notably L0 0 . Similarly, a signature
L determines a class of L-models. As foreshadowed above, L-models and a formal language L generated from L
share a very special relationship, namely that the class of L-models exhausts all possible interpretations of the formal
language L, it represents every different way things could be with respect to the language L.
To briefly recap, an L-model A provides an interpretation of the proper type for every symbol in the signature L. For
any constant symbols c, this means cA is some element of dom(A); for any n-ary relation symbol R or n-ary function
symbol f , this means an n-ary relation over dom(A), denoted RA , and an n-ary function f A : dom(A)n dom(A),
respectively. In doing so, however, a model does far moreassuming standard first-order logic, it sets a truth value for
every well-formed formula of any language L generated from the signature L. Mathematicians tend to symbolize this
relationship with |=, writing A |= whenever the formula is made true by the model A and saying A models .
We retain their phrasing, but reserve the symbol |= for the model-theoretic consequence relation. In lieu, we adopt
,
following the usage of modal logicians.
For a model A, |A| is the cardinality of A. For a signature L, |L| denotes the cardinality of L if and only if L is
infinite; otherwise, if L is finite, well say |L| = 0 .
2.2. MODELS AND FORMAL LANGUAGES 9
Truth in a Model
Let L be a signature, L a language generated from L, and A an L-model. Furthermore, let t( x) be a term of L and
a a sequence of elements from A at least as long as x. Then, by recursion on the complexity of the term t,
Note that these three clauses fix the denotation of any term in L when it is paired with a tuple of elements from
dom(A) of appropriate length. This done, we may now define truth in a model or the
relation. Again, the
definition is recursive, but now on the complexity of the formula. First, suppose that ( x) is atomic. Then, given a
tuple of elements a at least as long as x,
A
R(t1 , . . . , tn )[a] is true for some relation symbol R in L if and only if ht1A [a], . . . , tnA [a]i RA
Validity
A sentence of a language L is valid if it is entailed by the empty set; that is, |= . is also called a logical theorem.
Satisfiability
A sentence of a language L is called satisfiable or consistent if there exists an L-model A such that A
;
similarly, a set of sentences of a language L is called satisfiable or consistent if and only if there exists an
L-model A such that A
for every .
10 CHAPTER 2. THE BASICS OF MODELS
2.3 Theories
While models are, unsurprisingly, the primary topic of model theory, theories represent a close second place. Theories
along with the model theoretic consequence relationwill bridge the gap between the formal languages and models.
Theory
A set of sentences of a particular formal language L
Extension of a Theory
An extension of a theory T (or supertheory of T ) is a theory T 0 such that T T 0 .
Modeling a Theory
Let L be a language, L the corresponding signature, A an L-structure, and T a theory in L. A
T if and only if
A
for every sentence T .
Note that it makes sense to speak of a model of a theory (with no element assignment) since every formula under
consideration has no free variables.
Atomic Diagram
For an L-model A, the atomic diagram of A or Diagat (A) is the theory containing all closed literals of the
language L+ such that A+
, where L+ is the language generated by expanding the signature L to include a new
constant for every element of the domain of A and A+ is A with each of these new constants representing the
intended element of the domain.
Couched in less formal terms, the expansion from L to L+ gives the ability to name every element of the model A.
To make this connection clear, the added constants are often subscripted with the names of the elements themselves,
e.g. we add ca for every a A. Of course, simply expanding the signatureand by extension the languagewith new
constants is not enough; the movement from A to A+ guarantees that each of these constants refers to its intended
element of A. A itself is, after all, not an L+ -model.
Elementary Diagram
For an L-model A, the elementary diagram of A or Diagel (A) is the theory containing all sentences of the
language L+0 0 such that A+
, where L+0 0 is the first-order language generated by expanding the signature L
to include a new constant for every element of the domain of A and A+ is A with each of these new constants
representing the intended element of the domain.
Just as above, the expansion from L to L+ gives the ability to name every element of the model A, while the move-
ment to A+ ensures these constants refer as intended. The new stipulation of L+0 0 above is the reason for the term
elementarya common synonym in model theory for first-order.
Finally, while its clear that, for any model A, Diagat (A) Diagel (A), ThL (A) is constructed using a different
(smaller) language. Thus, it is not true that Diagat (A) ThL (A)although, from the perspective of L+ , ThL (A)
Diagel (A). All three of the theories above will prove useful later, but even now they suggest a distinction in the
2.3. THEORIES 11
sameness of models. In particular, note that for two L-models A and B, B
ThL (A) is a strictly weaker condition
than B+
Diagat (A) (for some extension B+ of B) which is in turn strictly weaker than B+
Diagel (A) (for some
extension B+ of B).
L -Equivalent
Two L -models A and B are L -equivalent if and only if T hL (A) = T hL (B), written A L B.
In practice, however, we will constrain ourselves to L0 0 , and so we will almost exclusively use:
Elementary Equivalent
Two L-models A and B are elementary equivalent if and only if ThL0 0 (A) = ThL0 0 (B), written simply as A B.
Proof Sketch.
First, generate the equality closure T 0 of T . Let the domain of A be the equivalence classes of = over the
closed terms of T 0 . For any n-ary function f and tuple t of n terms, set f (t)A = [ f (t)]; that is, set f (t)A to
be the equivalence class which includes the closed term f (t). Similarly, for any n-ary relation R, set RA to
be the set of tuples t such that R(t) is in T 0 .
The method of proof sketched above is a Henkin construction; later, well see that the same process can be expanded
to construct models for arbitrary consistent theories.
12 CHAPTER 2. THE BASICS OF MODELS
Chapter 3
Homomorphism
Let L be a signature and let A, B be L-models. A homomorphism is a function : dom(A) dom(B) such that:
In other words, the existence of a homomorphism from a model A to a model B implies that some portion of B looks
like Aalbeit in a very weak sense. In particular, over that portion of B the functions of L behave as they do in A
and the relations of B contain at least everything they do in A, although a single element of B may represent multiple
elements in A. This definition can clearly be strengthened, andin factwe do precisely this:
Embedding
Let L be a signature and let A, B be L-models. A function : dom(A) dom(B) is an embedding if and only if it
is injective, a homomorphism, and satisfies a strengthened second condition:
For each n > 0, each n-ary relation symbol R of L and each n-tuple a from A, a RA a RB .
Less formally, the existence of an embedding from a model A to a model B means that B contains an exact copy of A
somewhere inside of it.
Isomorphism
Let L be a signature and let A, B be L-models. A function : dom(A) dom(B) is an isomorphism if and only if
it is a surjective embedding. We say A is isomorphic to B if an isomorphism : A B exists, symbolized A B.
Isomorphism is, as usual, taken to mean that the models A and B are essentially the same; that they dont differ in
any interesting respect. In the future, for the sake of brevity, we denote functions : dom(A) dom(B) simply by
: A B. In addition, the following terminology will occasionally be used: homomorphisms : A A are called
endomorphisms of A; isomorphisms : A A are called automorphisms of A. The identity map on dom(A), IA , is an
automorphism for any model A.
The following follows easily from the definitions given.
13
14 CHAPTER 3. RELATIONS BETWEEN MODELS
(i) If A, B, and C are L-models and : A B, : B C are homomorphisms / embeddings / isomorphisms, then
the composed map is a homomorphism / embedding / isomorphism from A to C
(ii) Let A and B be L-models. If : A B is an isomorphism, then the inverse map 1 : dom(B) dom(A) exists
and is an isomorphism from B to A
Proof Sketch.
Simply leverage the definitions above
(i) For every term t( x) and tuple a A of sufficient length, (t A [a]) = t B (a)
(ii) For every atomic formula ( x) and tuple a A of sufficient length, if A
[a], then B
[a]
Proof Sketch.
Simply leverage the definitions above
3.2 Submodels
Submodel
A is a submodel of B, often notated A B, if and only if
A and B are both L-models
dom(A) dom(B)
For all constants in L, cA = cB
For each n > 0, each n-ary relation R in L, and each n-tuple a in A, a RA a RB .
For each n > 0, each n-ary function symbols f in L, and each n-tuple a in A, f A (a) = f B (a).
or, equivalently, A and B are L-models with dom(A) dom(B) and the inclusion map i : dom(A) dom(B) is an
embedding.
3.3. PRESERVING FORMULAS 15
Lemma 3.1.
Let B be an L-model and X dom(B). X is the domain of a submodel of B if and only if
Proof Sketch.
Simply leverage definitions for both directions.
Generated Submodel
Let A be a submodel of B and Y a set of elements of B. A is the submodel of B generated by Y, notated A = hYiB ,
if and only if it is the smallest submodel of B which contains Y. hYiB is also called the hull of Y in B; we call Y
itself a set of generators for A.
Less formally, a generated submodel is the result of taking all the elements in the provided set, all the constants,
and closing under functions (apply the functions a step at a time to generate a hierarchy, the union is the generated
submodel).
Finitely Generated
A submodel B is finitely generated if and only if it is of the form hYiA for some finite set Y of elements and
L-model A .
Proof Sketch.
Its easy to show that Y union an additional individual for every constant symbol in L has cardinality at
worst |Y|+|L|; similarly, the set of functions from L applied to elements of Y can be seen to have cardinality
at most |Y| + |L| + 0 . Altogether, then, the size of the domain of hYiB isusing the lemma abovebounded
by |Y| + |L| + |Y| + |L| + 0 . But this is equivalent to, max(|Y|, |L|, |Y|, |L|, 0 ) which is, of course, simply
max(|Y|, |L|, 0 ). Recall, however, that |L| 0 , and so this is equivalent to max(|Y|, |L|) = |Y| + |L|
Using our previous work, It follows immediately that homomorphisms preserve atomic formulas while embeddings
preserve both atomic formulas and their negations (literals). Isomorphisms, naturally, preserve all formulae.
16 CHAPTER 3. RELATIONS BETWEEN MODELS
Existential
A formula is existential, an 1 -formula (pronounced E-1), or simply an -formula if and only if it is built out of
quantifier free formulas using only , , and . In other words, it can be written in prenex normal form as
x [( x)].
Universal
A formula is universal, an 1 -formula (pronounced A-1), or simply an -formula if and only if it is built out of
quantifier free formulas using only , , and . In other words, it can be written in prenex normal form as
x [( x)].
As the subscripts hint, 1 and 1 formulas are, in fact, members of a natural hierarchy on formulas; while we will use
these two most often, the full context may make our work easier to understand:
.. ..
. .
Quantifier Hierarchy
4 or 4 or
0 and 0 are defined as the quantifier-free formulas.
A formula is in n+1 if and only if it is in the smallest class of 3 or 3 or
formulas which contains the n formulas and is closed under ,
, and adding to the front. 2 or 2 or
A formula is in n+1 if and only if it is in the smallest class of
formulas which contains the n formulas and is closed under , 1 or 1 or
, and adding to the front.
Quantifier-Free
Proof Sketch.
By induction on the complexity of formulas; note that any existential formula can be written in the form
x( x) where is in CNF or DNF. The base case of the induction is atomic and negated atomic formulas;
from here, its easy to see that and are also preserved. The case is slightly harder and relies on the
definition of an embedding.
Notation. For a theory T in a language L, the theory containing all of the -sentences of L
which are consequences of T is denoted T .
3.3. PRESERVING FORMULAS 17
Proof Sketch.
()
Simply leverage the previous preservation results.
()
Suppose A
T for some theory T . We show the existence of a B such that B
T and A B. Let U be
the set of all -sentences such that A
. Assume T U Diagat (A) doesnt have a model. Then, by
compactness, there is some finite set F U such that T Diagat (A) |= F. Using compactness again,
there is some finite set E Diagat (A) such that T |= E F. But, by the definition of U, F
is logically equivalent to an -formula , and so T |= E but then E as a whole is a -
sentence (recall, E must be quantifier-free). Thus, E T and, by assumption, A+
E .
Therefore, A+
and A
Fcontradicting the definition of U. By reductio, T U Diagat (A) must
have a model, and the reduct of this model to our original language fulfills the stipulations for our B.
(i) For any two L- models A, B such that A
T , B
T , and A B, ( x) is preserved downward from B to A; that
is, for every a A, if B
[a], then A
[a] as well
(ii) For some set of -formulas ( x) ( x need not be finite), T |= x[( x) ( x)]; that is, is equivalent
modulo T to a set of -formulas
Proof Sketch.
()
Simply leverage the earlier -preservation results.
()
The case where ( x) is a theory follows easily by the previous proposition. Consider, then, the case where
( x) is not a theory. Let c be a tuple of new constants (expanding the language) and note that (c) is now
a theory. Note further that, like the more general ( x), (c) is preserved downwards between all models
A, B such that A
T , B
T , and A B. Let (c) denote (c) . Its trivial that T (c) |= (c). Consider
T (c), by the definition of and the previous lemma, any model of T (c) is submodel of a (c)
modeland thus itself a (c) model. It follows immediately that T (c) |= (c), and so
T |= (c) (c)
Note that, in the theorem above, ( x), ( x), and x[( x) ( x)] need not actually be well-formed
formulas of L. Nonetheless, their meanings are still clear. More colloquially, the os-Tarski preservation theorem
states that the formulas preserved downwards by the submodel relation are always exactly the 1 formulas (any other
formula preserved downwards by the relation is just equivalent to an 1 formula). This, of course, quickly gives rise
to the following corollary:
18 CHAPTER 3. RELATIONS BETWEEN MODELS
Elementary Submodel
Let A, B be L-structures. A is an elementary submodel of B or B is an elementary extension of Anotated A Bif
and only if A B and the inclusion map is an elementary embedding (that is, an elementary inclusion)
(i) A B
(ii) For every formula ( x, y) of L and every tuple a from A, if B
y (a, y), then B
(a, d) for some element d
from A
Proof Sketch.
()
Simply leverage the definitions
()
Induction on formulas; the un-quantified cases all follow easily using vacuous existentials. For the case,
note that (ii) is easily extended to an if and only if and route through existentials.
That is, being an elementary submodela submodel which preserves all formulasis encompassed simply by the in-
clusion map preserving, for every n, the n formulas downwards.
Chain
Let L be a signature, a cardinal, and (Ai : i < ) a sequence of L-structures. The sequence (Ai : i < ) is a chain
if and only if for every i, j such that i < j < , Ai A j .
3.5. REDUCTION AND EXPANSION 19
Union of a Chain [
Given a chain of models (Ai : i < ), the union of (Ai : i < ) is the L-model A = Ai defined as follows:
i<
[
dom(A ) = dom(Ai )
i<
For any n-ary function symbol f , define f A (a) = f Ai (a) where Ai is any model in the chain containing a
Elementary Chain
A chain of L-structures (Ai : i < ) such that the inclusion maps are all elementary inclusions.
Proof Sketch.
Use the os-Tarski theorem and induction on formulas; by the construction of A , one can argue that
the witness appears at some point in the chain and by the transitivity of elementary models/induction
hypothesis, that it a witness appears in the one under consideration. Thus, by the inductive hypothesis, this
last witness appears in A .
Reduct, Expansion
Let L, L+ be signatures such that L L+ . Let A be an L-model, B an L+ -model such that the interpretations of the
symbols in L coincide in A and B. A is called an L-reduct of Bin other words, A is B Lwhile B is a
L+ -expansion of A.
20 CHAPTER 3. RELATIONS BETWEEN MODELS
Note that, by the forward direction from the first to second statement, for any model D of the elementary diagram of
A in L, there is an embedding of A in the reduct D|L.
1 Preservation
Let L be a formal language, L the associated signature, and A, B L-models. A 1 B if and only if every 1 formula
( x) is preserved from A to B.
Lemma 3.4.
Let L be a formal language, L the associated signature. For any theory U and L-model G, if G
U, then there exists
an L-model D such that D
T and G 1 D.
Lemma 3.5.
Let L be a formal language, L the associated signature, and G, D L-models; if G 1 D, then there exists an L-model E
such that D E and G E.
Proof Sketch.
Let T 0 be the union of the quantifier free formulas of B and elementary diagram of A. Suppose that T 0
is not consistent; noting that constants from one do not appear in the other, derive a contradiction. By
compactness, T 0 has a model.
Theorem 3.9.
Let T be a theory. The following are equivalent:
For every chain of structures (Bi : i < ) if Bi
T for all i, then B
T as well.
There is an 2 -theory U such that |= T U.
Proof Sketch.
() Let U = { is a 2 sentence such that T |= }. Clearly, T |= U. From here, well build, starting with
A |= U and using the lemmas above, an elementary chain A A1 A2 . . . where each Ai 1 Bi Ai+1 .
Since T is preserved under chains, A |= T and A A , A |= T by the theorem above.
Chapter 4
As stated above, every structure has a first order theorynamely, Th(A); but, quite obviously, not every theory has a
first order structure (simply consider a theory with two contradictory sentences). Earlier, canonical models provided a
means for constructing a model of any set of atomic sentences; in this section, we first consider what constraints are
needed to guarantee the existence of a model of an arbitrary theory, then use this result to prove two major theorems
for first-order languages.
Hintikka Set
A theory T in a language L is a Hintikka set if and only if
(H8) If x (x) T , then (t) T for some closed term t of L. If x (x) T , then (t) T for every closed
term t of L
The condition (H7) above is known as the Henkin propertyafter a famous logicianand the closed term t in the
property which makes x(x) true is commonly called a witness to x(x); its often said in this context that t witnesses
the truth of x(x).
21
22 CHAPTER 4. COMPACTNESS AND STRONG COMPLETENESS
Proof Sketch.
Let U be the set of atomic sentences in T . Observe that, by definition, U is equality closed. So, by a
previous theorem, the canonical model A is such that A
U. By construction U is the set of atomic
sentences in A and every a A is of the form t A for some closed term. To show that A
T , argue by an
induction on formulas using the Hintikka properties.
Proof Sketch.
Simply prove that each clause of the Hintikka set definition holds.
The conditions (i) (iii) above are commonly referred to as finite satisfiability, completeness, and having witnesses
respectively.
Proof Sketch.
Show the following:
Extension Lemma: If a theory is finitely satisfiable, then has an extension + in an expanded
first-order language L+ which is finitely satisfiable, complete, and has witnesses.
Then, argue as follows. Let be finitely satisfiable. Then, by the lemma, there is a + in an expanded
language which extends it and is Hintikka (an earlier theorem). By previous work, + has a model A
because it is a Hinikka set; thus, , a subtheory of + , also has a modelnamely A L.
Proof Sketch.
Add 0 many new constants to L, the signature in use; enumerate the sentences of L(0 ), noting that it is
possible to be so since |L| |L(0 )| = 0 . Induct starting with T , adding either a sentence or its negation
at each step according to whichever is finitely satisfiable with the current theory (note that you must prove
it is always possible to add at least one these). In addition, if the added sentence is an existential x[(x)],
also add [ci ], where ci is the least constant of the 0 that were originally added which has not yet appeared
in any sentence of our theory.
Proof Sketch.
Soundness, as usual, is easily established simply by showing that each of the proof rules of a given system
preserves the truth in a model relation,
.
Consistent
A set of sentences is consistent if and only if 0 where can be taken as either a primitive that is always
false or an abbreviation for x(x = x) x(x = x).
Proof Sketch.
With clever usage of compactness, it is enough to show that whenever a theory is consistent, it is satisfiable.
Assume |= ; if is inconsistent, then ` trivially. To see this, assume, then, that is consistent. If
whenever a theory is consistent, it is satisfiable, we now have that is satisfiable. Consider, then, {}.
By assumption, this is unsatisfiable, and soby compactnessthere is a finite subtheory of such that
{} is unsatisfiable. Then, by above, {} is inconsistent. Utilizing the reductio rule of the
proof system (or whatever combination of rules is equivalent to it), we now have a proof establishing ` .
To prove that whenever a theory is consistent, it is satisfiable, first prove that if is consistent, then has
an extension + in an expanded language which is consistent, complete, and has witnesses. From here,
we have that + is Hintikka, and thus has a modelmaking it satisfiable. It follows immediately that is
satisfiable (as earlier, simply take the reduct).
24 CHAPTER 4. COMPACTNESS AND STRONG COMPLETENESS
Chapter 5
Definability
In model theory, definability comes in two distinct forms: the definability of a relation and the definability of a class
of models. Each notion is important in its own right, and so we consider each in turn. The primary distinction between
the two is simply that the former encompasses relations relative to a particular model while the latter describes classes
of models, and is thus relative to only a particular language.
Expanding on this basic definition, its also possible to define a weaker notion:
First-Order Definable with Parameters
Let L be a first-order language, A an L-structure, and X dom(A) a set of parameters. An n-ary relation R is
first-order definable with parameters or X-definable if and only if there exists an L-formula ( x, y) such that
R = (An , b) for b X.
It may be easiest to think of first-order definability with paramters as simply definability in an expanded signature with
constants for all the elements in the parameter set S . In general, proving that a relation is definable is accomplished
by exhibiting the formula which defines the relation; proving that a relation is not definable by invoking a connection
between automorphisms and definable relations:
Thus, to show a relation is nondefinable, it suffices to exhibit an automorphism which either moves an element of
relation out of the relation or moves an element not in the relation into it. As one last note, an element a of a model
is said to be definable if and only if the relation {a} is. All of the above, thus, carries over directly into talk about
particular elements.
25
26 CHAPTER 5. DEFINABILITY
Example 3.
For simple examples of definable relations, consider:
(i) and the domain in any model
(ii) x1 + 3 = x2 in hN, +, S , 0i
Elementary Definable/Axiomatizable
A class of models K is elementary definable or elementary axiomatizable (abbreviated, EC ) if and only if there is
a theory T such that K is the class of all models which make T true; that is, K = Mod(T ).
Note that, for any class K, Th(K) is the largest possible theory which could define K; by simply leveraging the
definitions above, its easy to show that if any theory defines K, then Th(K) does.
While the primary method for showing definability (either elementary or basic elementary) remains simply ex-
hibiting the defining sentence or theory, showing non-definability proves much more difficult. Given a class K, the
standard strategy is to show that Th(K) union a theory whichas a wholeguarantees a model is not in K is finitely
satisfiable by models in K. A simple application of compactness then gives the desired result. The most famous and
important result along this line is owed to Thoralf Skolem.
5.3.1 Skolemization
Skolemization
Let T be a theory of a first-order language L. The Skolemization of T is a theory T + in a first-order language L+
where T T + , L L+ , and
(i) Every L-model A such that A
T can be extended to an L+ -model A+ such that A+
T + .
(ii) For every formula ( x, y) of L+ with x non-empty, there is a term t( x) of L+ such that
T + |= x[y ( x, y) ( x, t( x))]
Skolem Terms
The terms t( x) of L+ in condition (ii) above. These terms are usually generated by simply adding, for every
formula ( x, y), a new function f to the signature which takes x and returns the witness y if it exists; in this case,
the f are called Skolem functions
Skolem Theory
A theory T is a Skolem theory orequivalentlyhas Skolem functions if and only if T is a Skolemization of itself.
Note that being a Skolem theory is relative to the language in use; if T is a Skolem theory under L, it will usually not
be a Skolem theory under L+ where L L+ .
is a Skolem theory in L
|L | = |L|
Proof Sketch.
For every formula ( x, y), create a new function symbol f with arity | x| and add these new functions to
the signature. Make = { x[y( x, y) ( x, f ( x))] : ( x, y) is an L-formula}. Union and repeat.
Corollary.
Let T be a theory in a first-order language L. Then, T has a Skolemization T + in a first-order language L+ with
|L+ | = |L|.
Skolem Hull
Let T be a Skolem theory in a first-order language L, A a T -model, and X a subset of the elements of A. The
Skolem Hull of X is hXiA , the submodel of A generated by X.
28 CHAPTER 5. DEFINABILITY
Proof Sketch.
Use the Tarski-Vaught criterion, noting that Skolemization is specifically designed to force the presence of
witnesses in the submodel.
X dom(A)
AB
|A| = max(|X|, |L|, 0 )
Proof Sketch.
Simply leverage the work above; expand A to A use the Skolemization theorem, then take the Skolem
hull around X.
Proof Sketch.
Just add constants to the signature and sentences saying that no two constants name the same element
(pairwise distinct) to the theory T . Prove this expanded theory is finitely satisfiable and invoke compact-
ness; finally, take the reduct to the original signature.
Fact. If B Diagat (A), then A can be embedded into B; simply consider f (a) = caB .
Proof Sketch.
As earlier, introduce new constants to the signature, then add sentences asserting these constants are
all pairwise distinct from one another to T . In addition, however, also add Diagat (A) to T . Prove finite
satisfiability, invoke compactness, take the reduct.
5.3. THE SKOLEM THEOREMS 29
Proof Sketch.
As earlier, add new constants to L, generating a new language L+ . Take Diagel (A) union sentences
asserting that the new constants are pairwise disjoint and show that this is finitely satisfiable. Next, prove
that A B L. While its very possible that B L is too big, simply leverage the downward Lwenheim-
Skolem theorem.
30 CHAPTER 5. DEFINABILITY
Chapter 6
Amalgamation
D T
B T C T
Each of A, B, C, and D represents a model while the arrows signify embeddings. The model of primary interest in
the diagram is A; in order to investigate a model, its natural to view the ways in which it can be extendedhence B
and C. But even more information can be gained by looking not only at the extensions themselves, but also how those
extensions of A relate to one another in a model containing both of them: hence, D.
Of course, B and C are seldom allowed to be any extension whatsoever; we will assume there is a theory T such
that B
T and C
T which we desire to preserve in the amalgam of B and CD above. A final constraint imposed
on the process of amalgamation is, again, quite natural; since the entire process is used to investigate the model A,
the embeddings from B and C into D had better agree on A. In this context, D is said to be an amalgam of B and C
over A. As we will show, first-order logic has a number of amalgamation properties, general processes whichunder
specific assumptionsguarantee the existence of an amalgam with certain properties. As it turns out, these properties
can be leveraged to a variety of uses beyond the investigation of A, the interpolation and definability theorems in the
last section among them.
The most basic amalgamation process takes, for a a set of generators of A, the theory T above to be the set of all
sentences in the language L(a)immediately giving that B, C, and D are all L(a)-equivalent to one another. In fact,
we will guarantee an even stronger condition: the embeddings from B and C into D will be elementary. Because we
can, we may as well take A to be a submodel of B and B to be a submodel of D, making the relations along the left
side of the diagram and respectively. Note, however, that doing so only makes things slightly simpler and doesnt
constitute an important extension of our original diagram.
31
32 CHAPTER 6. AMALGAMATION
D
ThL(a) (B)
haiB
Proof Sketch.
Do it without the constants first. Let T be the union of the elementary diagram of B and the elementary
diagram of C. Suppose T is not consistent; noting that constants from one do not appear in the other, derive
a contradiction.
D D
B C B C 0 = (C)
A A
Proof Sketch.
Let T = Diagel (B) Diagel (C) {b , c: for all b B A and all c C A}. Show that T is finitely sat-
isfiable; in particular, suppose not and note that we need only consider one element from each elementary
diagram (elementary diagrams are closed under conjunction). From here, simply use the universalization
trick: Diagel (B) |= (a, c) i, j ci = b j , and so A |= (a, d), but then B |= (a, d) (use the earlier).
amalgam does not, in general, exist for any two extensions of a model. In order to guarantee existence, its necessary
to borrow a notion from mathematics: algebraicity.
Algebraic
Let B be an L-structure, X B, and b an element of B. b is algebraic over X in B if and only if there is a formula
(x, y), tuple a X, and n N, such that B
(b, a) n y (y, a)
Algebraic Closure
The algebraic closure of X in B, algB (X), is the set of all b from B which are algebraic over X. A set X is
algebraically closed if and only if X = algB (X)
Proving each of the following is a good exercise in fleshing out an intuitive understanding of algebraic closures:
If B C, then algB (X) = algC (X); use the definition of elementary extension
The major result in this area is, unsurprisingly, a strong amalgamation theorem dealing with algebraically closed sets:
D
ThL(a) (B)
halgB (a)iB
Lemma 6.1.
Given a model B, a tuple a dom(B), and b dom(B) algB (a),
There exists an L-structure A such that B A, c dom(A) dom(B), and (A, a, c) (B, a, b)
There exists L-structures D and C such that a B D, a C D, and b < dom(C).
34 CHAPTER 6. AMALGAMATION
Proof Sketch.
Its easier if we absorb X into the language and add a constant cc for c. Let T = Diagel (B) {(cc ) :
(x) tpB (b/)} {c , d: d B}. If this is inconsistent, then Diagel (B) ` (cc ) ki=1 cc = di , thus
B |= x((x) ki=1 x = di ). Prove that b is algebraica contradiction.
Proof Sketch.
We want D to satisfy T = Diagel (C) Diagel (B) {b , c: b B algB (), c C algC ()}. Suppose its
inconsistent. Then there are finite sets Y B algB (), Z C algC () such that Diagel (B) Diagel (C) `
yY,zZ y = z. Thus we have that Y g(Z) , where g is the elementary embedding from C to D.
Proof Sketch.
Observe that if L2 = L, then L2 Diagel (B) L Diagel (C) is consistent because if (b) ` (z) for
Diagel (B), Diagel (C), then (b) ` x( x), B |= x( x), and C |= x( x)contradicting
C |= (c).
Let B0 be an L1 structure such that B L1 B0 and there is an embedding C L L B0 . Since C L L B0 L ,
there is C0 an L2 structure such that B0 L L C0 L and C L L2 C0 L . Let D = n=1 Bn , an L1 structure;
S
S
we have B L1 D. Observe that D = n=1 Cn , an L2 structure, and thus C L2 D. Observe that gn+1
Proof Sketch.
Let = { is an L-sentence such that ` } = the set of L-implications of . We show first that |= .
Let C be an L2 structure such that C |= . Consider ThL (C L ). Assume it is not consistent; then
` where ThL (C L ), but then and C |= , so Th(C)a contradiction. Let B be an
L2 structure such that B |= ThL (C L ). We have C L B, so there is an L1 L2 structure D such that
B L1 D L1 and C L2 D L2 . By assumption, B |= , so D |= , so D |= , so C |= .
6.4. INTERPOLATION AND DEFINABILITY 35
For A, B L+ -structures, if A L = B L, then for a a tuple of elements from A and B, A
[a] if and only if
B
[a]
There is an L-formula (x) such that ThL (B) ` x[(x) (x)].
Proof Sketch.
36 CHAPTER 6. AMALGAMATION
Chapter 7
7.1 Types
A type isat basea description of a possible element or sequence of elements in a model, often with the use of
parameters from some distinguished set. More formally, let M be an L-structure and A dom(M). L(A) refers to
the language L augmented by constants for each element of A while ThLA (M) is, of course, the theory of M (or, more
properly, that extension of M wherein the added constants refer as intended) in LA .
Type
A set of L(A) formulas p with x1 , . . . , xn free variables is an n-type in M over A if and only if
Th(M)LA x p( x) is satisfiable
An n-type p is complete if and only if for any formula (x1 , . . . , xn ), either p or p. The set of all complete n
type in M over A is denoted S nM (A)
The simplest way to generate a complete type is, of course, to describe an element that actually exists in the model.
If a dom(M), then tp M (a/A) is the set of all n-ary L(A)-formulas ( x) such that M
[a]; tp M (a/A) is, as our
notation hints, obviously itself a complete type. Not all complete types correspond, however, to a tuple from the
model itself. Those that do are said to be realized in the model; those that do not are omitted.
Nota bene. If M N, then ThLA (M) = ThLA (N), and thus S nM (A) = S nN (A). That is, the set
of complete types is constant along the elementary substructure relation.
37
38 CHAPTER 7. TYPES, SATURATION, AND CATEGORICITY
Proving the result above requires a new bit of terminology and a clever lemma:
Partial Elementary Map
A partial elementary map from a model N to a model M is a function : B dom(N) dom(M) such that, for
any b B and L,
N
[b] M
[b]
In a striking continuity of definition, a type is said to be unsupported or non-principal if it is not supported or not
principal, respectively. From only the definitions given, several basic facts about supported and principal types are
immediate:
Fact. If a type p( x) over a set A is complete (with respect to all formulas with parameters from A), then it has
support if and only if it is principal over T .
Theorem 7.1 (Elementary Extensions which Realize all Types of the Model).
For any L-model A, there is an L-model B such that A B and every type of A is realized in B.
7.1. TYPES 39
Proof Sketch.
List all of the complete types over A: { : }. Let B0 = A. By definition, there is a B1 such that
B0 B1 and B1 realizes 0 . Note further that 1 is finitely realized in A = B0 and B0 B1 thus, 1 is
finitely realized in B1 as well. 1 is thus a complete type of B1 , and there is thus a B2 such that B1 B2
and B2 realizes 1 (and 0 ). Generalize to an inductive argument.
40 CHAPTER 7. TYPES, SATURATION, AND CATEGORICITY
Proof Sketch.
Let L+ = L {c1 , c2 , c3 , . . . }. Generate T + , an L+ theory, that is finitely satisfiable, complete, has witnesses,
T T + , and for every m and every ci there is (x) m such that T + ` (ci ). The term model of T +
is a model of T with the required property.
We argue along the lines of the omitting types theorem. First, extend L to L+ by adding a constant for
every element (countably many) and countably many new constants. Let T 1 be the empty theory and note
that, by assumption, T T 1 has a model which is an L+ model.
Given T i1 , T i is generated as follows. Noting that T i1 must be finite and that the formulas of L+ are
countable, consider the next formula in the sequence of L+ formulas which hasnt yet been considered, say
i . If T T i1 {i } has a model, define T i = {i } T i1 ; if not, then every model of T T i1 is a
model, and so define T i = {i } T i1 . In either case, T T i has a model.
Next, we search through T i for any formulas of the form x(x). Every time such a formula is found,
take one of our new constants (of which only finitely have been used), say c , to name an element which
satisfies (this must be possible since T T i has a model. Redefine T i = T i {(c )}.
Finally,
Example 4.
Let T = Th(N). Note that ( x) = {x , 0, x , s0, x , ss0,. . . } is finitely realizable and so a type. It
cannot be supported, however, because N omits .
7.3. ATOMIC AND PRIME MODELS 41
Remark. Its easy to show with Lowenheim-Skolem thatsupposing a countable languageany prime model
must be countable.
Example 5.
Atomic Model
A model M is atomic if and only if tp M (a) is principal for every a dom(M)
Nota bene. The converse does not hold generally, just in the countable case; consider, for
example, uncountable dense linear orders without endpoints
Theorem 7.3.
Let L be a countable, first-order language and T a complete L-theory with infinite models.
If for every n < 0 , S n (T ) is countable, then T has a countable atomic model
If an L-model A is countable, atomic, and models T , then A is a prime model of T
Proof Sketch.
(1) Let 1 , 2 , . . . be a lits of all non-principle, complete types over T . This list is countable, so use the
omitting types theorem; (2) Argue generally, noting the assumption of completeness, and construct the
embedding piece by piece. In particular, prove and use that if a from A and b from B and tpA (a) = tpB (b),
then for every c A, there is d B such that tpA (ac) = tpB (bd) (note that the types in A are principle).
To do this, not that tpA (ac) is generated by some ( xy). Thus, T |= xy (( xy) ( xy)tpA (ac) ( xy).
A |= y(a, y), so B |= y(b, y). Let d be that y. If we iterate this principle, we obtain an embedding;
moreover, it must be elementary since the elements were chosen to have the same types and thus satisfy
the same formulas.
42 CHAPTER 7. TYPES, SATURATION, AND CATEGORICITY
Lemma 7.2.
Let L be a countable, first-order language. If A = algA (), then A is atomic.
Proof Sketch.
Take a from A such that there is a formula ( x) and n such that A |= (a) and n x ( x). Pick so that
n is minimized. Show that A |= x(( x) ( x)) whenever ( x) tpA (a). Let ( x) tpA (a). Let a1 , . . . , an
be that tuples where is true. Assume A 6|= (ai ) for some i. Then, ( x) ( x) is a formula that a satisfies,
but which has fewer than n tuples which satisfy it totalcontradicting our choice of .
Theorem 7.4.
Let L be a countable, first-order language. Suppose A and B are atomic, countable L-models. Then, if A B, A B.
Proof Sketch.
We adapt the argument showing If A is countable and atomic, then A is prime to go back and forth
between the models and generate an isomorphism. Pick a1 A, b1 B such that tpA (a1 ) = tpB (b1 ). Pick
a2 A, b2 B such that tpA (a1 , a2 ) = tpB (b1 , b2 ). The end result of this procedure is a bijection g(ai ) = bi
which preserves types and is thus an isomorphism.
7.4. SATURATION 43
7.4 Saturation
The end of the previous section saw us realizing all the types of some starting model in one of its elementary extensions;
saturation takes this tactic one step further: realizing the types of a model in that very model.
Saturated
Let be a cardinal. A model M is -saturated if and only if for every A dom(M) with |A| < , every complete
1-type p(x) S nM (A) is realized in M. A model M is saturated simpliciter if and only if M is |M|-saturated
Example 6.
7.4.1 Homogeneity
Homogeneity is, as the term suggests, a self-similarity property; for a model M to be -homogeneous for some cardinal
, it must be that, when two parts of M of size < are elementary identical (using parameters from these parts), any
addition to one part can be mirrored by an addition on the other. That is, no difference occurs below the level. More
formally,
Homogeneous
Let M be a model and a cardinal. M is -homogeneous if and only if for any possibly infinite tuples a, b from M
with length < , if (M, a) (M, b), then for every d M, there is a c M such that (M, a, c) (M, b, d). M is
homogeneous simpliciter if and only if M is |M|-homogeneous.
Remark. Later, it will be popular to view this process as dealing in partial elementary embeddings; under this
scheme, the above states that any partial elementary embedding : A dom(M) where A dom(M) and |A| < can
be extended by an additional element c so long as |A {c}| < . The utility of this view clearly lies in the ability to
eventually construct a full elementary embedding of M to itself.
The definition of homogeneity should, of course, be quite familiar; it is, after all, a slight weakening of the alter-
native definition proved above for saturation:
The notion of homogenity also has close ties to the notion of type; simply by definition:
Proposition 7.7.
Suppose M is a 0 -homogeneous model and a, b from dom(M). If tp M (a) = tp M (b), then for any c dom(M), there is
a d dom(M) such that tp M (ac) = tp M (bd)
Theorem 7.6.
If M and N are homogeneous, M N, |M| = |N|, and both realize the same n-types over for every n, then M N.
Putting the facts above together, a characterization of not only 0 -saturation generally, but also countable saturated
models is possible:
7.4.2 Universality
Its clear that in moving from -saturation to -homogeneity something has been lost; luckily, that loss can be made
explicit:
-Universal
A model M is -universal for a cardinal if and only if for every B M with |B| < , B M. A model M is
universal simpliciter if and only if M is |M|+ -universal.
Universality is, as it were, the obvious dual to the previously-defined prime; as many models as possible elementary
embed into a universal model.
(i) M is -saturated
(ii) M is + -universal and -homogeneous
Additionally, if > 0 , then
7.5 Categoricity
Categorical
A theory T is -categorical for a cardinal if and only if T is consistent and for all T -models A, B such that
|A|, |B| = , A B. As usual, a model A is -categorical just in case Th(A) is.
A theory is said to be categorical simpliciter if and only if it is categorical for every cardinal the next result, however,
shows this to be a rather uninteresting result:
Theorem 7.11 (All Categorical Theories are Th(A) for a Finite Model A).
If T is a categorical theory, then T = Th(A) for some finite model A
Restricting to -categorical for some cardinal , however, proves much more nuanced. We already have two ready
examples of 0 -categorical models/theories:
Example 7.
For a, b dom(A), a and b are automorphic (notated a b) if and only if there is an automophism : A A such
that a = b. Putting together many of the results of this chapter,
(e) Given variables x, there are only finitely many pairwise non-equivalent formulas ( x) modulo T
(f) For every n < 0 , every type p S n (T ) is principal over T
(g) T is 0 -categorical
7.5. CATEGORICITY 47
Proof Sketch.
(a) (b) is trivial.
(b) (c) since automorphic tuples realize the same type.
(c) (d), fix n. A realizes k n-types. Suppose that |S n (T )| > K. Let p0 , . . . , pk be types in S n (T ). There
is a formula 0j ( x) such that 0j ( x) p j and 0j ( x) < pi , i , j, setting j = 0j i, j 0i p j . Not that
j,i p j pi ki=1 j,i p j pi . For every i, since pi S n (T ) and i pi , T ` x i ( x), and thus
A |= x i ( x) with some ai . Show that each ai realizes a different type for each ia contradiction.
(d) (e), we know there are k types in S n (T ), p1 ( x), . . . , pk ( x). Show that if ( x) and ( x) are such
that i = 1, . . . , k pi pi , then T ` x(( x) ( x)), possibly by contradiction. Argue
combinatorially.
(e) ( f ), let 1 ( x), . . . , k ( x) be a maximal set of inequivalent formulas. Given p( x) S n (T ), take
( x) := ik,i p i ( x) p( x). If ( x) p( x), then for some i, T ` i and i p, so ( x) ( x).
( f ) (g), every model of T is atomic.
(g) ( f ), every type that is not principle can be omitted (be careful with countability), so ( f ) (g)
since we can have one model realizing the non-principle type and another which does notmeaning they
arent isomorphic.
( f ) (d), Suppose S n (T ) is infinite and every type is principle; let p1 ( x), . . . be a listing of the types in
S n (t). Each pi ( x) has a generator i ( x). Note that the set of the negations of these generatorssay ( x)is
finitely satisfiable (each generator must be false in every type it doesnt generate; every type is realized
in some model). Then it can be extended to a complete typebut our list was supposed to contain all the
types!
(d) (a), Note thatby (f)every model of T is atomic. Moreover, if A is atomic, if a, b from A and
tpA (a) = tpA (b), then cdtpA (ac) = tpA (bd). Using thisas earlierwe can construct an automorphism
f : A A with f (a) = b. Noting that there are finitely many types, there must then by finitely many
equivalent classes.
An interesting corollary of the theorem above is that 0 -categorical models are locally finite:
Locally Finite
A model A is locally finite if and only if every finitely-generated submodel of A is itself finite.
Proof Sketch.
Suppose that a1 , . . . , an M generate an infinite model; if b1 , b2 ha1 , . . . , an i and b1 , b2 ,
tp(a1 , . . . , an , b1 ) , tp(a1 , . . . , an , b3 ) because b1 = t1 (a1 , . . . , an ) ....
A deeper result dealing in categoricityand beyond the scope of these notesis Morleys categoricity theorem;
while no proof is even attempted here, the statement is given and some of the basic notions of the theorem are developed
in the next subsections.
Fair warning, the section on Frass limits below has never been edited while the section on ultrafilters is woefully
incomplete.
Age
Let D be an L-model. The age of D is the class, up to isomorphism, of all finitely generated submodels of D.
Thus, a class of models is an age if and only if it is the age of some model D.
Its easy to show that an age has the following two properties: joint embedding and heredity.
Hereditary Property
A class of models K has the hereditary property (HP) if and only if for every A K and every finitely-generated
submodel B of A, an isomorphic copy of B is in K; in other words, K is closed under the operation of taking
finitely-generated submodels.
Of course, Frass proved more than just that ages have the joint embedding and hereditary properties; ages are
completely characterized by them:
49
50 CHAPTER 8. OTHER MODEL-THEORETIC CONSTRUCTIONS
Proof Sketch.
The forward direction is easy, so the reverse is sketched here. Our strategy is simply to take an arbitrary
class of finitely generated models with JEP and HP, then construct the model D for which it is an age.
Assume = {A0 , A1 , . . . } is an arbitrary class of finitely-generated models with HP and JEP. We build a
chain of models as follows:
Let B0 = A0 . In general, let Bn+1 be the model in such that both Bn and An+1 embed in it (JEP). Let
D = i< Bi . If A , then A embeds in D. If C is a finitely generated submodel of D, then all of the
S
generators must be present in some Bn , and thus C Bn . It follows that C and C D.
Alongside the joint embedding and hereditary properties, we single out one last property which will ultimately
prove necessary for our main result:
Amalgamation Property
A class of models K has the amalgamation property if and only if for every A, B, C K such that : A C,
: A B are embeddings, there is a D K and embeddings : B D, : C D such that =
D
B C
A
Although it is tempting to think so, the amalgamation property (AP) doesnt subsume the joint embedding property
as a special case. In addition to these three properties of classes of models, another property of particular models will
also prove useful:
Ultrahomogeneous
A model D is ultrahomogeneous if and only if whenever A and B are finitely-generated submodels of D, every
isomorphism A B extends to an automorphism of D.
The primary result of this section is the construction of a particular modelthe Frass limitfrom a class of count-
able models K which is itself countable; this particular construction is important because the Frass limit is not only
unique up to isomorphism (i.e. categorical), but also possess two other important properties:
Weakly Homogeneous
A model D is weakly homogeneous if and only if whenever A and B are finitely-generated submodels of D such
that A B and : A D is an embedding, then there is an embedding : B D extending .
Proof Sketch.
Build a sequence of models B0 B1 . . . all in such that for every A , there is a j where A
embeds in B j and for every C A , every i , and embedding g : C Bi , there is a j and an
embedding g0 : A B j . If we can establish this, then the conclusion follows easily. Note that N4 N,
let (i, j, k0 , k1 ) 7 hi, j, k0 , k1 i. At stage s we will build Bs+1 . If s = 2e, let Bs+1 be obtained using JEP with
Bs and Ae . If s is odd, its of the form 2hi, j, k0 , k1 i + 1. Suppose i, j, k0 , k1 < s. Let C be the jth finitely
generated submodel of Bi . Let f 0 be the k0 th finitely generated submodel of Ak1 . If C C 0 , let Bs+1 be
obtained by using AP on C, Ak+1 , and Bs . If C C 0 , do nothing.
Lemma 8.1.
Let C, D be L-models. If the age(C) age(D) and D is weakly homogeneous, then C embeds in D.
Proof Sketch.
Consider increasingly large elements of age(C) and note that, while each element may be disjoint inside of
D, we can construct an extended embedding around the smaller element which is isomorphic to the larger.
Building successively, we obtain an embedding of C.
Proof Sketch.
Let A, B be arbitrary submodels such that A B. Let A1 be the model generated by A0 and c0 an arbitrary
element of C. Since the ages of C and D are identical, there is a A01 in D such that A1 A01 . Note, however,
that A0 embeds in A01 , and soby weakly homogeneousthere is a B1 such that B1 A01 and B B1 . Let B2
be the model generated by B2 and d0 an arbitrary element of D. Repeat the argument above, going from D
to C.
52 CHAPTER 8. OTHER MODEL-THEORETIC CONSTRUCTIONS
Simply use the lemma above with the same model for C and D.
Proof Sketch.
Note that if tp(a1 , . . . , an ) = tp(b1 , . . . , bn ), then ha1 , . . . , an i hb1 , . . . , bn i (think about what happens if
theyre not). Since S n (T ) is finite (say size m), there are at most m distinct submodels generated from n
elementsso set f (n) = m for all n.
Lemma 8.3.
Let a, b be tuples of A. Then the following are equivalent:
(a) hai hbi
Proof Sketch.
(a) (b); Take an atomic formula, say R(t1 (a), t2 (a), . . . , tn (a)). Show that M |= R(t1 (a), t2 (a), . . . , tn (a))
if and only if M |= R(t1 (b), t2 (b), . . . , tn (b)), and so all quantifier-free formulas have the same truth values.
(b) (a); We want to produce f . Take t(a), define f (t(a)) = t(b). Consider atomic formulas and
show that f is well-defined, surjective, and injective.
Lemma 8.4.
If hai is finite, then there is a q.f. formula ( x) qf-tp(a) such that M |= x(( x) qf-tp(a)) (i.e. if b M and
M |= (b), then hai hbi).
8.1. FRASSS CONSTRUCTION 53
Proof Sketch.
Let hai = {t1 (a), . . . , tk (a)}. Let ( x) be the conjunctions of all formulas f (ti1 (a), . . . , til (a)) = t s (a) for all
of the l-ary function symbols f and all tuples (i1 , . . . , il ) kl . Repeat this for both R and R as well.
Homogeneous
A model M is homogeneous if and only if a, b if tp M (a) = tp M (b) a b.
Theorem 8.3.
If M is atomic, then M is homogeneous
Proof Sketch.
Assume tp(a) = tp(b). By atomic, there is ( x, y) generating tp(ac). M |= y(a, y), and so M |= y(b, y).
Let d be such that M |= (bd). Note that (bd) now generates the whole type.
Corollary.
If M is 0 -categorical, then M is homogeneous
Theorem 8.4.
Let L be finite. Let be a class of finitely generated models which is uniformly locally finite and has the HP, the JEP,
and the AP. Let M be the Frass limit of , T = Th(M). Then,
(a) T is 0 -categorical
(b) T has quantifier elimination
54 CHAPTER 8. OTHER MODEL-THEORETIC CONSTRUCTIONS
Proof Sketch.
(a) Note that the Frass limit, M is countable, ultrahomogeneous, and age(M) = . We want to define a
theory T such that the countable models of T are exactly the Frass limit of (because then M |= T and
T Th(M) and if A |= Th(M) A |= T A M).
To say that age(M) , for each A in generated by a A (look at the q f -formula generated above for
model with finite generators). For each hai in , there is a q f formula hai ( x) such that B |= (b) hbi
hai. Let T 0 = { xhai ( x) : hai }.
We want T 1 such that if A |= T 1 , then age(A) . The number of models in with n generators is
|ha1 , . . . , an i| f (n). The number of models in with n generators is less than the total number of L-
models with f (n) elements which is finite. Let B1 , . . . , Bl be all the models in with n generators. For
each Bi , there is a formula Bi ( x) such that M |= Bi ( x) h xi Bi . Let T 1 = { xli=1 Bi ( x) : for n }.
To get ultrahomogeneous, use M is ultrahomogeneous if and only if a, b if q f tp(a) = q f tp(b) cd
such that q f tp(a, c) = q f tp(b, d) and note that all the q f tps are principal. That is q f tp(a) = q f tp(b)
if and only if li=1 Bi (a) Bi (b)
Example 8.
Let be the class of finite graphs with no function symbols. The Frass limit is a graph that has the
property if X, Y R are finite, there is v R such that w X, (v, w) E and w Y, (v, w) < E.
8.2. ULTRAPRODUCTS 55
8.2 Ultraproducts
Ultraproductsalthough considerably more complicated than the procedures given thus farare simply yet another
means of generating new models from existing ones while preserving certain first-order characteristics. The method
originates with Skolem in the 1930s and has been used extensively since os work in 1955. The fundamental
theorem of ultraproducts simply establishes the connection between ultraproducts and first-order characteristics; before
proceeding to it, however, its necessary to establish the notion of an ultrafilter.
8.2.1 Ultrafilters
Recall that P(I) gives the powerset of Ithat is, the set of all subsets of I.
Filter
A filter D over a non-empty set I is a set D P(I) such that
(i) I D
(ii) X, Y D X Y D
(iii) X D and X Y I Y D
That is, a filter D over I contains I, is closed under intersection, and is closed upwards under the subset relation.
Common examples of filters over a set I are:
For each Y I, the filter D = {X I : Y X}; called the principal filter generated by Y
Proper Filter
A filter D over a set I is a proper filter if and only if it is not the improper filter
Generated Filter
Given a non-empty set I and any subset X of P(I), the filter generated by X is the filter D given by
Proposition 8.2.
Let X be any subset of P(I) and let DX be the filter generated by X. Then,
(i) DX is a filter over I
(ii) D is the set of all Y P(I) such that either Y = I or for some Z1 , . . . , Zn X, Z1 Zn Y
(iii) D is a proper filter if and only if X has the finite intersection property
56 CHAPTER 8. OTHER MODEL-THEORETIC CONSTRUCTIONS
Ultrafilter
A filter D over a non-empty set I is an ultrafilter over I if and only if for all X P(I),
X DIX <D
Its common to simply say that D is an ultrafilter and assume that I = D.
Proof Sketch.
Simply leverage the definition of ultrafilter
Put slightly differently, an f -indiscernible set is simply a set of elements from the original X all of whose <-increasing
k-tuples reside in the same partition, all of whose <-increasing k-tuples have received the same color. Although it is
far from obvious, the situation described above and combinatorial results guaranteeing the existence of f -indiscernible
subsets of certain sizes paves the way to many model theoretic results.
To introduce a bit more notation, suppose that it can be shown in the situation above that there exists an f -
indiscernible set Y of size . Since all of the above were arbitrary, this holds irrespective of the actual sets/colors
involved and merely in virtue of their relative sizes. Accordingly, let |X| = , |Y| = , and |B| = . Then,
()k
That is, for any linearly order set X of size and set of colors, a coloring of [(X, <)]k is guaranteed to have a set Y
of size at least such that all of [(Y, <)]k has the same color. This notation was chosen by Erdos and Rado so that if
a result of the form above holds, then it remains true for any increase of on the left or decrease of , , or k on the
right. Both the earliest and most important result for our purposes is:
The key move in adapting these results to model theory is to think of our partitions, our colors, as determined by
whether or not the given <-increasing k-tuple satisfies a particular combination of formulas.
-Indiscernible Sequence
Let A be an L-structure and a set of L-formulas all with k free variables. Suppose Y dom(A) and < is a linear
ordering on Y. Y is a -indiscernible sequence in A if and only if for every formula ( x) and every pair of
<-increasing k tuples a, b,
A
[a] [b]
Again, the above has little intuitive force in isolation, and so we tip our hand a bit. Given an ordering < on dom(A),
note that for any tuple a [(dom(A), <)]k , either A
[a] or A
[a] for each . Form a partition, then, on
the tuples in [(dom(A), <)]k so that tuples in the same part all behave the same with respect to every . Each part
in this partition is -indiscernible. In truth, -indiscernability isnt of considerable interest. It is, however, a stepping
stone to something which is:
58 CHAPTER 8. OTHER MODEL-THEORETIC CONSTRUCTIONS
Indiscernible Sequence
Let A be an L-structure, Y dom(A), and < a linear ordering on Y. (Y, <) is an indiscernible sequence in A if and
only if (Y, <) is simultaneously a {}-indiscernible sequence for every in L.
Proof.
Unsurprisingly, by compactness and Ramseys theorem. Expand L to include constants ci for every i I;
take
Any model of T 0 is as requested, and so it remains only to show finite satisfiability. Consider any finite
subset of T 0 ; we may assume with no loss that it contains T and sentences dictating that all the mentioned
constants are not equal. All that remains are a finite number of formulas from
It follows that there is a maximum arity that occurs among the finite number of , say k. Take a countable
model A of T and consider [(dom(A), <)]k where < is an arbitrary linear ordering on dom(A). Let
denote the union of the and note that we may think of each of the as taking k free variables. Thus,By
Ramseys theorem, there exists a countably infinite -indiscernible sequence Y dom(A). Assign the
constants mentioned so that the ordering <I is preserved by the chosen ordering on Athe result is a model
of the finite subset of T 0 .
Computability Theory
61
Chapter 9
Of Algorithms
Computability theory is, at base, simply an attempt to characterize the intuitive notion of an algorithm, a finite set of
simple, step-by-step directions. Although both computability theory and the notion of algorithm existed long before
computers, both areto the non-logicianmost manifest in these little boxes of metal and plastic. As any computer
scientist will happily tell you, algorithms are wonderfully powerful thingsespecially when they can be executed at
incredibly high speeds. Everything from watching a tv show, to playing a video game, to opening a word document is
governed by step-by-step algorithms encoded into your computer; incredibly complex algorithms to be sure, but still
the step-by-step sets of instructions that constitute our subject matter.
A natural first step towards formally characterizing the intuitive notion of algorithm is to generate a set of intuitive
conditions on how algorithms behave; the list below will serve as our basis:
Algorithm
In this last and only this last case, an algorithm A is said to output o O on input i I and we write fA (i) = o
to convey this fact. Generating a partial function fA in this manner gives the function computed by A, abbreviated of
course to simply fA . The function fA is partial because nothing requires the algorithm A to have an output for every
i I; thus, dom( fA ) I. As a last bit of symbolism, stating that g is a partial function taking a tuple of k inputs
from a set X and possibly returning an output from a set Y will often be abbreviated, following standard mathematical
practice, as g : X k 7 Y. Future references to g will often make the arity of the function explicit as a superscript, e.g.
gk .
To push forward and fully formalize the notion of algorithm, its necessary to specify a programming language,
a means with which to specify algorithms. There are a number of natural choices, andhistoricallyearly research in
computability theory gave rise to a myriad of suggestions. Luckily, they are all equivalent.
63
64 CHAPTER 9. OF ALGORITHMS
Each cell has some a A written in it (the from earlier can be interpreted as a blank)
A finite set of quintuples of the form: hq, a, b, R, si or hq, a, b, L, si where q, s S , a, b A, and L/R specify
movement right or left of the read/write head. Interpreted, in state q, reading a replace it with b and move one
square right/left, going into state s.
fA (i) = o
The standard function notation of mathematics, it turns out, carries over rather effortlessly into talk of algorithms, and
so we leverage this conceptualization as the basis of our analysis. Notice, moreover, that while the function fA doesnt
uniquely specify the algorithm Amaybe a different set of instructions produces all the same input-output pairs!it
does capture the fact that an algorithm exists which produces such and such output on such and such input. Some
thought shows, however, that this is all we truly care about; whether there exists any particular set of instructions with
such and such behavior is the intriguing questionand easily captured within our function notation. The fact that we
give up the ability to distinguish between particular sets of instructions doing the same thing is a very small loss in
return for a great deal of simplicity.
Thinking of algorithms as functions naturally leads to the question of what sets the functions map things between;
for complete generality, we might wish to always have the ability to specify an input set I and an output set O such
that the algorithm f under consideration is a partial map of the form I 7 O. This is certainly doable, but a better
alternative is readily available. Rather than allow wildly shifting input and output sets, assume that every function
maps from Nk to N for some k; at first glance, this might seem to unnaturally limit the algorithms we may consider. As
well soon see, however, the limitations imposed by this assumption are actually remarkably weak; simple encoding
schemes will allow us to effortlessly represent any algorithm with both a countable input and output set.
Having solidified the interpretation of algorithms as functions from Nk to N, the task is to lay out those function
operations which allow us to construct all and only the computable functions. This task will carry us through the next
three chapters, and centers on the notion of recursion. Recursion is, at base, simply defining something in terms of its
previous values (often called an inductive definition in mathematics)a process anyone with programming experience
knows well. As an intuitively computable operation, recursionalongside function composition and several simple
starting functionswill give us a first attempt at defining the computable functions, yielding a class known as the
primitive recursive functions.
Chapter 10
To sum the previous discussion, the primitive recursive functions are an attempt to generate formally a class of func-
tions which corresponds exactly to the intuitively computable functions; in order to do so, we select three types of
functions as obviously computable: the constant functions, the successor function, and the projection functions. Next,
we select two means of combining these functions into more complicated functions which are also intuitively com-
putable: function composition and primitive recursion.
i. Function Composition: Let f be a k-ary function and g1 , . . . , gk l-ary functions all in C, then the l-ary
function h(y) = f (g1 (y), . . . , gk (y)) is in C as well.
ii. Primitive Recursion: If h is a k-ary function, g a k + 2-ary function, and both are in C, then
f (0, x) = h( x)
Primitive Recursive
A function f : Nk N is primitive recursive if and only if there exists a primitive recursive derivation g1 , . . . , gk
with f = gk where each gi is either one of the basic primitive recursive functions or obtained from a previous g j
( j < i) by composition or primitive recursion.
65
66 CHAPTER 10. THE PRIMITIVE AND -RECURSIVE FUNCTIONS
(
0 if x R
R ( x) = 1 if x < R
Lemma 10.1. Y X
If h(z, x) is a primitive recursive function, then g1 (z, y) = h(z, x) and g2 (z, y) = h(z, x) are primitive recursive
x<y x<y
functions as well.
Proof Sketch.
Induction
Proof Sketch.
Composition; let P be a k-ary primitive recursive relation and f1 , . . . , fk primitive recursive functions.
Define,
Q(y) if and only if P( f1 (y), . . . , fk (y))
Note that Q (y) = P ( f1 (y), . . . , fk (y)), and thus is primitive recursive.
Bounded Quantifiers; Let R be a primitive recursive relation and define
x<y
10.2. PRIMITIVE RECURSIVE OPERATIONS 67
where each hi is a primitive recursive function, each Ri is a primitive recursive relation, and the Ri are both mutually
exclusive and exhaustive, then f is itself primitive recursive.
Proof.
Define f by
R1 ( x)) + h2 ( x)sg(
f ( x) = h1 ( x)sg( R2 ( x)) + + hk ( x)sg(
Rk ( x))
and note that this is primitive recursive.
Proof.
f ( x, 0) = 0
(
f ( x, n) if R( f ( x, n), x)
f ( x, n + 1) =
n+1 otherwise
pq = 0
fk (a0 , . . . , ak ) = pha0 , . . . , ak iq
is a primitive recursive function. More generally, the following primitive recursive predicates and functions are con-
structed to make working with encoded sequences easy.
S eq(n) if and only if n is the encoding of a sequence
Proposition 10.1.
Proof.
S eq(n) if and only if n = 0 p n[(Prime(p) Div(n, p)) (q < p[Prime(q) Div(q, n)])
prime(x)Div(n,x) (x)) if S eq(n)
X
sg(
lh(n) =
xn
0
otherwise
Finally, taking f (n) to give the nth prime:
Proof.
Let f, g, h1 , and h2 be as above. Let f2 be the length 2 encoding function and define as follows:
(0) = f2 (m0 , n0 )
f (n) = ((n))0
g(n) = ((n))1
Both of which are, of course, primitive recursive constructions. By the arbitrary nature of f, g, h1 and h2
the theorem is established.
Proof.
Let p(n) be the primitive recursive function which produces the nth prime. Define
f (n) = ((n))n
It follows by the arbitrary nature of f and Q that course of values recursion is primitive recursive, and thus
the theorem is proved.
Proof Sketch.
Utilizing the encoding scheme already defined, we may encode derivations of primitive recursive func-
tions; more importantly, however, we may encode them in such a way that it can be checkedprimitive
recursivelywhether or not a given number e encodes a valid derivation of a primitive recursive function.
Note that, by simply proceeding through the natural numbers, we now have a primitive recursive enumer-
ation within which every last primitive recursive functions encoding must appear. We may go further,
however. Once a derivation is fixed, its certainly algorithmically possible to run the derived function on
any appropriate input; that is, we should be able to construct a computable function such that
But, with such a computable in hand, we may channel the spirit of Cantor and define : N 7 N by
(x) = (x, x) + 1
A moments thought shows that not only should be computable, but it also couldnt have been listed
in our enumeration! It follows immediately that there must exist computable functions which are not
primitive recursive, as asserted.
y[R( x, y)]
gives the least number y such that, for a given x, R( x, y) holds; if no such y exists, the expression is undetermined.
That is, an expression containing the -operator may be undefined because a satisfying y cannot be found. The
definition of -recursion, however, hints at a more subtle difficulty:
-Recursion
Given (x1 , . . . , xk , y) a k + 1-ary function, the result of -recursion on with respect to y is the k-ary function,
Note the shift from talk of relations in the definition of the -operator to discussion of a partial function in the definition
of -recursion. -recursion is not restricted to primitive recursive or even -recursive relations; rather, we allow -
recursion over any -recursive functioneven though many are partial. This allowance is necessary to achieve the
class of functions we desire, but requires users of -recursion to keep the (z y)[( x, z) ] clause in mind; while
there might be a least y satisfying ( x, y), -recursion on will be undefined if ( x, z) isnt defined for all lesser natural
numbers.
Despite this, its common to omit the (z y)[( x, z) ] clause for -recursion on primitive recursive or -
recursive relations; since these are defined everywhere, the result always behaves as expected. Although it is tempting
to do so, the same liberties must not be taken with relations that are not -recursive; indeed, doing so represents an
unwarranted expansion of the class of functions under consideration.
10.5. THE COMPUTABLE FUNCTIONS 71
-Recursive
A function f is called -recursive if and only if f can be built using primitive recursive functions by means of
composition, primitive recursion, and -recursion.
With this last addition, its possible to prove thatfor any Turing machine Mthe k-ary function f Mk is -recursive;
in other words, the -recursive functions are the computable functions, assuming the Church-Turing thesis.
Proof Sketch.
()
.
Fix a Turing machine t and the partial function it defines ft .. N N. Define a primitive recursive predicate
T.Compt (p(x, y, L)q) where C = p(x, y, L)q with L coding a terminating Turing computation with machine
t, input x, and output y:
Informally, a terminating Turing comp with (t, x, y) is a sequence L of machine status descriptions. A status
description is a sequence l = p(T, h, i)q where T specifies the tape (a finite sequence of 0s and 1s) , h the
head (< lh(T )), and i (in {0, 1, . . . , s} if machine has states 0 through s) the Turing machines state. Thus,
L = p(p(T, h0 , i0 )q, . . . , p(T, hm , im )q)q and initial case i0 = 1, T 0 = (0, 1, 1, . . . , 1)(x many 1s), and h0 = 0
and end case ik = 0 (termination state), T k = (. . . , 1, 1, 1, . . . , 1, 0 . . . ) (y many 1s) ik case i < lh(L) and
how (T i+1 , hi+1 , ii+1 ) relates to (T 0 , hi , ii )
Computable Functions
The class of functions which can be computed by algorithm; according to the Church-Turing thesis, the term is
synonymous with the -recursive/Turing-computable functions.
Its important to note that a member of the computable functions may be partial; to distinguish the set of total
functions within the set of computable functions, we define
Total Computable Functions
The subset of the computable functions containing all and only the total functions; that is, functions defined for
every n N; also known as the total recursive or recursive functions
Computable Relation
A relation R is computable if and only if the characteristic function of the relation, R ( x), is a total computable
function; also called recursive.
As a special case, note that for k = 1 we have also given a definition for what it means to say that a set A N is
computable.
Chapter 11
As the section on primitive recursion argued, the computable functions should, intuitively, be capable of enumerating
themselvesproving that they actually can will occupy most of our time in this section. The constructions of this
section not only give a host of basic results about the computable functions, but also will showcase a number of
recurring tools in computability theory.
Proof Sketch.
Simply add extraneous steps to the derivation encoded by e
73
74 CHAPTER 11. THE FUNDAMENTAL THEOREMS OF COMPUTABILITY THEORY
As well see, it is Kleenes T predicate and the -recursion which do the work above; the primitive recursive U does
little more that output the appropriate value.
Proof Sketch.
The key to defining Kleenes T predicate is the notion of a computation; given a derivation f0 , . . . , fn , a
computation that fn ( x) = y is a list of statements of the form:
fi0 ( x0 ) = y0
fi1 ( x1 ) = y1
..
.
fn ( x) = y
where each line fia ( xa ) = ya is justified by the derivation of fn and earlier equations on the computation
list. Less formally, a computation should be thought of as a proof that fn ( x) = y for some x, y. Just as with
derivations, constructing a primitive recursive predicate to test for valid computations is little more than
picking a representation scheme and working through the details. Once this is accomplished, Kleenes T
predicate is defined as
Proof Sketch.
For any k, simply define
k (e, x) = U(C[T (e, x, C)])
using the k-ary Kleene T -predicate.
Note that in k (e, x) = U(C[T (e, x, C)]) above U and T are primitive recursive relations and appears just
once. It follows immediately that only a single application of the -operator in a derivation suffices to produce any
-recursive function.
Corollary.
Let f : N N be computable, total, and a bijection. Then, f 1 is also computable
11.4. THE S NM THEOREM 75
Proof.
Define -recursive g : N N by, informally, fix e N such that f = 1e (by enum. thm., e exists).
Given y N, to compute g(y), search for some computation C and x N such that C says f (x) = y.
Then, output x. (that defines g(y) = x).
Formally,
C[x, L C[C = p(e , p(x)q, y, L)q Comp(C)]] 1
0
or, since we have total, x[ f (x) = y] (the first construction will skip non-total portions of functions, the
latter may stall).
11.4 The sm
n Theorem
Also known as the parameter or parameterization theorem, the sm n theorem establishes, for any n and m, the existence
of a primitive recursive function smn whichgiven the encoding of an m+n-ary function f and m parameterspicks out
an encoding which represents f , but with its first m inputs fixed to the given parameters. At first blush, this seems like
a rather trivial task for such a vaunted theorem; implicit in this operation, however, is that the sm
n functions effectively
allow traversal through an infinite hierarchy of enumeration functions 1 , 2 , 3 , . . . which are based in the same
encoding scheme. The uses for such traversal are, as we will see, quite extensive.
Proof Sketch.
nS (a) ( x) = f (a, x)
Thus, given f (a, x), we may construct a list of parameterized functions fa ( x) = f (a, x) by varying the parameters a.
Using the theorem above, it follows that there is a primitive recursive function s that will allows us to convert each set
a of parameters into the index of the corresponding parameterized function.
Proof Sketch.
76 CHAPTER 11. THE FUNDAMENTAL THEOREMS OF COMPUTABILITY THEORY
0 k (e, x)
(
H (e, x) =
1 otherwise
As it turns out, this is provably impossible; that is, there exist functions which are demonstrably non-computable.
Proof Sketch.
Suppose H (e, x) solves the k-ary halting problem. Define a new k-ary function f as
0 H (x0 , x)= 1
(
f ( x) =
otherwise
and note that f is clearly computable; it follows that f = e f for some e f . Consider, then, f (e f , . . . , e f );
if H (e f , . . . , e f ) = 1, then f (e f , . . . , e f ) contra H (e f , . . . , e f ). If not H (e f , . . . , e f ) = 1, then
f (e f , . . . , e f ) contra H (e f , . . . , e f ) yet again. Thus, in either case, H (e f , . . . , e f ) answers incorrectly.
Corollary.
Let f be any computable function. Then, there is an inverse g for f such that g is computable, where an inverse is
taken to mean
g(y)= x if and only if y range( f ) y range( f )[ f (g(y)) = y]
Proof Sketch.
See the earlier proof for f total, bijective
ke ' kf (e)
Proof Sketch.
Suppose, for the moment, that we can computably construct a table of k-ary computable functions (by
specifying their indices in our K-ary enumeration) that has the following property:
(?) For every computable function f , there is a corresponding row r f such that for every n N,
(i) if f (n), the function f (n) is the nth entry in r f
(ii) if f (n) , the nth entry in r f is instead the function which halts on no input
Given such a table and the closure properties of the computable functions, its easy to see that the diagonal
of the table must itself appear as a row of the table. We thus have the following picture:
As it turns out, its easy to generate a table meeting the requirements above; the k = 1 case is given
explicitly, but the strategy easily generalizes. Consider
Taking x as our row variable and y as our column, the t above generates a table of the required type;
moreover, the process is computable. Define:
This s21 (et , x, y) allows, then, the effective generation of our table, while s21 (et , x, x) generates the diagonal
indices. We may carry out the remainder of the argument as above; since s21 (et , x, x) generates the diagonal
indices, m isby the construction of tthe index of f (s21 (et , x, x)), while the actual fixed point is given by
s21 (et , m, m).
By the definition of et ,
' 1 (1 (m, m), z)
and, finally, by the definition of m,
' 1 ( f (s21 (et , m, m)), z)
' 1f (s2 (et ,m,m)) (z)
1
An equivalent way of viewing the construction above is to notice that the fixed point e above, viewed as a function,
forces itself to be a fixed point relative to f by computing the function f (e ) and then mimicking this functions
behavior.
Corollary.
Let f be total computable; for every k there exists e such that
Wek = W kf(e)
Example 9.
As an alternative to our original proof of Rices Theorem, suppose I is a computable index set that is
non-trivial (, N, ). Get e using recursion theorem, define
b (y) if I(e)
(
f (e) (y) =
a (y) if I(e)
where a < I, b I. If I is computable, then there is a primitive recursive f such that (e) = f (e) .
11.6. KLEENES FIXED POINT THEOREM 79
Example 10.
Is there a partial recursive such that for every e, if We is finite, then (e) and |We | (e)? Define
if (e)
(
We =
{0, . . . , (e) + 1} if (e)
if (e)
(
W f (e) =
{0, . . . , (e) + 1} if (e)
(e) since otherwise We = is finite, and (e). But then, |We | = (e) + 1.
a = f (a,b)
b = g(a,b)
In other words, there are a, b such that Wa = {b} and Wb = {a}.
Proof.
Get a total recursive a = a(b) such that a(b) = f (a(b),b) for every b (by URT). Now, fix b such that
b = g(a(b),b)
by the ordinary recursion theorem. The pair (b, a(b)) then works.
Example 11.
Suppose is computable and A N, and for all e,
Claim: A is productive. The productive function is f where f (y) =? There is a total computable e(y) such
that for every y,
(ey )
(
Wey =
Wy (ey )
(z)
(
W sy (z) =
Wy (z)
where s(y, z) is total computable by S nm . Our productive function is f (y) = (ey ).
80 CHAPTER 11. THE FUNDAMENTAL THEOREMS OF COMPUTABILITY THEORY
Chapter 12
Non-Computable Relations
Two chapters ago, the definition of a computable relation was given; it has been reproduced below for ease of reference:
Computable Relation
A relation R is computable if and only if the characteristic function of the relation, R ( x), is a total computable
function; also called recursive or decidable.
Computable relations, then, have a computable decision procedure for whether any particular input is or is not in the
set; rather appropriately, computable relations are also known as decidable, a computable function can always decide
whether a given input is or is not in the set. This is, undoubtedly, a nice property for a relation to have, but its far from
necessary. Indeed, its natural to immediately distinguish to other classes of relations:
Computably Verifiable
A relation R is computably verifiable if and only if it has a computable verifying function; also known as
semi-recursive or semi-decidable
Computably Refutable
A relation R is computably refutable if and only if it has a computable refuting function; also known as
co-semi-recursive
Similarly, a refuting function (also known as a negative characteristic function) for a relation R is
(
otherwise
R ( x) = 1 if R( x)
We = domain(1e )
Ee = range(1e )
and offer a warning. Up until this point, all relations have been well-behaved, i.e. computable, and so its easy to give
little thought to familiar operations on them. From here forward, however, the relations under consideration need not
be well-behaved, and thusfor example-recursion is no longer always admissible on whichever relation currently
holds our interest. Failure to internalize this expansion in our topic matter is liable to lead to large errors, and it
may serve the reader well to refamiliarize themselves with the limitations of the operations covered thus far vis-a-vis
relations.
81
82 CHAPTER 12. NON-COMPUTABLE RELATIONS
Proof Sketch.
By definition, a computable relation R has a computable verifying function e but then R simply is We
Alternatively, a relation R is 01 if and only if both R and R are 01 /01 . This gives the basis of the arithmetical
hierarchy of relations; in picture form, we have:
01 01 01 01
00 /00 01
Formula Hierarchy Relation Hierarchy
12.1. A THREE-FOLD DISTINCTION 83
In both cases, the arrows denote set inclusion upwards. Our emphasis will, of course, be on the relation hierarchy; its
important, however, to keep the formula hierarchy in mind since it represents the basis for the former.
Some basic ways of classifying relationscomputably verifiable, computably refutable, 01 , 01 , 01 in hand, its
natural to ask how these classifications relate to one another and our previous work; uncovering and clarifying these
relationships is our next task.
Proof Sketch.
()
By induction; show that all of the starting primitive recursive functions are definable, then that composition
and primitive recursion are as well.
()
By induction on the complexity of ; note, in particular, that all of the operations we need to close under
have been shown to be primitive recursive.
Proof Sketch.
(i) (ii)
Suppose R is 01 ; that is, R is defined by y( x, y) for ( x, y) a 00 /00 formula. By the lemma above, ( x.y)
defines a primitive recursive relation Q. Thus, we may explicitly define a verifying function for R by:
(ii) (i)
Suppose R = dom(ke ) for some e, k. Then, by our earlier work, R( x) if and only if there is an encoded
computation C such that T k (e, x, C). Recall, however, that T k (e, x, C) is primitive recursive, and thus,
defined by a 00 /00 formula of LA , (y, x, w). Since e is fixed, it follows immediately from the preceding
that R is defined in N by w (e, x, w), a 01 -formula.
84 CHAPTER 12. NON-COMPUTABLE RELATIONS
Example 12.
Examples of 01 Relations:
(i) Every We under our enumeration scheme
(i) R is 01
(ii) R is computably refutable
Example 13.
Examples of 01 Relations
(i) Every N We under our enumeration scheme
(ii) Kleenes K-predicate; defined by: e K if and only if 1 (e, e)
2
The three-fold distinction present at the beginning of this section thus survives the initial characterization of rela-
tions via formulas; eventually, this simple picture will collapse in the face of an infinite hierarchy of relations, butfor
the momentwe may bask in the ability to sort relations into one of three intuitive classes.
Proof Sketch.
Leverage either the definition in terms of computable verifiability or definability by a certain class of
formula.
Moreover, as might be readily surmised from either the opposing natures of verifiability and refutability or existential
and universal quantification, Theorem 12.5 (01 and 01 Relations are Complements).
For any 1 relation R, the relation R is 01 .
0
Proof Sketch.
Simply leverage definitions
A bit of works shows that, while computable enumerability is a new consideration, the class of computably enu-
merable relations is one with which were already quite familiar.
Graph of a Function
For a k-ary partial function f : Ak 7 B, the graph of f is {h x, f ( x)i : x A and f ( x)}; that is, the k + 1-ary
relation relating elements of dom( f ) to their corresponding range element.
Proof Sketch.
()
Suppose f = ke for some k, e; then, the relation f ( x) = y has positive characteristic function
()
Assume graph( f ) is defined in N by a 01 -formula of LA , say w( x, y, w). Then, ( x, y, w) is a 00 /00 -
formula, and so defines a primitive recursive relation Q( x, y, w). Thus,
By inspection, f is computable.
86 CHAPTER 12. NON-COMPUTABLE RELATIONS
Proof Sketch.
Simply leverage the previous results and build f
Proof Sketch.
()
Construct an existential formula defining R using the Tarski T predicate
()
Define the computable f such that R = ran(F) by using recursion and the operator
We have, then, a plethora of names, but only three distinct classes of relations; since writing computably verifi-
able / 01 / computably enumerable invariably becomes unwieldy, we adopt the convention of referring to our classes
of relations by the names 01 , 01 , and 01 . The alternative characterizations in terms of verifiability, refutability, com-
putability, and enumerability should not, of course, be forgottenbut, one hopes, constant reminders are not particularly
necessary.
Proposition 12.1 (Infinite 01 Relations are the Ranges of Computable, Bijective Functions).
If A N is infinite and 01 , then there exists f : N A such that f is both bijective and total computable.
Remark. Note that any finite set is computable; thus, if f : N N is total computable and <-preserving, then
ran( f ) is computable.
12.3. REDUCTION AND COMPUTABLE SEPARABILITY 87
(ii) A0 B0 =
Proof.
Let A = range( f ), B = range(g). Let n A0 k(n = f (k) j < k(n , g( j)), n B0 k(n =
g(k) j k(n , f ( j)); that is, we simply look at which function gets to an element first.
In the context of the arithmetical hierarchy, the natural and immediate companion to the reduction property is the
separation property; since 01 has reduction, we may immediately show that the class of its complements01 has
separation:
Proof Sketch.
Apply the reduction theorem to A and B
Strangely enough, the primary utility of this section comes not from the positive results already given, but rather
the following negative result:
Proof Sketch.
We define two disjoint 01 sets, A and B, as follows:
e A 1 (e, e)= 0
e B 1 (e, e)= 1
Suppose that there is a computable C such that A C and C B = . Then, the following defines a
computable function, and so has some index e0 in our enumeration:
(
1 if x C
e0 (x) =
0 if x < C
Note that e0 A B. If e0 A, then 1 (e0 , e0 ) = 0, and so e0 < Ameaning A * C. Suppose, then, that
e0 B; then, 1 (e0 , e0 ) = 1, and so e0 Cmeaning e0 < B. It must be, then, that no such C exists; that
is, A and B are inseparable.
88 CHAPTER 12. NON-COMPUTABLE RELATIONS
Proof Sketch.
Taking A, B from the proof above, define
(
0 if x A
g(x) =
1 if x B
x C f (x)= 0
Fact. If a total computable function sends every member of K into a set A and every member of K into a set B,
e A f (e) B
Although implicit in much of work until now, its worth noting that only considering 1-ary relations doesnt result in
a loss of generalityencoding sequences is, after all, a primitive recursive operation. The notion of m-reducibility in
hand, its easy to see that it behaves as intended vis-a-vis computable relations:
Proof Sketch.
Simply note that A (x) = B ( f (x)) where f is the function which gives the reduction A m B
12.4. MANY-ONE REDUCIBILITY 89
Before moving forward, we introduce two additional pieces of terminology; although defined in terms of 01 ,
identical definitions hold for the terms 01 -hard and 01 -complete.
01 -Hard
A relation A N is 01 -hard if and only if for every 01 relation B N, B m A; i.e., A is at least as hard to
compute as any 01 relation.
01 -Complete
A relation A N is 01 -complete if and only if A is both 01 -hard and 01 .
Our original question, then, asks whether every relation in 01 01 / 01 01 is, in fact, 01 /01 -complete.
Our goal of completely characterizing the basic relation classes 01 , 01 , and 01 may then be readily rephrased into un-
derstanding the m-degrees of the relations in these classes; in particular, whether or not there are precisely three
m-degrees below the 01 /01 level.
Corollary (K is 01 -Complete).
Index Set
I N is an index set if and only if for every a, b,
a = b (a I b I)
90 CHAPTER 12. NON-COMPUTABLE RELATIONS
In other words, I is an index set if and only if, for some class F contained in the set of computable functions,
I = {a | a F}. There are two index sets that, one hopes, leap immediately to mind: and N. For obvious reasons,
we refer to these two as the trivial index sets.
Proof Sketch.
Suppose I is a non-trivial index set. Note that there exists a computable function f such that f (x) for all
noting the index sets are closed
inputs x; let e f denote the index of this function. Then, e f I or e f I;
under complementation, assume without loss of generality that e f I. By assumption, I , , so there
must exist some e0 Iand, by I an index set, e0 , e f . Consider the following function:
e f (y)
(
xK
g(x, y) =
otherwise
Note, in particular, that g is computable; it follows by the sm n theorem that there exists a computable
function s11 (x) such that
s11 (x) (y) = g(x, y)
Corollary.
then I is 0 -hard.
If I is an index set, I , 0, I , N, and is indexed in I, 1
Proof Sketch.
That is, any 01 , k-ary index set contains the index of a function just in case it also contains an index for a finite
function which extends into it.
12.4. MANY-ONE REDUCIBILITY 91
Productive
A relation B N is productive if and only if there is a total computable function f called the productive function
for Bsuch that for every e,
We B f (e) B We
Note that if B is productive, then B is not computably enumerable since, for any c.e. set We , there is always another
element f (e) in B, but not in We ; in other words, productive simply means effectively not computably enumerable
Completely Productive
B is completely productive if and only if there is a total computable f such that
Theorem 12.14.
B is productive if and only if B is completely productive.
Proof.
()
() tricky proof with recursion thm. or (a) K is completely productive f (e) = e K iff e < We (f witnesses
comp. prod.) (b) if K m B, then B comp. prod. (as for prod.)
Proof Sketch.
Set f (e) = e. Then need, if We K, e K and e < We . But, if e We , e Kbut then e < We . Thus,
e < We , and so e K.
Proof Sketch.
Note that, since B is productive, it has a productive function f ; note further that is a computably enumer-
able subset of B. Taking e to be its index, f (e ) is another element of Bmeaning { f (e )} is a c.e. subset
of B! In general, the pattern above can be computably cycled, giving an infinite, c.e. subset of B.
[Myhill 1955]
92 CHAPTER 12. NON-COMPUTABLE RELATIONS
Proof.
Let fm be the many-one reduction of B to C and fB the productive function for B. Define fC (x) =
fm ( fB (s11 (x))) where s11 is defined as follows. Set
Noticing that W s11 (e) = fm1 (We ) makes it easy to show that fC is, in fact, a productive function for C.
Creative
A relation A N is creative if and only if A is 01 and A is productive.
Fact. By definition, no creative relation can be computable; indeed, the name creative derives from this built
in non-decidability
Computably Isomorphic
Two relations A and B are computably isomorphic if and only if there exists a total computable, bijective function
from A to B
Being computably isomorphic means that two sets are isomorphic from the limited perspective of the computable
functions; that is, there is no difference between the computational difficulty of the two sets according to any of the
measures considered thus far.
[Myhill 1955]
Proof Sketch.
12.4. MANY-ONE REDUCIBILITY 93
Having explored the bottom of the arithmetical hierarchy, we now set our sights on the hierarchy as a whole; its
not clear, however, that the rest of the arithmetic hierarchy is worth studying. For the lowest level, we had a clear
motivation in the notion of computability; the higher levels, though, dont at first seem to admit anything similar. In
this section, not only is the remainder of the hierarchy generated, but natural problems are shown to reside at levels
beyond 01 and 01 ; in the next section, we take up the task of giving an intuitive framework for thinking about these
higher levels.
As before, call a first-order formula of LA , the language of first-order arithmetic, 00 or 00 if and only if it is logically
equivalent to some using only bounded quantification and the standard truth-functional connectives. Inductively,
This gives rise to an infinite hierarchy of LA formulas; as earlier, we again associate the relations on the natural
numbers defined by each of these formulas in N with the formulas themselves, generating a hierarchy of relations
over N. We also define,
95
96 CHAPTER 13. RELATIVE COMPUTATION AND TURING REDUCIBILITY
.. .. .. ..
. . . .
03 03 03 03
03
02 02 02 02
02
01 01 01 01
00 /00 01
00 /00 = P.R.
Formula Hierarchy Relation Hierarchy
i. U is itself in D
ii. Whenever a relation R is in D and k-ary, then there exists an e N such that
Proof Sketch.
Proceed by induction on n, noting that the n = 1 case has already been established. For the inductive step,
suppose the result holds for n and consider n + 1. Given k, let V be 0n -universal for k + 1-ary (note that
0n -universal immediately guarantees a 0n -universal, namely its negation); define:
U(e, x) yV(e, x, y)
Proof Sketch.
Suppose that a k + 1-ary 0n -universal U is 0n . Consider the k-ary relation R defined by:
R(e, n) U(e, e, n)
Note that the hierarchy theorem guarantees something weve tactily assumed thus far; the arithemtic hierarchy
defined above never collapses at some level. Going higher always introduces new, previously unseen relations.
Hard
A relation A N is D-hard for some class of relations D if and only if for every relation B N in D, B m A
K(e) e (e)
Tot(e) We = N
Und(e) We =
Inf(e) We is infinite
Fin(e) We is finite
Cof(e) We is cofinite
Onto(e) Ee = N
Rec(e) We is computable
Comp(e) We is Turing complete
Using our work with index sets, its easy to see that all of thesewith the exception of Kare non-computable. Classi-
fying these relations further, however, can become quite involved. Several of these results are presented below:
Proof Sketch.
For hardness, let B(n) be an arbitrary 02 -relation. B is, by definition, defined by mnR(x, m, n) for some
primitive recursive R. To reduce B to Tot, consider the total computable f which gives
Proof Sketch.
For hardness, let B(n) be an arbitrary 02 -relation. B is, by definition, defined by nmR(x, n, m) for some
primitive recursive R. To reduce B to Fin, consider the total computable f which gives
(
0 (m y)(n)R(e, m, n)
f (e) (y) =
otherwise
Proof Sketch.
In general, then, we have the following picture of the arithmetic hierarchy when all of the canonical problems have
been assigned to the appropriate level of the hierarchy:
.. ..
. .
(m Comp) 04 04 (m Comp)
04
(m Co f , Rec) 03 03 (m Co f )
03
(m Fin) 02 02 (m T ot, In f )
02
(m K) 01 01 (m K)
01
Primitive Recursive
Limit Computation
Tree
A tree on a set X is a set T nN n X where nN n X is the set of sequences of values ha0 , . . . , an1 i where each
ai X such that if t T k(t k T )
13.2. RELATIVE COMPUTABILITY 99
We let [T ] = { f : N X|k f k T }
Finitely Branching
T is finitely branching iff s T , {a|sh ai T } is finite.
n
{0, 1} Suppose T is computable, finitely branching. T has a 02 branch.).
S
Theorem 13.3 (Consider T nN
A
ll we really need is that we have a recursive function bounding the size of levels?
Proof.
Let f be the leftmost branch, i.e. f o = , f (n) =least i {0, 1} such that f n ` hii has infinitely
many extensions in T . Note that what follows the such that in the previous sentence is equivalent to
k > dom(t)s k {0, 1}[s T t S ]01 . Gives that f is 02 . f (k) = i iff s(seq(s) lh(s) = k + 1 j
lh(s) (somethingaboutthepreviousconditiononS j, (s)) (s)k = i).
Theorem 13.4.
There is a recursive tree T on {0, 1} such that T is infinite and T has no infinite recursive branch.
Proof.
Let A, B be c.e., A B = such that there is no recursive C with A C and C B = . T will be the
tree of all attempts to build C , for some C such that A C and C B = . Formally, let h : N A,
g : N B both surjective, recursive, total. Put s T iff s is a finite sequence of 0s and 1s and
n lh(s)[h(n) dom(s) s(h(n)) = 0 g(n) dom(s) s(g(n)) = 1]. T recursive and check that
[T ] = {C |A C C B = }.
To phrase this new vision slightly differentlyand draw on terminology we eventually adoptwe may imagine hav-
ing an oracle for a particular problem, call it Y. This oracle, like a rather peculiar character out of Greek mythology,
tells us instantly whether or not any item we present to it is a solution to Y; beyond this, however, our oracle is a
proverbial black box. We do not question how it operates, why it is so magnificently fast, etcetera; we simply know
that it is always correct and always gives its answer instantly. Oracle in hand, the problems that are reducible to Y are
simply classified as all of the problems we may solve with the use of our oracle for Y.
Returning to the formalism weve developed thus far, the analog to a problem is a k-ary relation on Nk ; an oracle
O for a k-ary relation R (the problem above) simply says yes or no for any k-tuple of N according to whether the
tuple is or is not in the relation. This characterization immediately gives rise to a number of possibilities for what, in
particular, we should take an oracle to be.
Its not hard to see that all of these are reducible to one another; in a by now stereotypical move, its not necessary
to consider k-ary relations at allsimply encode it into a 1-ary relation in the standard way; similarly, taking R is a
very natural choice, but restricting ourselves to a characteristic function still gives the same power as a total function,
so why bother? The oracle tape suggestion is out if only because of our function-based analysis up until now; the
choice, then, is between representing oracles as 1-ary relationsknown, more colloquially, as setsor total functions
from N to N. We take the latter tact, but build in aspects of the others:
Oracle
An oracle O is a any total function O : N N
In starting our study of computability, we selected a set of functions which were intuitively computable, and then
proceeded; now, we take those same functions and simply add one more: the oracle function, O. Just as we constructed
a notion of computable, this immediately gives rise to, for any oracle O, a notion of O-computability.
O-Computable
A partial k-ary function f : Nk 7 N is computable in an oracle O or O-computable if and only if there is a
derivation of f using the primitive recursive functions, O, and the operations of primitive recursion, composition,
and -recursion
Just as earlier, we must invoke the following with only suggestive evidence:
Notation. e (x) < s, sometimes written la Soare as {e}s (x) , indicates that e (x)
without recourse to any number greater than or equal to s; thus, s > 0, e, , x < s, and no
number used in the computation is greater than or equal to s
Proof Sketch.
Although quite strong in the abstract, each of the above follows easily from how weve defined oracle
computation
A formula of LA (O) is 00 (O)/00 (O) is logically equivalent to a formula using only the boolean
connectives and bounded quantification
A formula of LA (O) is 0n+1 (O) is logically equivalent to x where is 0n (O) and x is finite
A formula of LA (O) is 0n+1 (O) is logically equivalent to x where is 0n (O) and x is finite
As before, we associate the relation defined by a given formula in the standard model N with that formulas position
in the hierarchy, defining
This is, of course, equivalent to: a relation R is 0n (O) if and only if both R and R are 0n (O)-relations. We thus have:
.. .. .. ..
. . . .
03 (O)
02 (O)
The closure rules associated with the arithmetical hierarchy carry over verbatim to the relativized arithmetical
hierarchy:
i. The class of 01 (O) predicates is closed under , , substitution of total O-computable functions, substitution of
O-computable functions past the n = 1 level, and existential generalization
ii. The class of 01 (O) predicates is closed under , , substitution of total O-computable functions, substitution of
O-computable functions past the n = 1 level, and universal generalization
iii. The class of 01 (O) predicates are closed under , , , substitution of total O-computable functions, and substi-
tution of O-computable functions past the n = 1 level.
That is, A T B just in case A is decidable given an oracle for B; the following are easy to establish:
13.3. TURING REDUCIBILITY 103
(i) T is a preorder
(ii) m T
(iii) For any relation A, A T A
(iv) For any computable relation B, B T A for any relation A
Turing Degree
The equivalence classes generated by the relation of mutual Turing reducibility; that is, for any relation A N,
[A]T = {B N : B T A and A T B}
Two relations with the same Turing degree are, for obvious reasons, said to be Turing equivalent, often notated by
T .
Conventionally, Turing degrees are written in boldface, e.g. a, b, and c, while the class of all Turing degrees is
denoted D. The following should be reminiscent of the terminology used with many-one reducibility: for a class of
relations ,
As above, there is a unique Turing degree for the computable relations, 0, and so the unrelativized version of K is 0
This notation can clearly be extended and often is:
A(n+1) = (A(n) )0
Proof Sketch.
Use the relativized sm
n theorem and Myhills isomorphism theorem
(ii) A0 T A
(iii) B is 01 (A) if and only if B m A0
(iv) If A is 01 (B) and B T C, then A is 01 (C)
(v) B T A if and only if B0 m A0
(vi) If B T A, then B0 m A0
(vii) A is 01 (B) if and only if A is 01 ( B)
Proof Sketch.
pg. 53 Soare
Proof Sketch.
13.3. TURING REDUCIBILITY 105
Using the Turing jump operator, we define a parallel operation on Turing degrees:
Using our results above, the jump is well-defined on degrees, andby definition0 is the minimal degree. We thus have
an infinite hierarchy of degrees:
0 < 00 < 000 < 0000 < < 0(n) < . . .
In fact, this hierarchy is closely related to another, as the first few levels make clear:
0 = deg() = {A : A is computable}
00 = deg(0 ) = degT (K) = degT (K)
000 = deg(00 ) = degT (Fin) = degT (Tot)
0000 = deg(000 ) = degT (Cof) = degT (Rec)
00000 = deg(0000 ) = degT (Comp) = degT (Comp)
0(n)
..
.
000
0 -Jump
A(0 ) = {hn, ki|k A(n) }
This is the computable join of all A(n) for n N.
Fact. (0 ) m ThN
106 CHAPTER 13. RELATIVE COMPUTATION AND TURING REDUCIBILITY
Chapter 14
Much of our work thus far has been foundational, establishing the basic results and basic notions of computability
theory. In this chapter, we leverage the results and notions of the last few chapters in order to prove deep results about
the structure of both the Turing and m-degrees.
Immune
B N is immune if and only if both B is infinite and B has no infinite 01 -subset.
Proof Sketch.
Priority?
Simple
A set A N is simple if and only if A is 01 and A is immune.
107
108 CHAPTER 14. POSTS PROBLEM & ORACLE CONSTRUCTIONS
Proof.
Let f be total recursive, 1-1, and ran( f ) is not recursive. Put D(n) k > n( f (k) < f (n)) (deficiency
stages in the enumeration of ran( f )). Claim: D is simple Proof: D is 01 ; D is infinite: if D has one element,
it has infinitely many since there must always be a next smallest and, by definition, this is also in D. Note
further that there must always be at least one element in D since there must be a smallest element (which,
by definition, is in D).
D is Immune: if not, let B D be infinite r.e. Then, m < ran( f ) iff n(n B m < f (n) k < n( f (k) ,
m)). Note that the right side is 01 .
i. A is infinite
ii. Whenever g is total recursive N N, and Fg(m) Fg(n) = for all m , n, then n Fg(n) A = .
Hyper-Simple
A set A is hyper-simple if and only if A is 01 and A is hyper-immune.
Incomparable
Two degrees a, b are incomparable, notated a|b, if and only if a b and b a
Proposition 14.2.
If a function f is total, extends every partial function in some A-computable sequence, and is completely determined
by the sequence, then f is A-computable.
Proof Sketch.
To determine any value, simply search the sequence until a partial function which defines it is found (such
a partial function must exist).
Proof Sketch.
Our proof method is the first in a set of standard proof techniques in computability theory: a finite exten-
sion, oracle construction. Put simply, we build step-by-step the characteristic functions for two relations
A and B, intending to let a,b be the Turing degrees of A, B respectively. Crucially, however, we build A
and B in both a 00 -computable manner (guaranteeing a,b T 00 ) and so that
(i) No B-computable function computes A
where BinS eq is true if and only if the input is a number encoding a sequence of 0s and 1s. Put less
formally, we ask whether there is an acceptable extension of 2e+1
B such that
Case 1: If the sentence is true, there must be a least pair h0 , t0 i witnessing this; set 2e+1+1
B = 0 and
define 2e+1+1
A as 2e+1
A , but with the addition of
t0 (lh(As )) > 0
( 0
A (lh(A )) = 01 ifotherwise
2e+1+1 2e+1
2e+1+1
A (lh(2e+1
A )) = 0
2e+1+1
B (lh(2e+1
B )) = 0
Reversing the treatment of A and B in the even case gives a complete procedure of the type required.
Proof Sketch.
Generalize the proof above
Proof Sketch.
Another finite extension, oracle construction; the two requirements necessary correspond to avoiding the
lower cone of degrees with reduce to b and the upper cone of degrees which reduce b.
c T d c0 T d0
So, 00 T d0 for all d.
Proof Sketch.
Finite extension, oracle construction; we need to construct an A meeting two general constraints: A0 T B
and B T A0 . In a now standard move, well ensure the first on odd steps and the second on even. As well
see, the latter of these is quite easy, while the former requires more thought. Turning to this problem first,
we wish to provide a means for B to decide A0 ; since B is arbitrary, this information must somehow be
built into Awhich, of course, is only useful if B can compute A. Our strategy, then, is to make our entire
construction of A B-computable, then to somehow encode whether or not e A0 into the construction.
Moving to the details of this procedure, suppose we are at an odd step 2e + 1 with an initial approximation
2e+1
A of A . We need a B-decidable question which will somehow indicate whether or not e A0 which
since all we know of B is 0 T Bthis means all we may ask is a strictly existential question. As usual,
we invoke the use properties of relative computation, querying:
( 2e+1
A )(t)[e (e)< t]
After all the excitement above, building A (B-computably) so that B T A0 is as easy as, for any even step
2e + 2, always extending 2e+2
A by B (e); that is, set
A if x < lh(2e+2
( 2e+2
)
2e+3
A (x) =
B(e)
A
if x = lh(2e+2 )
A
Proof Sketch.
Mimic the proof of the original, replacing 0 with c
14.2. FRIEDBURG COMPLETENESS 111
Proof Sketch.
Induction on m
Minimal
A Turing degree a is minimal if and only if 0 <T a and there is no Turing degree b such that
0 <T b <T a
Proof.
Construct A by approximating A with computable perfect ? or {0, 1} (i.e. T is a binary tree).
s T ks k T
T is perfect : s T u, v T [s u s v u v]
where r s iff n dom(r) dom(s)[r(n) , s(n)].
T is computable
Notation:
For T a tree, s T ,
T s = {u T |u 6 s}
Requirements on A: Re : Either e A is not total or e A is computable or e A T A.
Set T 0 = nN n {0, 1}. Given T e : want to ensure Re .
S
Case 1: s T e such that u, v s with u, v T k such that ue (k) and ve (k) , then ue (k) = ve (k). in
that case, set T e+1 = (T e ) s for some such s. Re is now guaranteed: suppose A [T e+1 ] and e A
is total.
Then, e (k) = i iff u
A
T e+1 ue (k)= i (note that this is 01 )
Case 2: Otherwise. Well define a T as follows: For r 2< , define (r) = T e
computable, perfect T e+1
by induction of the length of r. Then, take {u|r[u (r)]}. Set () = . Given (r), we need to define
(r ` h0i)
(r ` h1i)
Search for u, v, k showing Case 1 fails for s = (r). For the first ones you find, set u = (r ` h0i),
v = (r ` h1i). Thus completes the definition of . Note that is computable. Thus, set
Theorem 14.10.
There is a simple set A such that A0 T 00
00 = A0 A 0
14.2. FRIEDBURG COMPLETENESS 113
Proof.
Well define finite sets A s , s N, A s A s+1 , S `0 A0 computable, A = s A s .
S
Requirements: Pe : ensure We A , (if We infinite) Ne : if you see e As (e), then preserve As k (i.e.
k
115
Chapter 15
Historically, logical interest has centered around a handful of first-order theories; in this chapter we present the basic
terminology for working with both first-order logic and theories from a computability perspective, then canvas those
theories which have proven most interesting, proving some basic results about each.
Axioms
A set of L-sentences is a set of axioms for a deductively-closed L-theory T if and only if T is the deductive
closure of under some specific consequence relation
Computably Axiomatizable
A deductively-closed theory T in a computable language L is axiomatizable if and only if there exists a
computable set of axioms for T ; that is, there is a computable function which decides whether or not any natural
number encodes an axiom
117
118 CHAPTER 15. LANGUAGES, THEORIES, AND COMPUTABILITY
Historically, one of the largest areas of interaction between computability theory and logic is in proving or dis-
porving the decidability of some particular, usually mathematical, theory.
Proof Sketch.
By assumption, we have a theory T = {n : n N} such that, for some computable f , f (n) = pn q for
every N. Using this f , define, for each n/n , a new formula n as ni=1 n (n conjoined with itself n
times). Define:
= {n |n N}
is decidable since we may simply count the number of conjuncts in any well-formed formula, checking
they are all the same, then use the f above to ensure that the conjunct is the correct one. Note finally that
T = { : |= }
Proof Sketch.
Assume T is consistent since, otherwise, it is trivially decidable. To decide whether T , search for
both a proof of from the computable axioms for and a proof of from the computable axioms for
simultaneously, return whichever terminates first. Note that determining whether or not some encoding
is a valid proof is computable so long as the process for determining whether or not a given premise is
admissible is itself computable.
Of course, not all theories of interest are so well-behaved as to be both complete and computably axiomatizable;
indeed, much work has gone into the question of determining the decidability or undecidability of a theory under more
difficult circumstances; in this vein, we distinguish two notions stronger than simple undecidability:
Essentially Undecidable
A deductively-closed theory T in a computable L is essentially undecidable if and only if T has no consistent,
decidable extension in L.
Strongly Undecidable
A theory T in a computable L is strongly undecidable if and only if, for every deductively-closed L-theory S
such that S T is consistent, S is undecidable. A model A is called strongly undecidable if and only if Th(A) is
strongly undecidable
15.2. DECIDABILITY AND UNDECIDABILITY 119
Essential undecidability captures the idea that there is something about the sentences of the theory which forces
undecidability; there is no way to repair the undecidability of the theory. Strongly undecidable theories, in contrast,
guarantee that no decidable theory is even so much as consistent with thema distinctly stronger claim. Our picture,
then, is as follows:
Undecidable < Essentially Undecidable < Strongly Undecidable
From just our previous work, we thus have:
This means, for example, that if a theory is known to be both computably axiomatizable and undecidable, then it must
incomplete. The next proposition helps to demonstrate exactly how powerful a property strong undecidability really
is:
Proof Sketch.
Simply leverage definitions
Proof Sketch.
Suppose S is a non-empty, deductively-closed, decidable theory consistent with an essentially undecidable
T and is the finite axiomatization of T . A decidable, consistent extension of T can be constructed as
follows: Proceed through all formulas , if is in S , is in our extension; if not, it is not.
Theorem 15.4 (Incomplete, Decidable Theories have Consistent, Decidable, and Complete Extensions).
Every incomplete, deductively-closed, and decidable theory T has a decidable, consistent, and complete extension
[Tarski, 1949].
120 CHAPTER 15. LANGUAGES, THEORIES, AND COMPUTABILITY
Proof Sketch.
By deductively-closed and incomplete, we have that the theory is consistent. We construct the consistent,
complete, and decidable extension as follows: start with our theory and begin moving through the well-
formed sentences of the language, starting with those having least Gdel number; for any given sentence,
add it to our theory if its negation is not a consequence of the theoryotherwise, simply move to the next
sentence. Leverage this process and the deductive-closure of the original theory to prove decidability.
(ii) Model-Theoretic
(iii) Interpretations
The first two, quantifier elimination and model-theoretic, seek to leverage both the results above and in the model-
theory portion of this text. In particular, weve shown that a deductively-closed theory which is complete, consistent,
and computably axiomatizable must also be decidable. Since theories of interest are generally both consistent and
computably axiomatizable, both of these methods are often used to demonstrate the last property: completeness.
Given (3) and (4), its easy to see that quantifier elimination shows the decidability of T ; (1) and (2), in contrast,
are empirical observations about sets for which logicians have established (3) and (4). Crucially, even without (3)
and(4), all hope isnt lost for proving decidability; in particular, if it can be shown that T is complete with respect to
all the sentences in (for any sentence, either it or its negation are a consequence of T ), then T must be completea
property that, under many circumstances, readily gives decidability.
To achieve the full flavor of this method, we walk through a proof demonstrating the decidability of Presburger
arithmetic:
Fact. To establish (?), its only necessary to consider an arbitrary formula of the form x[( x)] where is
a set containing only literals
Other natural theories whose decidability can be demonstrated using only the method of quantifier elimination are:
The theory of the standard model of N under the signature containing only a successor function S and constant
0 [Herbrand, 1928]
The theory of the standard model of N under the signature containing only a successor function S , multiplication
function , and constant 0 [Skolem, 1930]
Elementary Theory of Identity; the first-order validities of a language with empty signature [Lowenheim, 1915].
Theory of Finitely Many Sets; the first-order validites of a language with only m unary relations [Lowenheim,
1915]
Theory of Discrete Orders; axiomatized by,
xy[x y y x x = y]
xy[x y y x]
xyz[x y y z x z]
xy[y x x = y]
xy[x < y z(x < z y z)]
xy[y < x zw(z < x (z < w z w))]
[Langford, 1927]
Theory of linear order in Q; i.e. Th(hQ, i)
Theory of Boolean Algebras [Tarski, 1949]
Theory of R; i.e. Th(hR, 0, 1, +, , , i) [Tarski, 1948]
15.3.3 Interpretation
The method of interpretation is, at heart, the result of a simple, familiar idea: reduction. Suppose that, on one hand,
we have a theory T ? in a computable language L? for which we wish to prove decidability; meanwhile, on the other
hand, we have a provably decidable theory T D formalized in a computable language LD . If there exists a computable
function f : L? LD such that, for every L? ,
T ? f () T D
then T ? must be decidable. Our decision procedure simply translates into LD using f , then leverages the decision
procedure for T D . Of course, in line with our earlier work with reductions, reducibility is a two-way street. In
particular, given two theories T 0 , T 1 in two language L0 , L1 and a computable f : L0 L1 such that
T 0 f () T 1
i. I(c) for any constant symbol c is a well-formed formula c (v0 ) of one free variable
ii. I(R) for some n-ary relation symbol R is a well-formed formula R (v0 , . . . , vn1 )
iii. I( f ) for any n-ary function symbol f is a well-formed formula f (v0 , . . . , vn )
In addition, with every interpretation function I we associate a well-formed formula D (v0 ) of L1 with a single free-
variable; in our scheme D (v0 ) will determine the domain of the model we wish to interpret within A while I provides
a simple translation of formulas from L0 to L1 . Thus, given an L1 -structure A, an interpretation function I : L0 L1 ,
and an L1 formula D (v0 ), we may construct an model-like entity AI as follows:
{
Let |AI | be AD ; that is, all elements a of A such that A
D [a]
I
For every constant symbol c of L0 , let cA be the unique b AD such that A
c [b]
AI I
For every relation symbol R of L0 , let RA be RA (AD )n
I
For every function symbol f of L0 , let f A be defined by
I
f A (a) = b A
f (a, b)
In order for AI to be a model, it must be the case that A I where I is defined as follows:
{
v0 [D (v0 )]
v0 . . . vn1 [n1
i=0 D (vi ) !vn [ f (v0 , . . . , vn ) D (vn )] for every function symbol f of L0
15.4. PROVING UNDECIDABILITY 123
The first condition on I guarantees that the domain of AI is nonempty, while the second and third guarantee that the
constant and function definitions of AI are respectively acceptable.
Interpretable
Let B be an L0 structure and A a L1 structure. B is interpretable in A if and only if B AI for some interpretation
function I such that A
I ; B is computably interpretable in A if and only if B AI for some computable
interpretation function I such that A
I
At long last, we turn to how interpretability can be leveraged. If Th(A) or Th(B) is of interest, any interpretation
function I can be used to generate a full, computable map f : L0 L1 reducing Th(B) to Th(A), just as in the
introduction to this section.
1 = v1 v3 v4 . . . (v1 , f2 (v1 ), v3 , . . . )
= v1 v3 v5 . . . (v1 , f2 (v1 ), v3 , f4 (v1 , v3 ), . . . )
2
Then, Bi S AT i+1 S AT
(iii) Suppose that L has only unary predicate symbols and no function symbols. Then S AT L is computable (this is
again a result of the following property has a model iff has a finite model)
(iv) If L has 2 unary function symbols S AT L is 01 complete; if theres just one and nothing else, S atL is computable.
Fact. (can use in exercises) Can use (iv) to get an essentially undecidable, not strongly undecidable, axiomati-
zable T again using inseparable r.e. sets.
This signature is often referred to, for historical reasons, as thesignature of Peano arithmetic.
Fact. Adding exponenetiation to the signature of LA doesnt add expressivity; in other words, exponentiation
is definable in LA
and then designating our intended model of LA as N, the standard model. To complete the picture and return to the
topic of this section, we identify the term true arithmetic with Th(N) in the language LA . Recalling that much of our
work with relations in computability theory is using true arithmetic (and, by extension, LA ), it should now be quite
obvious where the term arithmetic derives from in the arithmetic hierarchy:
Indeed, as we saw earlier, the correspondence is even more exact; relations definable in N by a bounded formula
are primitive recursive (although the converse is not true), the 01 -relations are exactly those definable in N by an
existential formula, so on and so forth.
125
126 CHAPTER 16. ARITHMETIC & INCOMPLETENESS
Corollary.
Let
T hN
0
= {pq| is an existential sentence of LENT and N
}
1
Then, K m T hN
01
Theorem 16.2.
T hN
0
is 01 -complete
1
Proof.
First, show its 01 by showing:
pq T hN
0
= v for some bounded sS at(pq, s)
1
Next, show K m T hN
0
. Let
1
K(n) N
[n]
( existential), then
K(n) p(n)q T hN
0 1
Then,
f (n) = p(n)q
is primitive recursive.
Let N + = (N, +, , exp, S , 0, , T k for every k N) where T k is intended to represent the Kleene T -predicate. It
+
follows that T hN
0
is, trivially, 01 -complete.
1
Corollary.
N is strongly undecidable
Validities of a Language
Let L be a first-order language; the validities of L, denoted ValL , is the theory
Proof.
PA
ThmQ m ValL : T hmQ i f f 7 Q which requires that Q is finitely axiomatized!
16.2. FIRST-ORDER PEANO ARITHMETIC 127
Weve already shown that any such theory is 01 -complete and thus undecidable.
Theorem 16.3.
Suppose N is computably definable in some model D. Let T be axiomatizable and true in D. Then, T is 01 -complete
and D is strongly undecidable
Proof.
Let I: symbols of LPA into wffs in the language L of D be an interpretation function. Let I be the
sentences of L saying this about I (i.e. D |= I iff DI is an LPA structure). Let S iff T I ` I . I is
defined by induction on in such a way that for any D |= I , D |= I iff DI |= . E.g.,
Comparing the original Dedekind-Peano second-order induction axiom with the replacement first-order induction
schema, we trade induction over arbitrary subsets of Na philosophically and set-theoretically questionable actfor
induction over the first-order, LA definable subsets of Na safer, but weaker ability. Its precisely this loss whichas
we will seeallows the formulation of so-called non-standard models of PA.
Note that, while not finitely axiomatizable, PA is computably axiomatizable, and thus, by earlier work, computably
enumerable.
Robinsons Arithmetic
Robinsons arithmetic, denoted Q, is the deductively-closed theory in LA axiomatized by the following:
There are a number of facts about Q which follow just from considering the axioms given; first, Q is finitely axiom-
atizable, and thus computably enumerable. Next, a bit of work shows that all the axioms of Q are either axioms or
consequences of axioms of PA. That is, Q PA and N
Q. Finally, Q lacks any induction axiom, meaning its
possible to prove, for every natural number every instance of a property (x), but yet not obtain the universalization
x(x).
Numeral
The set of numerals of Q/PA is defined inductively as follows:
2 = 2 = SS0
Representability
Given a theory T in a language L with a suitable notion of numeral,
(i) A k-ary relation R is weakly represented in T by a well-formed formula of L with k free variables if and
only if for any k-tuple of numbers n,
R(n1 , . . . , nk ) T ` [n1 , . . . , nk ]
(ii) A k-ary relation R is strongly represented in T by a well-formed formula of L with k free variables if and
only if for any k-tuple of numbers n,
R(n1 , . . . , nk ) T ` [n1 , . . . , nk ]
R(n1 , . . . , nk ) T ` [n1 , . . . , nk ]
(iii) A k-ary function f is functionally represented in T by a well-formed formula with k + 1 free variables if
and only if for any natural numbers n1 , . . . , nk , m,
f (n) = m T ` [n1 , . . . , nk , m]
f (n) = m T ` (!z)[n1 , . . . , nk , z]
R(n1 , . . . , nk ) PA ` [n1 , . . . , nk ]
R(n1 , . . . , nk ) PA ` [n1 , . . . , nk ]
Thus, a relation is strongly represented in PA if and only if it is weakly represented by some formula and its
negation is weakly represented by the negation of the same formula.
130 CHAPTER 16. ARITHMETIC & INCOMPLETENESS
(v) A relation R is strongly representable in PA if and only if its characteristic function R is representable in PA
(vi) For any LA -formula and any n N,
(vii) For any LA -formula and any n N, if for every m < n, PA ` (m) and PA ` (n) as well, then
Proof Sketch.
The above results, though sometimes tedious to prove, all straightforwardly rely on the axioms of PA,
requiring only clever usage there of.
Proof Sketch.
Leverage the definition of the computable functions; show that the starting functions are representable in
PA, and that representability in PA is closed under the closure operations of the computable functions.
Sound
A theory T is D-sound for a class of sentences D if and only if whenever is a D sentence and T ` , then is
true (what this means, of course, depends on context; usually truth in some standard model).
Of course, PA was built with the intent that PA Th(N); unfortunately, this hasnt been proven. Indeed, although
there is a great deal of anecdotal evidence, it isfor all intents and purposesan open question whether or not PA is
even consistent. Soundness results with regard to PA, then, are not nearly as trivial as they may, at first, seem.
Proof Sketch.
Leverage the representability of computable relations, note that if PA ` [n], then PA ` x(x)
Proof Sketch.
Note that PA strongly represents the formula [a] by itself (our results above translate addition in N
to addition in PA, etc.) where a is the witness to x(x). By our earlier results, PA ` [a], and thus
PA ` x(x).
Proof Sketch.
Computable axiomatizability guarantees that T is 01 , while extending PA and being 01 -sound guarantee
that T is consistent and still correctly proves 01 facts about Nmaking it 01 -hard.
Proof Sketch.
Let A, B N be disjoint, 01 , and computably inseparable. Since A, B are 01 , PA strongly represents them;
say by A (x) and B (x). Any decidable extension of Q can thus computably separate A and B using A ,
B a contradiction.
16.4.2 Representability in Q
Although we present it in less detail, many of the representability results above carry over verbatim into Robinsons
arithmetic; of course, this is because Robinson selected precisely the axioms needed to prove them.
Corollary (Q is 01 -Complete).
The theory Q is 01 -complete
Proof Sketch.
Computable axiomatizability guarantees that T is 01 , while extending Q and being 01 -sound guarantee
that T is consistent and still correctly proves 01 facts about Nmaking it 01 -hard.
Proof Sketch.
Let A, B N be disjoint, 01 , and computably inseparable. Since A, B are 01 , Q strongly represents them;
say by A (x) and B (x). Any decidable extension of Q can thus computably separate A and B using A ,
B a contradiction.
Q ` (pq)
Proof.
Define
if m encodes a LA -formula with one free variable
(
p[n]q
sub(m, n) =
0 otherwise
Note that sub is computable, and thus functionally represented in Q; suppose that the formula which does
so is sub (x, y, z). This established, let a well-formed formula of LA with one free variable, (x), be given.
Define (v) as
u(sub (v, v, u) (u))
Colloquially, (v) claims that there is a u such that u = pv [pvq]q (v = pv (x)q) and [u]. Let be (pq).
Thus, is
u(sub (pq, pq, u) (u))
and, since Q functionally represents sub,
Thus,
T ` (pq)
To establish the converse, by the definition of ,
And thus,
T ` () u(sub (pq, pq, u) (u))
T ` ()
which gives,
T ` ()
There are a few interesting characteristics of the proof given above; first, the proof in no way depended on what
(x) actually is. The fixed point, while depending on , is not in any way based on the content of . In every case,
there is a fixed point of the same form as above. The fixed point itself, above, is often understood to claim I
have property ; while this is certainly a fair description of how behaves, the direct self-reference is misleading.
actually says "there is something which is both and has properties X" where the properties X are enough for Q
to uniquely determine that something as the encoding of that very sentence. The self-reference utilized above is thus
134 CHAPTER 16. ARITHMETIC & INCOMPLETENESS
indirect. Finally, while T was used throughout the proof above, note that both Q and PA meet the criteria for T , and
thus the fixed point lemma holds in both.
Proof Sketch.
Given that T is computably axiomatizable, its computable to check, given a reasonable encoding scheme,
whether any given natural number encodes a proof of some particular statement. Thus, the following is a
computable relation:
ProofT (x, y) x is a proof of y from the axioms of T
Since Q strongly represents all computable relations, there must be some LA formula ProofT (x, y) with
the requisite properties. Noting that x[ProofT (x, y)] is an LA -formula with only one free variable, the
diagonalization lemma gives that there is a such that
Then, T 6` since, if it did, the proof must have an encoding e, so that ProofT (e, pq). Then,
Remark. This is, of course, not quite the oft quoted statement of Gdels first incompleteness theorem. In
particular, it leaves open the possibility that T is, in fact, complete, and that T ` . There are a number of ways to get
around this and establish the incompleteness of T . Strengthening the assumptions to include 01 -soundness suffices, as
well as Gdels tact: -consistency instead of simply consistency.
-Consistency
A theory T which interprets the natural numbers is -consistent if and only if, for every (x), if T ` (0),
T ` (1), T ` (2), . . . , then T 6` x[(x)]
In truth, however, neither are necessary. A clever adjustment to Gdels original proof, known as Rossers trick,
suffices to establish the stronger conclusion without strengthening the assumptions given above:
Proof Sketch.
Given that T is computably axiomatizable, its computable to check, given a reasonable encoding scheme,
whether any given natural number encodes a proof of some particular statement. Thus, the following is a
computable relation:
ProofT (x, y) x is a proof of y from the axioms of T
Since Q strongly represents all computable relations, there must be some LA formula ProofT (x, y) with
the requisite properties; similarly,
is also computable, and thus represented by some LA -formula, neg (x, y). Note that the following, denoted
(y) for short, is an LA -formula with one free variable:
Less formally, the formula (y) above states "for every proof of the formula encoded by y, there is a
shorter proof of the negation of the formula encoded by y". Its this shift from a formula claiming its
own unprovability in Gdels proof to one claiming the earlier provability of its negation which comprises
Rossers contribution. From here, the rest of the proof follows easily.
Notation. The computability of proof for computably axiomatizable theories and the strong
representability of this relation in theories extending Q allowed us, in the proofs above, to talk
about the PA-provability of various sentences in PA itself. We continue this trend by utilizing
the formula which strongly represents provability to define a LA sentence which claims that
T Q is consistent. Let ConT denote
Proof Sketch.
Using our work in Gdels first and the fixed point lemma, let be such that
Well show
PA ` ConT
which, by our previous work, implies PA 6` ConT . Throughout the remainder of the proofs, we use T
as an abbreviation of x[ProofT (x, pq)] (the notation is, of course, remnant of modal logic; in particular,
the provability logic GL; see Boolos text on the subject). To start, note that
PA ` T
PA ` T ConT
Thus,
PA ` ConT
For the reverse, note that, by the construction of ,
PA ` T T T
Thus,
PA ` T T
In other words,
PA ` T ConT
Contraposing,
PA ` ConT T
By the definition of ,
PA ` ConT
Putting the two together,
PA ` ConT
Proof.
(Kripke)
Show PA {} is inconsistent. By assumption and modus tollens,
PA {} ` PA (pq)
But then,
PA {} ` PA{}
By Gdels second, however, it now follows that PA {} is inconsistent.
16.4. GDELS INCOMPLETENESS THEOREMS 137
Remark. Note that PA only proves the soundness of its provability predicate with respect to a sentence when
it knows that holds, i.e. when the value of PA doesnt matter. Given our previous work, this should make a great
deal of sense; for all PA can prove, it is inconsistent, and so provability cant be depended on to entail truth generally.
T i+1 = T i {ConTi }
Some reflection shows, however, that if wed like hierarchy which doesnt immediately collapse, we need another
condition on T ; after all, sometimes adding a consistency claim as above actually yields inconsistency, e.g. consider
starting with PA {ConPA }. We thus stipulate that N
T or, less constrictively, T is 01 -sound.
Of course, this hierarchy over T can be extended transfinitely in the typical way:
[
T = Tn
n<
Note that if the original T is computably axiomatizable, so too is T n for every n, as well as T ; in a similar vein, note
that if T is 01 -sound, then so to is T n for every n N.
The Consistency Strength Hierarchy itself is the result of generalizing across the hierarchies generated by different
extensions of PA:
Consistency Strength Hierarchy
For consistent, 01 -sound, deductively-closed theories T, S extending PA, S Con T if and only if for every finite
F S , there is a finite G T such that PA ` ConG ConF . Note that in the finitely axiomatizable case this
reduces to PA ` ConT ConS . The consistency strength hierarchy is the hierarchy over consistent,
01 -sound,deductively-closed theories extending PA generated by the ordering Con .
Notation. Define
Proof.
()
Let S ` where is universal. Then have a finite F S such that F ` . Let G finite T such
that F Con G. Then, T ` ConG . (Use proof of PA proving consistency of all finite subsets, basically
the same). So, T ` ConF . Then, T ` because T ` xProvF (x, ). T ` xProvF (x, ), and PA `
xProvF (x, ) ConF )
()
Finish this direction
138 CHAPTER 16. ARITHMETIC & INCOMPLETENESS
Example 14.
Let PA T . RosserT = "for every proof of me, there is a shorter (smaller Godel number) proof of my
negation". RosserT is universal. T + RosserT Con T
Remark. if T + ConT is 01 -sound, T + ConT Con T (01 )T +RosserT * (01 )T since RosserT is a coun-
terexample
Proof.
Reason in PA. Assume ConT . We need ConT +RosserT . Suppose not. Then, T ` RosserT , so T `
ProvT (x, RosserT ) for some x. For every y < x, T ` ProvT (y, RosserT ). So, T ` RosserT .
So T is inconsistent. Contradiction. Thus, ConT +RosserT .
16.4.6
If T is 01 -sound, then T is 01 -sound because its consistent. The converse does not, however, hold (PA+ ConPA ).
01 -soundness is actually equivalent to 01 -soundness (Look at this).
Theorem 16.19.
Peano Arithmetic is finitely axiomatizable
Proof.
Assume PA consistent. Let F finite such that F Axioms of PA. Then, PA ` ConF ; bound the complexity
of these sentences (there must be some n such that for all F, is 0n -sentence). Note that satisfaction
for 0n is definable over N . We now have truth for F.
Say by the wff ,
PA ` ()
for all sentences of F. Then,
PA ` ()
Then, we use the formalized in PA soundness of some proof system. Thus, if F ` PA, then F ` ConF , so
F is inconsistenta contradiction.
Remark. Let T be axiomatizable in LPA , PA T .
i. Suppose T ` where is universal. Then, PA + ConT ` (T Con
PA + ConT )
ii. (ConT is sterile) Let T + ConT ` , universal. Then, T ` .
Proof.
Have PA + ConT +ConT ` by (i). But, PA ` PA ` ConT ConT +ConT (formalized Godel 2). So,
PA + ConT ` . So T ` .
Cut
For a model M of PA, a non-empty subset I of M is a cut if and only if for all x I and all y dom(M),
M
y < x y I and I is closed under the successor function.
Proof.
Taking such a formula, note that the model makes the antecedent of the induction schema true, and thus
the consequent as wella contradiction.
Corollary (Overspill).
Let M
PA be non-standard and let I be a proper cut of M. Suppose a dom(M) and (x, a) is an LA formula such
that M
(b, a) for all b I. Then, there is c > I in M such that M
(x < c)(x, a)
Corollary (Underspill).
Let a M with M
PA, I dom(M) a proper cut, and (x, y) an LA formula. Then,
Cofinal Extension
Suppose that M, N are PA models such that M N. N is a cofinal extension of Mnotated M c f Nif and only
if for every n dom(N), there is an m dom(M) such that N
n < m
140 CHAPTER 16. ARITHMETIC & INCOMPLETENESS
141
Chapter 17
Orderings
(i) Infinite discrete linear orders with a starting point, but no endpoint; e.g., (N, <).
(ii) Infinite discrete linear orders with an endpoint, but no starting point; e.g., (N, <) with the reverse ordering.
(iii) Infinite discrete linear orders with both a starting and ending point; e.g. an order of type (i) followed by an order
of type (ii).
In all cases, quantifier elimination can be used to show that the theory is complete.
(i) Dense linear orders with a starting point, but no endpoint; e.g., ([0, 1), <).
(ii) Dense linear orders with an endpoint, but no starting point; e.g., ((0, 1], <)
(iii) Dense linear orders with both a starting and ending point; e.g. ([0, 1], <)
(iv) Dense linear orders without endpoints; e.g. (Q, <), (R, <)
143
144 CHAPTER 17. ORDERINGS
The theory of dense linear orders without endpoints is complete (use os-Tarski), 0 -categorical (QE), but not
categorical in any uncountable cardinal. Quantifier elimination is often a popular and productive proof tactic
when working with DLOs; popular examples include
In cases (i)-(iii), quantifier elimination can easily be used to show that the theory is complete.
Corollary.
The theory of partial orders is 01 -complete.
Example 15.
(Q, ) is 0 -saturated; (Q, ) is not 1 saturated (look at the type realized by i)
Example 16.
(R, ) is not 1 -saturated (look at the type containing n < x for all n N.
Chapter 18
Similarly, an n-ary relation R over a set X is a subset of X n , while a function f is a binary, functional relation.
Equivalence Relation
For some set X, a relation which is:
i. Reflexive - xRx for every x X
ii. Symmetric - xRy yRx
iii. Transitive - xRy and yRz xRz
145
146 CHAPTER 18. A BRIEF SYNOPSIS OF SET THEORY
Orderings
For some set X,
i. Partial Ordering - A binary relation R over X which is reflexive, transitive, and anti-symmetric (xRy and yRx
x = y)
ii. Simple Ordering - A partial ordering R of X which is connected (for all x, y X, either xRy or yRx)
iii. Well Ordering - A simple ordering R of X with the property that every nonempty subset of X has an R-least
element (for every other y in the subset, xRy)
iv. Strict Well Ordering - A relation R over X which is irreflexive (xRx for every x X) such that adding
{hx, xi : x X} gives a well ordering
Chains
For some set X,
i. X is a chain if and only if X is simply ordered by
18.1 Ordinals
Ordinal
An ordinal is a set such that and is strictly well ordered by the relation
S
Example 17.
In canonical set theory, the first three ordinals are:
0=
1 = {}
2 = {, {}}
Successor Ordinal
For a given ordinal , the successor of is the ordinal
+ 1 = {}
Any ordinal which can be generated as the successor of another is called a successor ordinal.
Limit Ordinal
An ordinal which is not a successor ordinal
Example 18.
The least limit ordinal is , the order type of N under <
In addition, we define the infimum of a nonempty set X of ordinals is the least element of X, while the supremum
is the least ordinal which is greater than or equal to every element of X; in fact,
Order Type
A well ordering R over a set X has order type for some ordinal if and only if there is a bijection f : X
such that xRy f (x) f (y); f is called an isomorphism
Proposition 18.3.
Every well ordering has exactly one order type
Example 19.
As the above shows, ordinal addition is not commutative:
+1,
1+=
Multiplication
For two ordinals and , is defined as the ordinal given by the lexicographic ordering over the Cartesian product
. Graphically, this is equivalent to listing distinct copies of the ordinal:
Some thought shows that while ordinal multiplication is associative, itlike additionisnt commutative.
Example 20.
2,
2=
Exponentiation
For better or worse, ordinal exponentiation lacks an easy geometric presentation; for finite exponents, its easiest to
think of exponentiation in terms of multiplication, e.g. 2 = . The general case is given inductively by
0 = 1
+1 = ( )
Multiplication
Cardinal multiplication is similarly well-behaved; given two disjoint sets X and Y with cardinalities and respec-
tively, is simply the cardinality of the Cartesian product X Y. Finite numbers behave as usual and if both and
are non-zero with at least one infinite, = max(, ). Like cardinal addition, cardinal multiplication maintains both
associativity and commutativity.
Exponentiation
Cardinal exponentiation is defined as follows: given two disjoint sets X and Y with cardinalities and respectively,
is |X Y | where X Y is the set of all functions from Y into X, also commonly written as Y X. Finite numbers again
behave as usual and cardinal exponentiation maintains all of the basic exponent manipulations characteristic of the
finite setting.
i+1 = 2i
and when is a limit ordinal, [
i = i
<
Meanwhile,
150 CHAPTER 18. A BRIEF SYNOPSIS OF SET THEORY
Proposition 18.4.
(i) is an infinite limit cardinal if and only if there is a limit ordinal such that =
(ii) is an infinite strong limit cardinal if and only if there is a limit ordinal such that = i
(iii) For all ordinals , i
(iv) There are arbitrarily large cardinals such that || = = i
As a point of interest, the continuum hypothesis (CH) states that i1 = 1 , and the generalized continuum hypoth-
esis (GCH) states that, for all , 2 = +1 . Thus, GCH implies that for all , i = and that every limit cardinal
is, in fact, a strong limit cardinal.
Cofinality
Let be a limit ordinal; a set X is cofinal in if and only if X and = X. The cofinality of , notated
S
cf(), is the least cardinal such that a set of power is cofinal in .