MIT14_03F16_lec16
MIT14_03F16_lec16
MIT14_03F16_lec16
1
1 Risk Aversion and Insurance: Introduction
• A significant hole in our theory of consumer choice developed in 14.03/14.003 to date is that
we have only modeled choices that are devoid of uncertainty: everything is known in advance.
That’s convenient, but not particularly plausible.
– Prices change
– Income fluctuates
– Bad stuff happens
• Most decisions are forward-looking: these decisions depend on our beliefs about what is
the optimal plan for present and future. Inevitably, such choices are made in a context of
uncertainty. There is a risk (in fact, a likelihood) that not all scenarios we hoped for will
be borne out. In making plans, we should take these contingencies and probabilities into
account—and there is no doubt that people do take these things into account. If we want a
realistic model of choice, we need to model how uncertainty affects choice and well-being.
– How do people choose among “bundles” that have uncertain payoffs, e.g., whether to fly
on an airplane, whom to marry?
– Insurance: Why do people want to buy it?
– How (and why) the market for risk operates? (Markets for risk include life insurance,
auto insurance, gambling, futures markets, warranties, bonds, etc.)
• Most people would not enter into a $1, 000 dollar heads/tails fair coin flip.
• You are offered a gamble. We’ll flip a coin. If it’s heads, I’ll give you $10 million dollars.
If it’s tails, you owe me $9.8 million.
2
Its expected monetary value is :
1 1
· 10 − · 9.8 = $0.1 million($100, 000)
2 2
Want to play?
3. People won’t pay large amounts of money to play games with huge upside potential. Example
“St. Petersburg Paradox.”
• Flip a coin. I’ll pay you in dollars 2n , where n is the number of tosses until you get a
head:
X1 = $2, X2 = $4, X3 = $8, ...Xn = 2n .
1 1 1 1
E(X) = 2 + 4 + 8 + ... n 2n = ∞.
2 4 8 2
• How much would you be willing to pay to play this game? [People generally do not
appear willing to pay more than a few dollars to play this game.]
• What is the variance of this gamble? V (X) = ∞.
• The fact that a gamble with infinite expected monetary value has (apparently) limited utility
value suggests something pervasive and important about human behavior: As a general rule,
uncertain prospects are worth less in utility terms than certain ones, even when expected
tangible payoffs are the same.
• To have a coherent model of choice under uncertainty, we need to be able to say how people
make choices when:
We’ll introduce the notion of risk aversion, insurance, and insurance markets in several steps. First,
I’ll review some basic probability theory, which will likely already be familiar. Next, I’ll provide
an optional formal development of so-called Von Neumann-Morgenstern expected utility theory
(AKA, Expected Utility Theory). You don’t have to spend time with the formal development,
but you are welcome to do so. Third, I’ll provide an informal discussion of the Expected Utility
property. Following that, I’ll show how preferences that satisfy the Expected Utility property
can be used to formalize notions of risk preference—specifically, risk averse, risk neutral, and risk
3
seeking preferences. From there, we will use these tools to understand how and why insurance
markets work, and why risk is a good (or bad) that consumers will want to trade.
Here’s a noteworthy feature of markets for risk that you should contemplate as you study
this material: in settings with risk, there may be gains from trade—that is, potential Pareto
improvements—even when all consumers have identical preferences and endowments.
It is also important to note before we launch in that the models we are going to develop in this
lecture note incorporate uncertainty in a very particular way. While we will will explore uncertainty
in the probability of certain outcomes occurring, we are going to maintain the assumption that
consumers know two important things:
These are strong assumptions, but probably not unreasonable for certain important settings.
In this equation f (x) is the ‘probability density function’ (PDF) of the continuous random variable
x, meaning that f (x) is essentially the probability of randomly drawing the given value x (so, f (x)
is just like the πi in the discreet case). [Note that the probability of drawing any specific value
from a continuous distribution is zero since there are an infinite number of possibilities. Depending
on the distribution, however, some ranges of values will be much more likely than others.]
4
Example: Expected value of a fair dice roll is E(x) = 6i=1 πi i = 1
· 21 = 72 . We’ve been using
P
6
expectations throughout the semester, so no doubt this is familiar.
7 2
Pn
In dice example, V (x) = i=1 πi i − 2
= 2.92.
Definition 4. Independence.
A case in which the probabilities of two (or multiple) outcomes do not depend upon one an-
other. If events A and B are independent, then Pr (A and B) = Pr (A) · Pr (B), and similarly,
E [A · B] = E [A] · E [B] .
Example: The probability of flipping two sequential heads with a fair coin is Pr (H and H) =
Pr (H) · Pr (H) = 0.25. These probabilities are independent, meaning that if you flipped heads the
first time, you are no more or less likely to flip heads the second time.
Example: The probabilities of seeing lightning and hearing thunder in an afternoon are not inde-
pendent of one another. If you see lightning, you’re reasonably likely to hear thunder, and vice
versa.
Example: If you flip a fair coin 100 times, the probability of getting heads more than ≥ 51%
of the time (that is, 51 or more times) is reasonably high. If you flip a fair coin 100,000 times,
the probability of getting heads more than ≥ 51% of the time (that is, 510,00 or more times) is
vanishingly small.
5
•
Dispersion and risk are closely related notions. Holding constant the expectation of X,
more dispersion means that the outcome is “riskier”—it has both more upside and more downside
potential.
Consider four gambles:
All four of these “lotteries” have same expected value (50 cents), but they have different levels of
risk.
A key statistical result, which I will not prove here, is that the variance of n identical inde-
pendent gambles is n1 times the variance of one of the gambles. What this means in practice is
that pooling a large number of independent, identical gambles reduces the aggregate riskiness of
those gambles. This is closely related to the previous example of flipping a coin 100 versus 100,000
times. The more independent gambles in the pool—the more flips of a fair coin—the greater the
certainty with which you can forecast the mean outcome.
Definition 6. A simple lottery L is a list L = (p1 , ...pN ) with pn ≥ 0 for all n and Σn pn = 1,
where pn is interpreted as the probability of outcome n occurring.
• A more general variant of a lottery, known as a compound lottery, allows the outcomes of a
lottery to themselves be simple lotteries.
Definition 7. Given K simple lotteries Lk = pk1 , ...pkN , k = 1..., K, and probabilities
αk ≥ 0 with Σk αk = 1, the compound lottery (L1 , ...LK ; α1 , ...αK ) , is the risky alternative
that yields the simple lottery Lk with probability αk for k = 1, ..., K.
6
• For any compound lottery (L1 , ...LK ; α1 , ...αK ), we can calculate a corresponding reduced
lottery as the simple lottery L = (p1 , ...pN ) that generates the same ultimate distribution
over outcomes. So, the probability of outcome n in the reduced lottery is:
That is, we simply add up the probabilities, pkn , of each outcome n in all lotteries k, multi-
plying each pkn by the probability αk of facing each lottery k.
• The basic premise of the model that follows is what philosophers would call a ‘consequential-
ist’ viewpoint: for any risky alternative, the decision maker cares only about the outcomes
and their associated probabilities, or in technical terms, the reduced lottery over final out-
comes. By assumption, the decision maker is indifferent to the (possibly many) compound
lotteries underlying these reduced lotteries.
• This compound lottery assumption states that the ‘frame’ or order of lotteries is unimportant.
So consider the following two stage lottery:
– We spin a pointer on a wheel with 8 areas, 2 areas of 900 representing $1.00, and $0.75,
and 6 areas of 300 each, representing $0.10, $0.20, ...$0.60 each.
– This single stage lottery has the same payouts at the same odds as the 2−stage lottery.
– The ‘compound lottery’ axiom says the consumer is indifferent between these two.
– Counterexamples? [This is not an innocuous set of assumptions.]
– [Is this realistic? Hard to develop intuition on this point, but research shows that this
assumption is often violated.]
7
• Implicitly, we are assuming that what enters into the decision maker’s utility function is the
final outcomes of these lotteries—the actual bundles consumed—not the probabilities along
the way. If that assumption is correct from a utility perspective, then compound lotteries
can be collapsed into simple lotteries so long as both the compound and simple lottery give
rise to the identical set of consumption bundles with identical probabilities of consumption.
(Another way to say this: the consumer does not consume the probabilities, only the realized
outcomes.)
• We assume the consumer has a rational preference relation % on £, a complete and transi-
tive relation allowing comparison among any pair of simple lotteries (I highlight the terms
complete and transitive to remind you that they have specific meaning from axiomatic utility
theory, given at the beginning of the semester). [This could also be called an axiom—or even
two axioms!]
• Axiom 1. Continuity. Small changes in probabilities do not change the nature of the
ordering of two lotteries. This can be made concrete here (I won’t use formal notation b/c
it’s a mess). If a “bowl of miso soup” is preferable to a “cup of Kenyan coffee,” then a mixture
of the outcome “bowl of miso soup” and a sufficiently small but positive probability of “death
by sushi knife” is still preferred to “cup of Kenyan coffee.”
• Continuity rules out “lexicographic” preferences for alternatives, such as “safety first.” Safety
first is a lexicographic preference rule because it does not trade-off between safety and
competing alternatives (fun) but rather simply requires safety to be held at a fixed value for
any positive utility to be attained.
1
This section draws on Mas-Colell, Andreu, Michael D. Winston and Jerry R. Green, Microeconomic Theory,
New York: Oxford University Press, 1995, chapter 6. For those of you considering Ph.D. study in economics, MWG
is the only text that covers almost the entire corpus of modern microeconomic theory. It is the Oxford English
Dictionary of modern economic theory. Most economists keep it on hand for reference; few read it for pleasure.
8
• The second key building block of our theory about preferences over lotteries is the so-called
Independence Axiom.
• In words, when we mix each of two lotteries with a third one, then the preference ordering
of the two resulting mixtures does not depend on (is independent of ) the particular third
lottery used.
• Example: If a bowl of miso soup is preferred to cup of Peets coffee, then the lottery (bowl
of miso soup with 50% probability, steak dinner with 50% probability) is preferred to the
lottery (cup of Peets coffee with 50% probability, steak dinner with 50% probability).
• It’s important to clarify now that “expected utility theory” does not replace consumer theory,
which we’ve been developing all semester. Expected utility theory extends the model of
consumer theory to choices over risky outcomes. Standard consumer theory continues to
describe the utility of consumption of specific bundles. Expected utility theory describes
how a consumer might select among risky bundles. [This paragraph will be repeated below
in the non-optional section of the lecture note.]
U (L) = u1 p1 + ... + uN pN .
• A utility function with the expected utility form is called a Von Neumann-Morgenstern
(VNM) expected utility function.
9
• The term expected utility is appropriate because with the VNM form, the utility of a lottery
can be thought of as the expected value of the utilities un of the N outcomes.
• In other words, a utility function has the expected utility form if and only if:
X X
K K
U k=1 αk Lk = k=1 αk U (Lk )
• Intuitively, a utility function has the expected utility property if the utility of a lottery is
simply the (probability) weighted average of the utility of each of the outcomes.
• A person with a utility function with the expected utility property flips a coin to gain or lose
one dollar. The utility of that lottery is
No. We haven’t actually defined the utility of an outcome, and we certainly don’t want to
assume that U (w) = w.
10
¯ + (1 − β)L αL
2. Let α, β ∈ [0, 1] . Then βL ¯ + (1 − α)L if and only if β > α. This follows
from the prior step.
¯ + (1 − αL ) L ∼ L. Existence follows
3. For any L ∈ £, there is a unique αL such that αL L
from continuity. Uniqueness follows from the prior step.
4. The function U : £ → R that assigns U (L) = αL for all L ∈ £ represents the preference
relation %.
Observe by Step 3 that, for any two lotteries L, L0 ∈ £, we have
¯ + (1 − αL ) L % αL0 L̄ + (1 − αL0 ) L .
L % L0 if and only if αL L
5. The utility function U (·) that assigns U (L) = αl for all L ∈ £ is linear and therefore has
the expected utility form.
We want to show that for any L, L0 ∈ £, and β [0, 1] , we have U (βL + (1 − β) L0 ) =
βU (L) + (1 − β) U 0 (L) .
By step (3) above, we have
¯ + (1 − U (L)) L = αL L̄ + (1 − αL ) L
L ∼ U (L) L
L0 ∼ U (L0 ) L̄ + (1 − U (L0 )) L = αL0 L̄ + (1 − αL0 ) L.
¯ + β (1 − U (L)) + (1 − β) 1 − U (L)0 L
βL + (1 − β) L0 ∼ [βU (L) + (1 − β) U (L0 )] L
¯ + [1 − βαL + (β − 1) α0 ] L
[βαL + (1 − β) αL0 ] L L
= β αL L̄ + (1 − αL ) L + (1 − β) αL0 L̄ + (1 − αL0 ) L
= βU (L) + (1 − β) U (L0 ) .
This establishes that a utility function that satisfies continuity and the Independence Axiom,
11
has the expected utility property: U (βL + (1 − β) L0 ) = βU (L) + (1 − β) U (L0 )
• Preferences that satisfy VNM Expected Utility theory have the property:
• This equation says that for a person with VNM preferences, the utility of consuming two
bundles L and L0 with probabilities β and (1 − β) , respectively, is equal to β times the
utility of consuming bundle L plus (1 − β) times the utility of consuming bundle L0 . Thus,
the utility function is linear in probabilities though not necessarily linear in preferences over
the bundles. [Note: VNM does not imply that U (2L) = 2 × U (L). As we’ll see below, that
equation would only hold for risk neutral preferences.]
• A person who has VNM EU preferences over lotteries will act as if she is maximizing expected
utility—a weighted average of utilities of each state, where weights equal probabilities.
• If this model is correct, then we don’t need to know exactly how people feel about risk per
se to make strong predictions about how they will optimize over risky choices.
• [If the model is not entirely correct—which it surely is not—it may still provide a useful
description of the world and/or a normative guide to how one should analytically structure
choices over risky alternatives.]
1. First, a utility function that assigns bundles an ordinal utility ranking. Note that
such functions are defined up to an affine (i.e., positive linear) transformation. This
means they are required to have more structure (i.e., are more restrictive) than standard
consumer utility functions, which are only defined up to a monotone transformation.
12
– These utility functions will capture consumer preferences for risk as well as consumer
preferences for bundles.
2. Second, the VNM assumptions. These make strong predictions about the maximizing
choices consumers will take when facing risky choices (i.e., probabilistic outcomes) over
bundles, which are of course ranked by this utility function.
– These assumptions are discussed at length above, but these assumptions guarantee
that consumers have well behaved preferences over lotteries, similarly to how the
axioms of consumer choice guaranteed that consumers have well-defined preferences
over bundles of good. Specifically, we assume that consumer preferences over lotter-
ies are continuous (i.e. small changes in probabilities do not change the ordering of
preferences) and independent (i.e. if we mix each of two lotteries with a third one,
then the preference ordering of the two resulting mixed lotteries does not depend
on (is independent of ) the particular third lottery used)
• It’s important to clarify now that expected utility theory does not replace consumer theory,
which we’ve been developing all semester. Expected utility theory extends the model of
consumer theory to choices over risky outcomes. Standard consumer theory continues to
describe the utility of consumption of specific bundles. Expected utility theory describes
how a consumer might select among risky bundles.
• Where does risk aversion come in? It is going to come in with the specification of the utility
function
• Consider the following three utility functions characterizing three different expected utility
maximizers:
•
u1 (w) = w
13
•
u2 (w) = w2
•
√
u3 (w) = w
14
• Consider a lottery where the consumer faces 50/50 odds of either receiving two dollars or
zero dollars. The expected monetary value of this lottery is $1.
• First notice that u1 (1) = u2 (1) = u3 (1) = 1. That is, they all value one dollar with certainty
equally.
• Now consider the Certainty Equivalent for a lottery L that is a 50/50 gamble over $2 versus
$0. The certainty equivalent is the amount of cash that the consumer would be willing to
accept with certainty in lieu of facing lottery L.
15
1. CE1 (L) = U1−1 (1) = $1.00
2. CE2 (L) = U2−1 (2) = 2.5 = $1.41
3. CE3 (L) = U3−1 (0.71) = 0.712 = $0.51
• Depending on the utility function, a person would pay $1, $1.41, or $0.51 dollars to participate
in this lottery.
• Although the expected monetary value E(V ) of the lottery is $1.00, the three utility functions
value it differently:
1. The person with U1 is risk neutral : CE = $1.00 = E(V alue) ⇒ Risk neutral
2. The person with U2 is risk loving: CE = $1.41 > E(V alue) ⇒ Risk loving
3. The person with U3 is risk averse: CE = $0.50 < E(V alue) ⇒ Risk averse
• What gives rise to these inequalities is the shape of the utility function. Risk preference comes
from the concavity/convexity of the utility function:
N
P
• Expected utility of wealth: E(U (w)) = pi U (wi )
i=1
N
P
• Utility of expected wealth: U (E(w)) = U p i wi
i=1
• Jensen’s inequality:
• So, the core insight of expected utility theory is this: For a risk averse consumer facing an
uncertain set of possible wealth levels, the expected utility of wealth is less than the
utility of expected wealth.
16
• The reason this is so:
– If wealth has diminishing marginal utility (as is true if U (w) = w1/2 ), losses cost more
utility than equivalent monetary gains provide. This can be seen in the concave shape
of the utility function - for lower levels of wealth, the utility function has a steeper slope.
– Consequently, a risk averse consumer is better off receiving a given amount of wealth
with certainty than the same amount of wealth on average but with variance around
this quantity.
6 Conclusions
You may be thinking that we have used a lot of machinery to build a pretty modest conceptual
widget. So far, you would be right. But the tool of Expected Utility Theory will prove quite
powerful in the lectures to follow. We will first use VNM EU Theory to formally model people’s
willingness to pay to defray (avoid/reduce) risk. We will then analyze markets for risk, and glimpse
the potential for insurance markets to generate Pareto-improving trades among economic agents
who each possess identical bundles, preferences, and technologies. We will also analyze implicit
markets for risk, as seen in the case of speed limits and the so-called value of a statistical life
(VSL). Risk and insurance will then prove foundational in our study of imperfect information in
markets, which is the final broad topic of the semester.
17
MIT OpenCourseWare
https://ocw.mit.edu
For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.