| The expected utility
hypothesis of John von Neumann and Oskar Morgenstern,
while formally identical, has nonetheless a somewhat
different interpretation from Bernoulli’s. However,
the major impact of their effort was that they attempted
to axiomatize this hypothesis in terms of agents' preferences over different ventures with random prospects, i.e. preferences over what can be called lotteries.
Lotteries
We are all familiar with how a lottery works. Mathematically,
it can be described as:
Let x be an "outcome" and let X be a set of outcomes.
Let p be a simple probability measure on X, thus
p = (p(x1), p(x2), ...,
p(xn)) where p(xi) are probabilities
of outcome xi Î X occurring, i.e. p(xi) ³ 0 for all i =
1,
..., n and å i=1np(xi) = 1.
Note that for simple probability measures, there are finite elements x Î X for which p(x) > 0. Define D (X) as the set of simple probability measures
on X. A particular lottery p is a point in D (x).
Of course we can build a compound lottery, whereby a win in the first lottery results in tickets for another lottery. We can reduce compound lotteries into simple lotteries by combining the probabilities of the lotteries so that all we obtain is a single distribution over outcomes.
In the von Neumann-Morgenstern hypothesis, probabilities
are assumed to be "objective" or exogenous
and thus cannot be influenced by the agent. The problem
of an agent under uncertainty is to choose among
lotteries,
and thus find the "best" lottery in D (X).
One of von Neumann and Morgenstern’s major contributions to economics was
to show that if an agent has preferences defined over lotteries, then there is
a utility function U: D (X) ® R that assigns a utility to every lottery
p Î D (X) that represents these preferences.
If lotteries are merely distributions, it might
not seem to make sense that a
person would "prefer" a particular distribution to another on its own. If we
follow Bernoulli’s construction, we get a sense that what people really get
utility from is the outcome or consequence, x Î X. We do not eat "probabilities", after all, we eat apples! Yet what von Neumann and Morgenstern
suggest is that people's preferences are formed over
lotteries and from these preferences over lotteries, combined with objective
probabilities, we can deduce what the underlying preferences on outcomes
might be. Thus, in von Neumann-Morgenstern's theory, unlike Bernoulli's, preferences
over lotteries logically precede preferences
over outcomes.
In other words the ony reason we prefer a lottery over another is due to the implied underlying outcomes, but the preferences are not defined over these outcomes but rather defined over lotteries. In other words, von Neumann and Morgenstern's great insight was to avoid defining preferences over outcomes and capturing everything in terms of preferences over lotteries. The essence of von Neumann and Morgenstern's expected utility hypothesis, then, was to confine themselves to preferences over distributions and then from that, deduce the implied preferences over the underlying outcomes.
With Von Neumann being a mathematician, he naturally structured his formulation by using an axiomatic approach. The four axioms used are comparability, transitivity, independence and certainty equivalence.
Axioms
1. Comparability
An investor can state a preference among all alternative certain outcomes. Thus, if the investor has a choice of outcome A or B, a preference for A to B or B to A can be stated or indifference between them can be expressed. This is also referred to as completeness.
2. Transitivity
If an investor prefers A to B and B to C, then A is preferred to C.
3. Independence
If an investor is indifferent between two prospects X and Y, then they will be indifferent between the following two gambles (where Z is a third prospect):
X with probability P and Z with probability 1-P, and
Y with probability P and Z with probability 1-P.
4. Certainty Equivalent
For every gamble there is a value (called certainty equivalent) such that the investor is indifferent between the gamble and the certainty equivalent. (Everything has a price).
We derive the expected utility theorem more formally with the following steps:
- define and state axioms for preference relation ³ h over simple lotteries, D (X);
- use this preference relation to construct a utility
function on simple lotteries, U: D (X) ® R;
- prove that this utility
function U has an "expected utility" structure,
i.e. there is an underlying utility on outcomes
u: X ® R
that
yields U(p) = å p(x)u(x).
Axioms of Preference
Let ³ h be
a binary
relation over D (X), i.e. ³ h Ì D (X) ´ D (X). Hence, we can write (p, q) Î ³ h,
or p ³ h q to indicate that
lottery p is "preferred to or equivalent to" lottery q. Naturally, Ø (p ³ hq) = p <h q, i.e. if p is not preferred
to or equivalent to q, then we say q is strictly preferred to p. Of
course, p ³ h q and q ³ h p implies p ~h q,
i.e. p is equivalent to q. We now state the four axioms for these preferences:
(A.1) ³ h is
complete, i.e. either p ³ h q or q ³ h q for all p, q Î D (X).
(A.2) ³ h is
transitive, i.e. if p ³ h q
and q ³ h r
then p ³ h r
for all p, q, r Î D (X)
(A.3) Archimedean Axiom:
if p, q, r Î D (X)
such that p >h q >h r,
then there is an a , b Î (0,
1) such that a p
+ (1-a )r >h q and q >h b p
+ (1-b )r.
(A.4) Independence
Axiom: for all p, q, r Î D (X) and any a Î [0,
1], then p ³ h q
if and only if a p
+ (1-a )r ³ h a q + (1-a )r.
The first two axioms (A.1) and (A.2) should be familiar
from conventional theory. Together, (A.1) and (A.2)
are sometimes referred to as the "weak order" axioms.
The Archimedean Axiom (A.3) works like a continuity
axiom on preferences. It effectively states that
given any three lotteries strictly preferred to each
other, p >h q >h r, we
can combine the most and least preferred lottery
(p and r) via an a Î (0,
1) such that the compound of p and r is strictly
preferred to the middling
lottery q and we can combine p and r via a b Î (0,
1) so that the middling lottery q is strictly preferred to the compound of p
and r. Notice that one really needs D (X)
to be a linear, convex structure to have (A.3).
The Independence Axiom (A.4) effectively claims
that the preference between p and q is unaffected
if they are both combined in the same way with
a third lottery r. One can envisage this as a choice
between a pair of two-stage lotteries. In this case, a p
+ (1-a )r is a two stage
lottery which yields either lottery p with probability a and lottery r with probability (1-a ) in the first stage. Using the same interpretation
for a q
+ (1-a )r, then since both mixtures lead to r with
the same probability (1-a )
in the first stage and since one is equally well-off
if this case occurs, then preferences between the
two-stage lotteries ought to depend entirely on one's
preferences between the alternative lotteries in
the second-stage, p and q.
Note that these axioms, as stated, are derived from
N.E. Jensen (1967) and are not exactly the original V-M
axioms (in particular, they did not have an explicit
independence axiom).
Utility Function
We now want to proceed to the next step and derive
the V-M utility function,
U: D (X) ® R to represent preferences over lotteries,
where by representation we mean that for any p, q Î D (X),
p ³ h q if and only if
U(p) ³ U(q). Thus
if lottery p is preferred or equivalent to q, then the utility from lottery p
is
greater than utility from lottery q and vice-versa. Let us then turn to
the main existence theorem:
Theorem: Let D (X) be a convex subset of a linear space.
Let ³ h be
a binary relation on D (X).
Then ³ h satisfies (A.1), (A.2), (A.3)
and (A.4) if and only if there is a real-valued
function U:D (X) ® R such that:
(a) U represents ³ h (i.e. " p, q Î D (X),
p ³ h q Û U(p) ³ U(q))
(b) U is affine (i.e. " p, q Î D (X), U(a p
+ (1-a )q)
= a U(p) + (1-a )U(q) for any a Î (0,
1))
Moreover, if V:D (X) ® R also represents preferences, then there
is an b, c Î R
(where b > 0) such that V = bU +
c, i.e. U is unique up to a positive linear transformation.
Proof:
We divide the proof into the following parts:
- axioms Þ utility
representation and affinity;
- utility representation and affinity Þ axioms; and
- uniqueness of the utility function up to a positive linear transformation.
Part I: (Axioms Þ Representation
and Affinity)
We proceed in three steps: firstly, we prove two
lemmas on preferences; secondly, we prove that the
theorem holds on a closed preference interval; finally,
we extend this result to the entire D (X). So let us begin with the two lemmas:
Lemma: (L.1
- Mixture Monotonicity): For any p, q Î D (X),
and a , b Î (0, 1) where p >h q and a £ b ,
then b p + (1-b )q >h a p + (1-a )q.
Proof: (i) Suppose a = 0. Note that p = b p + (1-b )p and q = b q + (1-b )q obviously. Now, by (A.4), p >h q Þ b p + (1-b )p >h b p + (1-b )q as we have b p on both sides. But, by (A.4) again, b p + (1-b )q >h b q + (1-b )q as we now have (1-b )q in common on both sides. But note that
this implies b p
+ (1-b )q >h q
= a p +
(1-a )q when a = 0, and we are done. (ii) Suppose a > 0. Now, recall from (i) that b p + (1-b )q >h q.
Thus, defining r = b p + (1-b )q, then r >h q. Now,
define g = a /b . Then g r + (1-g )r >h q. But, as r >h q,
then by (A.4), g r
+ (1-g )r >h g r
+ (1-g )q where g r is in common on both sides. Or, by definition
of r, g r
+ (1-g )r >h g (b p + (1-b )q)) + (1-g )q. Then, rearranging, g r + (1-g )r >h g b p
+ (1- b g )q. But, by definition of g , g b = a ,
thus g r + (1-g )r >h a p + (1- a )q. But as r = g r + (1-g )r = b p + (1-b )q by definition, then b p + (1-b )q >h a p + (1- a )q.
Intuitively, if lottery p is preferred to lottery
q, then if we construct two compound lotteries with
different weights, then we prefer the compound lottery
in which lottery p is given the relatively greater
weight.
Lemma: (L.2
- Unique Solvability): If p, q, r Î D (X)
and p ³ h q ³ h r and p >h r,
then there is a unique a * Î [0, 1] such that q ~h a *p + (1-a *)r.
Proof: (i) If p ~h q,
then a * = 1 and we are done. (ii) if r ~h q, then a * = 0, and we are done. (iii) if p >h q >h r, then define the
set Q³ =
{a Î (0,
1) ½ q ³ h a p + (1-a )r}. This set is non-empty because a = 0 is an element of it and it is bounded
above by a £ 1. Thus, there is a supremum (least upper bound) of Q³ . Let a * = sup Q³ . Then we can consider two violating
cases. Case 1: p >h q >h a *p + (1-a *)r. Then, by (A.3),
there is a b Î [0, 1] such that q >h b (a *p
+ (1-a *)r)
+ (1-b )p.
Or, rearranging, q >h [1 - b (1-a *)]p + b (1-a *)r. But, as b (1-a *) < (1-a *), then (1-b (1-a *)) > a *. But then a * is not a supremum of Q³ . A contradiction. Case
2: a *p
+ (1-a *)r >h q.
We can proceed the same way, i.e. by (A.3) we can
find some g Î [0, 1] such that [1 - g (1-a *)]p + g (1-a *)r >h q, which implies that a *
is not a supremum - thus a contradiction. Consequently, it must be
that neither Case 1 or Case 2 can apply, thus a *p + (1-a *)r ~h q.
Finally, by mixture monotonicity (L.1), a * is unique.
Given a lottery q, we can construct a compound lottery
which yields the same utility as q by appropriately
combining any lottery p which is preferred to q with
any lottery r to which q is preferred.
Now, let us return to the main proof. Consider first
the following case: suppose that, for any p, q Î D (X), we have p ~h q (all lotteries
are equivalent) In this case, U is constant, i.e. U(p) = c for all p Î D (X),
which is of course real-valued and affine. Thus,
this trivial case is easily disposed with. But consider
now the following. Suppose s, r Î D (X) where s >h r. Define RS
= {p Î D (X) ½ s ³ h p ³ h r}, which is a closed and convex
subset of D (X)
(by (A.4)). For each p Î RS,
define ¦ (p) as a number such that p ~h ¦ (p)s +
(1-¦ (p))r.
By unique solvability (L.2), such a ¦ (p)
exists and is unique. We now make two claims:
Proposition (Representation): ¦ (.)
represents preferences on RS, i.e. for all p, q Î RS, ¦ (p) ³ ¦ (q)
if and only if ¦ (p)s + (1-¦ (p))r ³ h ¦ (q)s + (1-¦ (q))r.
To prove this, consider that by mixture monotonicity (L.1), s >h r and ¦ (p) ³ ¦ (q)
implies that ¦ (p)s + (1-¦ (p))r >h ¦ (q)s
+ (1-¦ (q))r. But, by the definition of ¦ (p) and ¦ (q), (i.e. p ~h ¦ (p)s + (1-¦ (p))r and q ~h ¦ (q)s + (1-¦ (q))r), we can note immediately by transitivity
(A.2) that this implies that p >h q.
The same argument works in reverse. Thus, ¦ (p) ³ ¦ (q) Û p >h q, i.e. ¦ (.) represents preferences ³ h on RS, and we are done. Q.E.D.
Proposition (Affinity): ¦ (.)
is affine for all p, q Î RS, i.e. ¦ (a p
+ (1-a )q)
= a ¦ (p) + (1-a )¦ (q).
To prove this, consider any p, q Î RS
and define p¢ = a p + (1-a )q. As RS is convex,
then p¢ Î RS for any a Î (0,
1). Thus, by unique solvability (L.2) there
is a real number ¦ (p¢ ) such that p¢ ~h ¦ (p¢ )s + (1-¦ (p¢ ))r. But as p¢ = a p + (1-a )q and p ~h ¦ (p)s + (1-¦ (p))r by (L.2), then p¢ ~h a [¦ (p)s
+ (1-¦ (p))r]
+ (1-a )q by the independence axiom (A.4). Doing
the same for q ~h ¦ (q)s + ((1-¦ (q))r,
then we obtain p¢ ~h a [¦ (p)s
+ (1-¦ (p))r]
+ (1-a )[¦ (q)s + ((1-¦ (q))r]. Rearranging a bit, we obtain that
p¢ ~h [a ¦ (p)
+ (1-a )¦ (q)]s + [a (1-¦ (p)) + (1-a )(1-¦ (q))]r, thus p¢ is equivalent to another convex combination
of s and r. But, by unique solvability (L.2), there
is only one a *
such that p¢ ~h a *s + (1-a *)r. Thus, it must
be that a *
= ¦ (p¢ ) = [a ¦ (p)
+ (1-a )¦ (q)], or, by the definition of p¢ , ¦ (a p
+ (1-a )q)
= a ¦ (p)
+ (1-a )¦ (q). This is the definition of affinity.
Let us now enter on our third stage and extend the
representation and affinity results from RS to the
entire set. To do so, we first need to prove the
following claim:
Proposition: (Order-Preservation):
If ¦ represents ³ h and is affine, then g = a +
b¦ where
b > 0 also (i) represents ³ h and
(ii) is affine.
The proof is simple. (i)
For any p, q Î D (X),
then p ³ h q Þ ¦ (p) ³ ¦ (q) by representation of ¦ . Thus, if b > 0,
then this implies a + b¦ (p) ³ a + b¦ (q), thus g(p) ³ g(q)
by definition. (ii) As ¦ is affine, then ¦ (a p
+ (1-a )q)
= a ¦ (p) + (1-a )¦ (q).
Now by definition, g(a p
+ (1-a )q)
= a + b¦ (a p
+ (1-a )q))
= a + b[a ¦ (p)
+ (1-a )¦ (q)] = a a + (1-a )a + ba ¦ (p)
+ b(1-a )¦ (q) = a [a + b¦ (p)] + (1-a )[a + b¦ (q)] = a g(p) + (1-a )g(q). Q.E.D..
Let us return to the extension of RS. By the definition
of ¦ , s ~h ¦ (s)s
+ (1-¦ (s))r and r ~h ¦ (r)s + (1-¦ (r))r, thus ¦ (s) = 1 and ¦ (r) = 0. Now, Define RS1 = {p Î D (X) ½ s1 ³ h p ³ h r1} where s1 >h s
and r >h r1, so obviously
RS Ì RS1.
Now, let us define ¦ 1 over
RS1 a manner analogous to before, so that
for any p Î RS1, then p ~h ¦ 1(p)s1 + (1-¦ 1(p))r1 and ¦ 1 is affine. Let us now find
a1 and a b1 > 0 and thus
a function g1 = a1 + b1¦ 1 such
that g1(s) = a1 + b1¦ 1 (s) = 1 and g1(r) = a1 +
b1¦ 1(r)
= 0. If we think of D (X)
as the real line and preferences increasing along
it, then ¦ 1 and the adjustment to g1 can
be represented as in Figure 2.
Now, define RS2 = {p Î D (X)
| s2 ³ h p ³ h r2} where s2 >h s
and r >h r2, so we again
obtain RS Ì RS2.
Defining ¦ 2 the
same way as before on RS2, we can thus
find now find a2 and b2 > 0
such that g2(s) = a2 + b2¦ 2(s)
= 1 and g2(r) = a2 + b2¦ 2(r) = 0. Thus, g1(r) =
g2(r) = 0 and g1(s) = g2(s)
= 1
We now show that for any p Î RS1 Ç RS2 Þ g1(p)
= g2(p). As p is in the intersection,
then either p is inside, above or below RS. In other
words, one of the following three cases will be true:
(i) s ³ h p ³ h r: Þ by unique solvability (L.2), $ a such
that p ~h a s
+ (1-a )r
(ii) p >h s >h r: Þ by
unique solvability (L.2), $ a such that s ~h a p + (1-a )r
(iii) s >h r >h p: Þ by
unique solvability (L.2), $ a such that r ~h a s + (1-a )p
Consider now the consequences of the different cases:
Case (i) implies that g1(p)
= a g1(s)
+ (1-a )g1(r) = a by construction of g1. But it
is also true that g2(p)
= a g2(s) + (1-a )g2(r) = a again by construction, thus g1(p)
= g2(p) = a .
Case (ii) implies that 1 = g1(s) = a g1(p)
+ (1-a )g1(r)
= a g1(p), so g1(p) =
1/a . But
similarly, 1 = g2(s) = a g2(p) + (1-a )g2(r) = a g2(p), so g2(p) =
1/a . Thus,
once again g1(p) = g2(p) = 1/a . Finally, Case (iii) implies that 0 = g1(r)
= a g1(s)
+ (1-a )g1(p)
= a + (1-a )g1(p), so g1(p) = a /(a -1). Similarly, 0 = g2(r) = a g2(s)
+ (1-a )g2(p)
= a + (1-a )g2(p), so g2(p) = a /(a -1). Thus, again g1(p) = g2(p) = a /(a -1).
Thus, for every p Î RS1 Ç RS2, g1(p) = g2(p). Consider now an increasing
sequence RS Ì RS1 Ì RS2 Ì RS3 Ì ...Ì D (X).
At each step, we can define gi that
represents preferences over RSi,
but gi(p) = gi-1(p) = gi-2(p) = ... for all
p Î RSi-1.
Thus, let us define this common value gi(p) = gi-1(p)
= U(p). We can thereby construct a U that represents
preferences over the entire set D (X).
Thus the first important part of the proof, the derivation
of a utility function U:D (X) ® R from axioms (A.1)-(A.4) is finished.
Part II: (Representation and Affinity Þ Axioms).
If U: D (X) ® R is affine and represents preferences,
then we want so show that the axioms (A.1)-(A.4)
hold. Completeness is clear enough: as U is defined
over D (X),
then for any pair p, q Î D (X) then either U(p) ³ U(q) or U(p) £ U(q)
or both. By representation, this implies (A.1). Similarly,
for any triple, p, q, r Î D (X),
by representation, U(p) ³ U(q) and U(q) ³ U(r) implies p ³ h q and q ³ h r. It then follows from the
properties of the real number line that U(p) ³ U(r), thus p ³ h r, so transitivity (A.2) is
done. The Archimedean axiom (A.3) is just as simple.
By representation, U(p) ³ U(q) ³ U(r) implies p ³ h q ³ h r. We know by the properties
of the real line (called the Archimedian axiom
in fact), there is an a Î (0, 1) such that a U(p) + (1-a )U(r) ³ U(q). As, by affinity, a U(p) + (1-a )U(q) = U(a p + (1-a )q), then U(a p + (1-a )r) ³ U(q) so, by representation, a p + (1-a )r ³ h q. The same reasoning applies
when choosing a b so b U(p) + (1-b )U(r) £ U(q), etc., thus we are done. Finally, for
the independence axiom (A.4), note that if U(p) ³ U(q), then p ³ h q. Notice also that this implies
that for a Î (0, 1), that a U(p) ³ a U(q).
Thus, adding (1-a )U(r)
from both sides a U(p)
+ (1-a )U(r) ³ a U(q)
+ (1-a )U(r).
By affinity, as U(a p
+ (1-a )r) = a U(p) + (1-a )U(r) and U(a q + (1-a )r) = a U(q) + (1-a )U(r), thus U(a p + (1-a )r) ³ U(a q + (1-a )r) so, by representation a p + (1-a )r ³ h a q + (1-a )r. The reverse also applies by the same
reasoning. Thus, the Archimedean axiom is finished.
Part III: (Uniqueness)
We now wish to turn to the "moreover" remark and
prove that if both U: D (X) ® R and V: D (X) ® R represent preferences, then there is a
c and b > 0 such that V = bU +
c. Let us get rid of the trivial case first: if for
all p, q Î D (X),
p ~h q, then U(p)
= k and V(p) = k¢ , thus V(p) = U(p) - (k-k¢ ), so c = (k - k¢ ) and b = 1. Now, suppose there is s, p,
r Î D (X) such that s >h p >h r.
Then define the following: HU(p) = [U(p) - U(r)]/[U(s) - U(r)] and HV(p)
= [V(p) - V(r)]/[V(s) - V(r)]. By unique solvability
(L.2), there is an a Î (0, 1) such that p ~h a s + (1-a )r. Thus HU(p) = [U(a s + (1-a )r) - U(r)]/[U(s) - U(r)] or, by affinity:
HU(p)
= [a U(s) + (1-a )U(r) - U(r)]/[U(s) - U(r)] = a
Similarly, as HV(p)
= [V(a s + (1-a )r) - V(r)]/[V(s) - V(r)], then HV(p)
= a . This
implies, then, that HU(p)
= HV(p). Thus,
[U(p)
- U(r)]/[U(s) - U(r)] = [V(p) - V(r)]/[V(s) - V(r)]
cross-multiplying:
[U(p)
- U(r)][V(s) - V(r)] = [U(s) - U(r)][V(p)
- V(r)]
or:
U(p)[V(s)
- V(r)] - U(r)[V(s) - V(r)] = [U(s) - U(r)]V(p) -
[U(s) - U(r)]V(r)
or simply:
V(p)
= U(p)[V(s) - V(r)]/[U(s) - U(r)] - U(r)[V(s) - V(r)]/
[U(s) - U(r)] + V(r)
thus letting b = [V(s)
- V(r)]/[U(s) - U(r)] and c = - U(r)[V(s) - V(r)]/
[U(s) - U(r)] + V(r), then:
V(p)
= bU(p) + c
which is the form we wanted.
The Expected Utility Representation
We have now obtained the utility function U:D (X) ® R on the basis of the four axioms set forth
earlier. However, we have not finished in proving
the expected utility hypothesis, namely, that a utility
function U:D (X) ® R has a representation:
U(p)
= å xÎ Supp(p) p(x)u(x)
where u: X ® R
is a elementary utility function on the underlying outcomes X.
Note that as D (X) is the set of simple probability distributions
on X, then if p Î D (X), then p has a finite support denoted
Supp(p) Ì X. D (X), of course, is a convex set. Finally,
we should note that by convexity, for any p, q Î D (X), a p
+ (1-a )q Î D (X)
for any a Î (0, 1) and that, if p and q are simple probability
distributions, then (a p
+ (1-a )q)(x)
= a p(x) + (1-a )q(x) for any x Î X.
We now state the expected utility representation
as a corollary to the earlier V-M theorem:
Corollary: (Expected
Utility Representation) Let D (X)
be the set of all simple probability distributions
on X. Let ³ h be a binary relation on D (X). Then ³ h satisfies (A.1)-(A.4) if and
only if there is a function u: X ® R
such that for every p, q Î D (X):
p ³ h q if and only if å xÎ Supp(p) p(x)u(x) ³ å xÎ Supp(q) q(x)u(x).
Moreover, v: X ® R represents ³ h in the above sense if and only
if there exist c and b > 0 such that v = bu +
c.
Proof: From the V-M theorem, there is a U: D (X) ® R which represents preferences ³ h on D (X) and is affine. Now, define the function d x:
X ® {0, 1} as d x(y) = 1 if y = x and d x(y) = 0 otherwise. This implies
that for every x Î X, d x is a degenerate distribution,
thus d x Î D (X).
Let U(d x)
= u(x). Now consider a distribution p = [p(x), p(y)].
Obviously, we can write this out as a convex combination
of degenerate distributions d x and d y, i.e. p = p(x)d x +
p(y)d y.
Thus, U(p) = U(p(x)d x +
p(y)d y) = p(x)U(d x) + p(y)U(d y) = p(x)u(x) + p(y)u(y) by affinity
and our definition of u(x) and u(y). Thus, more generally,
any distribution p with finite support can be written
out as a convex combination of degenerate distributions,
p = å xÎ Supp(p) p(x)d x,
and thus we obtain U(p) = U(å xÎ Supp(p)p(x)d (x))
= å xÎ Supp(p) p(x)u(x) which is the expected
utility representation of U(p). Thus as p ³ h q iff U(p) ³ U(q) by the von Neumann-Morgenstern theorem,
then equivalently, p ³ h q iff å xÎ Supp(p) p(x)u(x) ³ å xÎ Supp(q) q(x)u(x).
(i) Cardinality
Conventional, non-stochastic utility
functions u: X ® R are generally
assumed to be ordinal, i.e. they are order-preserving indexes of
preferences. By this we mean that the numerical magnitudes
we give to u are irrelevant, as long as they preserve
preference orderings. However, when facing the V-M
expected utility decomposition U(p) = å x p(x)u(x), it is common to fall
into the misleading conclusion that utility is cardinal,
i.e. that utility here is a measure of preferences.
Of course, the elementary utility function
u: X ® R within U(p) is cardinal.
Why this is so is clear enough: if expected utility
is obtained by adding up probabilities multiplied
by elementary utilities, then the precise measure
of the elementary utilities matters very much indeed.
More explicitly, if u represents preferences over
outcomes, then it is unique up to any linear transformation,
i.e. if v represents preferences over outcomes, then
there is a b> 0 and a such that v = bu +
c. This can be regarded as the "definition" of cardinality.
What is important to realize is that even though
the elementary utility function u is a cardinal utility
measure on outcomes, the utility function over lotteries
U is not a cardinal utility function. This
is because the elementary utilities on outcomes
are not primitives, rather the preferences on lotteries are
the primitives. The utility function over lotteries
U thus logically precedes the elementary utility
function: the latter is derived from the former.
Suppose we have an agent whose ordinal utility function
is known. Indeed, suppose that it’s our river-crossing
fugitive. Let’s assign him the following ordinal
utility function:
Escape 4
Death by shooting 3
Death by rockfall 2
Death by snakebite 1
Now, we know that his preference
for escape over any form of death is likely
to be stronger than his preference for, say, shooting
over snakebite. This should be reflected in his choice
behavior in the following way. In a situation such
as the river-crossing game, he should be willing
to run greater risks to increase the relative probability
of escape over shooting than he is to increase the
relative probability of shooting over snakebite.
This bit of logic is the crucial insight behind the
V-M solution to the cardinalization problem.
Begin by asking our agent to pick, from the available
set of outcomes, a best one and a worst one. ‘Best’ and ‘worst’ are
defined in terms of rational choice: a rational agent
always chooses so as to maximize the probability
of the best outcome -- call this W --
and to minimize the probability of the worst outcome
-- call this L. Now consider prizes
intermediate between W and L.
We find, for a set of outcomes containing such prizes,
a lottery over them such that our agent is indifferent
between that lottery and a lottery including only W and L.
In our example, this would be a lottery having shooting
and rockfall as its possible outcomes. Call this lottery T . We define a utility function q = u(T)
such that if q is the expected prize in T ,
the agent is indifferent between winning T and
winning a lottery in which W occurs
with probability u(T) and L occurs
with probability 1 u(
T).
We now construct a compound lottery T*
over the
outcome set {W, L} such that the agent is indifferent
between T and T*. A compound lottery is one
in which the prize in the lottery is another lottery. This makes sense because,
after all, it is still W and L that are at
stake for our agent in both cases; so we can then analyze T*
into a simple lottery over W and L. Call this
lottery r. It follows from
transitivity that T is equivalent to r. The
rational agent will now choose the action that maximizes the probability of
winning W. The mapping from the set of outcomes to u(r)
is a von Neumann-Morgenstern utility
function (VNMuf).
What exactly have we done here? We’ve simply given
our agent choices over lotteries, instead of over
prizes directly, and observed how much extra risk
he’s willing to run to increase the chances of winning
escape over snakebite relative to getting shot or
clobbered with a rock. A VNMuf yields a cardinal, rather than an ordinal,
measure of utility. Our choice of endpoint-values, W and L,
is arbitrary, as before; but once these are fixed
the values of the intermediate points are determined.
Therefore, the VNMuf does measure the relative preference intensities
of a single agent. However, since our assignment
of utility values to W and L is arbitrary,
we can’t use VNMufs to
compare the cardinal preferences of one agent with
those of another. Furthermore, since we are using
a risk-metric as our measuring instrument,
the construction of the new utility function depends
on assuming that our agent’s attitude to risk
itself stays constant from one comparison of
lotteries to another.
Consequently, the utility function U:D (X) ® R itself is an ordinal utility function
since any increasing transformation of U will
preserve the ordering on the lotteries. In
other words, if U represents preferences on lotteries,
then so does V = ¦ (U)
where ¦ is
an increasing monotonic transformation of U, e.g.
if U(p) is a representation
of the preference ordering on lotteries D (X), then V(p) = [U(p)]2 = [å xp(x)u(x)]2 is also a representation
of the preference ordering on lotteries. However,
there is one sense in which U still carries an element
of cardinality. Namely, given U(p) = å x p(x)u(x),
then v: X ® R
will generate an ordinally equivalent
V(p) = å x p(x)v(x) if and only if v = bu +
c for b > 0 - and thus V = bU + c. Thus, in V-M theory, we have a "cardinal utility
which is ordinal".
|