Home
About Crown Hill
Exchange Traded
Products
Institutional Investors
Investment Process
Advanced ALM
 
 
   
The expected utility hypothesis of John von Neumann and Oskar Morgenstern, while formally identical, has nonetheless a somewhat different interpretation from Bernoulli’s. However, the major impact of their effort was that they attempted to axiomatize this hypothesis in terms of agents' preferences over different ventures with random prospects, i.e. preferences over what can be called lotteries.

Lotteries

We are all familiar with how a lottery works. Mathematically, it can be described as:

Let x be an "outcome" and let X be a set of outcomes. Let p be a simple probability measure on X, thus p = (p(x1), p(x2), ..., p(xn)) where p(xi) are probabilities of outcome xi Î X occurring, i.e. p(xi) ³ 0 for all i = 1, ..., n and å i=1np(xi) = 1. Note that for simple probability measures, there are finite elements x Î X for which p(x) > 0.  Define D (X) as the set of simple probability measures on X. A particular lottery p is a point in D (x).

Of course we can build a compound lottery, whereby a win in the first lottery results in tickets for another lottery. We can reduce compound lotteries into simple lotteries by combining the probabilities of the lotteries so that all we obtain is a single distribution over outcomes.

In the von Neumann-Morgenstern hypothesis, probabilities are assumed to be "objective" or exogenous and thus cannot be influenced by the agent. The problem of an agent under uncertainty is to choose among lotteries, and thus find the "best" lottery in D (X). One of von Neumann and Morgenstern’s major contributions to economics was to show that if an agent has preferences defined over lotteries, then there is a utility function U: D (X) ® R that assigns a utility to every lottery p Î D (X) that represents these preferences.

If lotteries are merely distributions, it might not seem to make sense that a person would "prefer" a particular distribution to another on its own. If we follow Bernoulli’s construction, we get a sense that what people really get utility from is the outcome or consequence, x Î X. We do not eat "probabilities", after all, we eat apples! Yet what von Neumann and Morgenstern suggest is that people's preferences are formed over lotteries and from these preferences over lotteries, combined with objective probabilities, we can deduce what the underlying preferences on outcomes might be. Thus, in von Neumann-Morgenstern's theory, unlike Bernoulli's, preferences over lotteries logically precede preferences over outcomes.

In other words the ony reason we prefer a lottery over another is due to the implied underlying outcomes, but the preferences are not defined over these outcomes but rather defined over lotteries. In other words, von Neumann and Morgenstern's great insight was to avoid defining preferences over outcomes and capturing everything in terms of preferences over lotteries. The essence of von Neumann and Morgenstern's expected utility hypothesis, then, was to confine themselves to preferences over distributions and then from that, deduce the implied preferences over the underlying outcomes.

With Von Neumann being a mathematician, he naturally structured his formulation by using an axiomatic approach. The four axioms used are comparability, transitivity, independence and certainty equivalence.

Axioms

 1. Comparability

An investor can state a preference among all alternative certain outcomes. Thus, if the investor has a choice of outcome A or B, a preference for A to B or B to A can be stated or indifference between them can be expressed. This is also referred to as completeness.

2. Transitivity

If an investor prefers A to B and B to C, then A is preferred to C.

3. Independence

If an investor is indifferent between two prospects X and Y, then they will be indifferent between the following two gambles (where Z is a third prospect):

X with probability P and Z with probability 1-P, and

Y with probability P and Z with probability 1-P.

4. Certainty Equivalent

For every gamble there is a value (called certainty equivalent) such that the investor is indifferent between the gamble and the certainty equivalent. (Everything has a price).

We derive the expected utility theorem more formally with the following steps:
  • define and state axioms for preference relation ³ h over simple lotteries, D (X);
  • use this preference relation to construct a utility function on simple lotteries, U: D (X) ® R;
  • prove that this utility function U has an "expected utility" structure, i.e. there is an underlying utility on outcomes u: X ® R that yields U(p) = å p(x)u(x).

Axioms of Preference

Let ³ h be a binary relation over D (X), i.e. ³ h Ì D (X) ´ D (X). Hence, we can write (p, q) Î ³ h, or p ³ h q to indicate that lottery p is "preferred to or equivalent to" lottery q. Naturally, Ø (p ³ hq) = p <h q, i.e. if p is not preferred to or equivalent to q, then we say q is strictly preferred to p. Of course, p ³ h q and q ³ h p implies p ~h q, i.e. p is equivalent to q. We now state the four axioms for these preferences:

(A.1) ³ h is complete, i.e. either p ³ h q or q ³ h q for all p, q Î D (X).

(A.2) ³ h is transitive, i.e. if p ³ h q and q ³ h r then p ³ h r for all p, q, r Î D (X)

(A.3) Archimedean Axiom: if p, q, r Î D (X) such that p >h q >h r, then there is an a , b Î (0, 1) such that a p + (1-a )r >h q and q >h b p + (1-b )r.

(A.4) Independence Axiom: for all p, q, r Î D (X) and any a Î [0, 1], then p ³ h q if and only if a p + (1-a )r ³ h a q + (1-a )r.

The first two axioms (A.1) and (A.2) should be familiar from conventional theory. Together, (A.1) and (A.2) are sometimes referred to as the "weak order" axioms. The Archimedean Axiom (A.3) works like a continuity axiom on preferences. It effectively states that given any three lotteries strictly preferred to each other, p >h q >h r, we can combine the most and least preferred lottery (p and r) via an a Î (0, 1) such that the compound of p and r is strictly preferred to the middling lottery q and we can combine p and r via a b Î (0, 1) so that the middling lottery q is strictly preferred to the compound of p and r. Notice that one really needs D (X) to be a linear, convex structure to have (A.3).

The Independence Axiom (A.4) effectively claims that the preference between p and q is unaffected if they are both combined in the same way with a third lottery r. One can envisage this as a choice between a pair of two-stage lotteries. In this case, a p + (1-a )r is a two stage lottery which yields either lottery p with probability a and lottery r with probability (1-a ) in the first stage. Using the same interpretation for a q + (1-a )r, then since both mixtures lead to r with the same probability (1-a ) in the first stage and since one is equally well-off if this case occurs, then preferences between the two-stage lotteries ought to depend entirely on one's preferences between the alternative lotteries in the second-stage, p and q.

Note that these axioms, as stated, are derived from N.E. Jensen (1967) and are not exactly the original V-M axioms (in particular, they did not have an explicit independence axiom).

 Utility Function

We now want to proceed to the next step and derive the V-M utility function, U: D (X) ® R to represent preferences over lotteries, where by representation we mean that for any p, q Î D (X), p ³ h q if and only if U(p) ³ U(q). Thus if lottery p is preferred or equivalent to q, then the utility from lottery p is greater than utility from lottery q and vice-versa. Let us then turn to the main existence theorem:

Theorem: Let D (X) be a convex subset of a linear space. Let ³ h be a binary relation on D (X). Then ³ h satisfies (A.1), (A.2), (A.3) and (A.4) if and only if there is a real-valued function U:D (X) ® R such that:

(a) U represents ³ h (i.e. " p, q Î D (X), p ³ h q Û U(p) ³ U(q))

(b) U is affine (i.e. " p, q Î D (X), U(a p + (1-a )q) = a U(p) + (1-a )U(q) for any a Î (0, 1))

Moreover, if V:D (X) ® R also represents preferences, then there is an b, c Î R (where b > 0) such that V = bU + c, i.e. U is unique up to a positive linear transformation.

Proof: 

We divide the proof into the following parts:

  • axioms Þ utility representation and affinity;
  • utility representation and affinity Þ axioms; and
  • uniqueness of the utility function up to a positive linear transformation.

Part I: (Axioms Þ Representation and Affinity)

We proceed in three steps: firstly, we prove two lemmas on preferences; secondly, we prove that the theorem holds on a closed preference interval; finally, we extend this result to the entire D (X). So let us begin with the two lemmas:

Lemma: (L.1 - Mixture Monotonicity): For any p, q Î D (X), and a , b Î (0, 1) where p >h q and a £ b , then b p + (1-b )q >h a p + (1-a )q.

Proof: (i) Suppose a = 0. Note that p = b p + (1-b )p and q = b q + (1-b )q obviously. Now, by (A.4), p >h q Þ b p + (1-b )p >h b p + (1-b )q as we have b p on both sides. But, by (A.4) again, b p + (1-b )q >h b q + (1-b )q as we now have (1-b )q in common on both sides. But note that this implies b p + (1-b )q >h q = a p + (1-a )q when a = 0, and we are done. (ii) Suppose a > 0. Now, recall from (i) that b p + (1-b )q >h q. Thus, defining r = b p + (1-b )q, then r >h q. Now, define g = a /b . Then g r + (1-g )r >h q. But, as r >h q, then by (A.4), g r + (1-g )r >h g r + (1-g )q where g r is in common on both sides. Or, by definition of r, g r + (1-g )r >h g (b p + (1-b )q)) + (1-g )q. Then, rearranging, g r + (1-g )r >h g b p + (1- b g )q. But, by definition of g , g b = a , thus g r + (1-g )r >h a p + (1- a )q. But as r = g r + (1-g )r = b p + (1-b )q by definition, then b p + (1-b )q >h a p + (1- a )q.

Intuitively, if lottery p is preferred to lottery q, then if we construct two compound lotteries with different weights, then we prefer the compound lottery in which lottery p is given the relatively greater weight.

Lemma: (L.2 - Unique Solvability): If p, q, r Î D (X) and p ³ h q ³ h r and p >h r, then there is a unique a * Î [0, 1] such that q ~h a *p + (1-a *)r.

Proof: (i) If p ~h q, then a * = 1 and we are done. (ii) if r ~h q, then a * = 0, and we are done. (iii) if p >h q >h r, then define the set Q³ = {a Î (0, 1) ½ q ³ h a p + (1-a )r}. This set is non-empty because a = 0 is an element of it and it is bounded above by a £ 1. Thus, there is a supremum (least upper bound) of Q³ . Let a * = sup Q³ . Then we can consider two violating cases. Case 1: p >h q >h a *p + (1-a *)r. Then, by (A.3), there is a b Î [0, 1] such that q >h b (a *p + (1-a *)r) + (1-b )p. Or, rearranging, q >h [1 - b (1-a *)]p + b (1-a *)r. But, as b (1-a *) < (1-a *), then (1-b (1-a *)) > a *. But then a * is not a supremum of Q³ . A contradiction. Case 2: a *p + (1-a *)r >h q. We can proceed the same way, i.e. by (A.3) we can find some g Î [0, 1] such that [1 - g (1-a *)]p + g (1-a *)r >h q, which implies that a * is not a supremum - thus a contradiction. Consequently, it must be that neither Case 1 or Case 2 can apply, thus a *p + (1-a *)r ~h q. Finally, by mixture monotonicity (L.1), a * is unique.

Given a lottery q, we can construct a compound lottery which yields the same utility as q by appropriately combining any lottery p which is preferred to q with any lottery r to which q is preferred.

Now, let us return to the main proof. Consider first the following case: suppose that, for any p, q Î D (X), we have p ~h q (all lotteries are equivalent) In this case, U is constant, i.e. U(p) = c for all p Î D (X), which is of course real-valued and affine. Thus, this trivial case is easily disposed with. But consider now the following. Suppose s, r Î D (X) where s >h r. Define RS = {p Î D (X) ½ s ³ h p ³ h r}, which is a closed and convex subset of D (X) (by (A.4)). For each p Î RS, define ¦ (p) as a number such that p ~h ¦ (p)s + (1-¦ (p))r. By unique solvability (L.2), such a ¦ (p) exists and is unique. We now make two claims:

Proposition (Representation): ¦ (.) represents preferences on RS, i.e. for all p, q Î RS, ¦ (p) ³ ¦ (q) if and only if ¦ (p)s + (1-¦ (p))r ³ h ¦ (q)s + (1-¦ (q))r.

To prove this, consider that by mixture monotonicity (L.1), s >h r and ¦ (p) ³ ¦ (q) implies that ¦ (p)s + (1-¦ (p))r >h ¦ (q)s + (1-¦ (q))r. But, by the definition of ¦ (p) and ¦ (q), (i.e. p ~h ¦ (p)s + (1-¦ (p))r and q ~h ¦ (q)s + (1-¦ (q))r), we can note immediately by transitivity (A.2) that this implies that p >h q. The same argument works in reverse. Thus, ¦ (p) ³ ¦ (q) Û p >h q, i.e. ¦ (.) represents preferences ³ h on RS, and we are done. Q.E.D.

Proposition (Affinity): ¦ (.) is affine for all p, q Î RS, i.e. ¦ (a p + (1-a )q) = a ¦ (p) + (1-a )¦ (q).

To prove this, consider any p, q Î RS and define p¢ = a p + (1-a )q. As RS is convex, then p¢ Î RS for any a Î (0, 1). Thus, by unique solvability (L.2) there is a real number ¦ (p¢ ) such that p¢ ~h ¦ (p¢ )s + (1-¦ (p¢ ))r. But as p¢ = a p + (1-a )q and p ~h ¦ (p)s + (1-¦ (p))r by (L.2), then p¢ ~h a [¦ (p)s + (1-¦ (p))r] + (1-a )q by the independence axiom (A.4). Doing the same for q ~h ¦ (q)s + ((1-¦ (q))r, then we obtain p¢ ~h a [¦ (p)s + (1-¦ (p))r] + (1-a )[¦ (q)s + ((1-¦ (q))r]. Rearranging a bit, we obtain that p¢ ~h [a ¦ (p) + (1-a )¦ (q)]s + [a (1-¦ (p)) + (1-a )(1-¦ (q))]r, thus p¢ is equivalent to another convex combination of s and r. But, by unique solvability (L.2), there is only one a * such that p¢ ~h a *s + (1-a *)r. Thus, it must be that a * = ¦ (p¢ ) = [a ¦ (p) + (1-a )¦ (q)], or, by the definition of p¢ , ¦ (a p + (1-a )q) = a ¦ (p) + (1-a )¦ (q). This is the definition of affinity.

Let us now enter on our third stage and extend the representation and affinity results from RS to the entire set. To do so, we first need to prove the following claim:

Proposition: (Order-Preservation): If ¦ represents ³ h and is affine, then g = a + b¦ where b > 0 also (i) represents ³ h and (ii) is affine.

The proof is simple. (i) For any p, q Î D (X), then p ³ h q Þ ¦ (p) ³ ¦ (q) by representation of ¦ . Thus, if b > 0, then this implies a + b¦ (p) ³ a + b¦ (q), thus g(p) ³ g(q) by definition. (ii) As ¦ is affine, then ¦ (a p + (1-a )q) = a ¦ (p) + (1-a )¦ (q). Now by definition, g(a p + (1-a )q) = a + b¦ (a p + (1-a )q)) = a + b[a ¦ (p) + (1-a )¦ (q)] = a a + (1-a )a + ba ¦ (p) + b(1-a )¦ (q) = a [a + b¦ (p)] + (1-a )[a + b¦ (q)] = a g(p) + (1-a )g(q). Q.E.D..

Let us return to the extension of RS. By the definition of ¦ , s ~h ¦ (s)s + (1-¦ (s))r and r ~h ¦ (r)s + (1-¦ (r))r, thus ¦ (s) = 1 and ¦ (r) = 0. Now, Define RS1 = {p Î D (X) ½ s1 ³ h p ³ h r1} where s1 >h s and r >h r1, so obviously RS Ì RS1. Now, let us define ¦ 1 over RS1 a manner analogous to before, so that for any p Î RS1, then p ~h ¦ 1(p)s1 + (1-¦ 1(p))r1 and ¦ 1 is affine. Let us now find a1 and a b1 > 0 and thus a function g1 = a1 + b1¦ 1 such that g1(s) = a1 + b1¦ 1 (s) = 1 and g1(r) = a1 + b1¦ 1(r) = 0. If we think of D (X) as the real line and preferences increasing along it, then ¦ 1 and the adjustment to g1 can be represented as in Figure 2.

Now, define RS2 = {p Î D (X) | s2 ³ h p ³ h r2} where s2 >h s and r >h r2, so we again obtain RS Ì RS2. Defining ¦ 2 the same way as before on RS2, we can thus find now find a2 and b2 > 0 such that g2(s) = a2 + b2¦ 2(s) = 1 and g2(r) = a2 + b2¦ 2(r) = 0. Thus, g1(r) = g2(r) = 0 and g1(s) = g2(s) = 1

We now show that for any p Î RS1 Ç RS2 Þ g1(p) = g2(p). As p is in the intersection, then either p is inside, above or below RS. In other words, one of the following three cases will be true:

(i) s ³ h p ³ h r: Þ by unique solvability (L.2), $ a such that p ~h a s + (1-a )r

(ii) p >h s >h r: Þ by unique solvability (L.2), $ a such that s ~h a p + (1-a )r

(iii) s >h r >h p: Þ by unique solvability (L.2), $ a such that r ~h a s + (1-a )p

Consider now the consequences of the different cases: Case (i) implies that g1(p) = a g1(s) + (1-a )g1(r) = a by construction of g1. But it is also true that g2(p) = a g2(s) + (1-a )g2(r) = a again by construction, thus g1(p) = g2(p) = a . Case (ii) implies that 1 = g1(s) = a g1(p) + (1-a )g1(r) = a g1(p), so g1(p) = 1/a . But similarly, 1 = g2(s) = a g2(p) + (1-a )g2(r) = a g2(p), so g2(p) = 1/a . Thus, once again g1(p) = g2(p) = 1/a . Finally, Case (iii) implies that 0 = g1(r) = a g1(s) + (1-a )g1(p) = a + (1-a )g1(p), so g1(p) = a /(a -1). Similarly, 0 = g2(r) = a g2(s) + (1-a )g2(p) = a + (1-a )g2(p), so g2(p) = a /(a -1). Thus, again g1(p) = g2(p) = a /(a -1).

Thus, for every p Î RS1 Ç RS2, g1(p) = g2(p). Consider now an increasing sequence RS Ì RS1 Ì RS2 Ì RS3 Ì ...Ì D (X). At each step, we can define gi that represents preferences over RSi, but gi(p) = gi-1(p) = gi-2(p) = ... for all p Î RSi-1. Thus, let us define this common value gi(p) = gi-1(p) = U(p). We can thereby construct a U that represents preferences over the entire set D (X). Thus the first important part of the proof, the derivation of a utility function U:D (X) ® R from axioms (A.1)-(A.4) is finished.

 

Part II: (Representation and Affinity Þ Axioms).

If U: D (X) ® R is affine and represents preferences, then we want so show that the axioms (A.1)-(A.4) hold. Completeness is clear enough: as U is defined over D (X), then for any pair p, q Î D (X) then either U(p) ³ U(q) or U(p) £ U(q) or both. By representation, this implies (A.1). Similarly, for any triple, p, q, r Î D (X), by representation, U(p) ³ U(q) and U(q) ³ U(r) implies p ³ h q and q ³ h r. It then follows from the properties of the real number line that U(p) ³ U(r), thus p ³ h r, so transitivity (A.2) is done. The Archimedean axiom (A.3) is just as simple. By representation, U(p) ³ U(q) ³ U(r) implies p ³ h q ³ h r. We know by the properties of the real line (called the Archimedian axiom in fact), there is an a Î (0, 1) such that a U(p) + (1-a )U(r) ³ U(q). As, by affinity, a U(p) + (1-a )U(q) = U(a p + (1-a )q), then U(a p + (1-a )r) ³ U(q) so, by representation, a p + (1-a )r ³ h q. The same reasoning applies when choosing a b so b U(p) + (1-b )U(r) £ U(q), etc., thus we are done. Finally, for the independence axiom (A.4), note that if U(p) ³ U(q), then p ³ h q. Notice also that this implies that for a Î (0, 1), that a U(p) ³ a U(q). Thus, adding (1-a )U(r) from both sides a U(p) + (1-a )U(r) ³ a U(q) + (1-a )U(r). By affinity, as U(a p + (1-a )r) = a U(p) + (1-a )U(r) and U(a q + (1-a )r) = a U(q) + (1-a )U(r), thus U(a p + (1-a )r) ³ U(a q + (1-a )r) so, by representation a p + (1-a )r ³ h a q + (1-a )r. The reverse also applies by the same reasoning. Thus, the Archimedean axiom is finished.

 

Part III: (Uniqueness)

We now wish to turn to the "moreover" remark and prove that if both U: D (X) ® R and V: D (X) ® R represent preferences, then there is a c and b > 0 such that V = bU + c. Let us get rid of the trivial case first: if for all p, q Î D (X), p ~h q, then U(p) = k and V(p) = k¢ , thus V(p) = U(p) - (k-k¢ ), so c = (k - k¢ ) and b = 1. Now, suppose there is s, p, r Î D (X) such that s >h p >h r. Then define the following: HU(p) = [U(p) - U(r)]/[U(s) - U(r)] and HV(p) = [V(p) - V(r)]/[V(s) - V(r)]. By unique solvability (L.2), there is an a Î (0, 1) such that p ~h a s + (1-a )r. Thus HU(p) = [U(a s + (1-a )r) - U(r)]/[U(s) - U(r)] or, by affinity:

HU(p) = [a U(s) + (1-a )U(r) - U(r)]/[U(s) - U(r)] = a

Similarly, as HV(p) = [V(a s + (1-a )r) - V(r)]/[V(s) - V(r)], then HV(p) = a . This implies, then, that HU(p) = HV(p). Thus,

[U(p) - U(r)]/[U(s) - U(r)] = [V(p) - V(r)]/[V(s) - V(r)]

cross-multiplying:

[U(p) - U(r)][V(s) - V(r)] = [U(s) - U(r)][V(p) - V(r)]

or:

U(p)[V(s) - V(r)] - U(r)[V(s) - V(r)] = [U(s) - U(r)]V(p) - [U(s) - U(r)]V(r)

or simply:

V(p) = U(p)[V(s) - V(r)]/[U(s) - U(r)] - U(r)[V(s) - V(r)]/ [U(s) - U(r)] + V(r)

thus letting b = [V(s) - V(r)]/[U(s) - U(r)] and c = - U(r)[V(s) - V(r)]/ [U(s) - U(r)] + V(r), then:

V(p) = bU(p) + c

which is the form we wanted.

 

The Expected Utility Representation

We have now obtained the utility function U:D (X) ® R on the basis of the four axioms set forth earlier. However, we have not finished in proving the expected utility hypothesis, namely, that a utility function U:D (X) ® R has a representation:

U(p) = å xÎ Supp(p) p(x)u(x)

where u: X ® R is a elementary utility function on the underlying outcomes X. Note that as D (X) is the set of simple probability distributions on X, then if p Î D (X), then p has a finite support denoted Supp(p) Ì X. D (X), of course, is a convex set. Finally, we should note that by convexity, for any p, q Î D (X), a p + (1-a )q Î D (X) for any a Î (0, 1) and that, if p and q are simple probability distributions, then (a p + (1-a )q)(x) = a p(x) + (1-a )q(x) for any x Î X.

We now state the expected utility representation as a corollary to the earlier V-M theorem:

Corollary: (Expected Utility Representation) Let D (X) be the set of all simple probability distributions on X. Let ³ h be a binary relation on D (X). Then ³ h satisfies (A.1)-(A.4) if and only if there is a function u: X ® R such that for every p, q Î D (X):

p ³ h q if and only if å xÎ Supp(p) p(x)u(x) ³ å xÎ Supp(q) q(x)u(x).

Moreover, v: X ® R represents ³ h in the above sense if and only if there exist c and b > 0 such that v = bu + c.

Proof: From the V-M theorem, there is a U: D (X) ® R which represents preferences ³ h on D (X) and is affine. Now, define the function d x: X ® {0, 1} as d x(y) = 1 if y = x and d x(y) = 0 otherwise. This implies that for every x Î X, d x is a degenerate distribution, thus d x Î D (X). Let U(d x) = u(x). Now consider a distribution p = [p(x), p(y)]. Obviously, we can write this out as a convex combination of degenerate distributions d x and d y, i.e. p = p(x)d x + p(y)d y. Thus, U(p) = U(p(x)d x + p(y)d y) = p(x)U(d x) + p(y)U(d y) = p(x)u(x) + p(y)u(y) by affinity and our definition of u(x) and u(y). Thus, more generally, any distribution p with finite support can be written out as a convex combination of degenerate distributions, p = å xÎ Supp(p) p(x)d x, and thus we obtain U(p) = U(å xÎ Supp(p)p(x)d (x)) = å xÎ Supp(p) p(x)u(x) which is the expected utility representation of U(p). Thus as p ³ h q iff U(p) ³ U(q) by the von Neumann-Morgenstern theorem, then equivalently, p ³ h q iff å xÎ Supp(p) p(x)u(x) ³ å xÎ Supp(q) q(x)u(x).


(i) Cardinality

 

Conventional, non-stochastic utility functions u: X ® R are generally assumed to be ordinal, i.e. they are order-preserving indexes of preferences. By this we mean that the numerical magnitudes we give to u are irrelevant, as long as they preserve preference orderings. However, when facing the V-M expected utility decomposition U(p) = å x p(x)u(x), it is common to fall into the misleading conclusion that utility is cardinal, i.e. that utility here is a measure of preferences.

Of course, the elementary utility function u: X ® R within U(p) is cardinal. Why this is so is clear enough: if expected utility is obtained by adding up probabilities multiplied by elementary utilities, then the precise measure of the elementary utilities matters very much indeed. More explicitly, if u represents preferences over outcomes, then it is unique up to any linear transformation, i.e. if v represents preferences over outcomes, then there is a b> 0 and a such that v = bu + c. This can be regarded as the "definition" of cardinality.

What is important to realize is that even though the elementary utility function u is a cardinal utility measure on outcomes, the utility function over lotteries U is not a cardinal utility function. This is because the elementary utilities on outcomes are not primitives, rather the preferences on lotteries are the primitives. The utility function over lotteries U thus logically precedes the elementary utility function: the latter is derived from the former.

Suppose we have an agent whose ordinal utility function is known. Indeed, suppose that it’s our river-crossing fugitive. Let’s assign him the following ordinal utility function:

Escape >> 4

Death by shooting >> 3

Death by rockfall >> 2

Death by snakebite >> 1

Now, we know that his preference for escape over any form of death is likely to be stronger than his preference for, say, shooting over snakebite. This should be reflected in his choice behavior in the following way. In a situation such as the river-crossing game, he should be willing to run greater risks to increase the relative probability of escape over shooting than he is to increase the relative probability of shooting over snakebite. This bit of logic is the crucial insight behind the V-M solution to the cardinalization problem.

Begin by asking our agent to pick, from the available set of outcomes, a best one and a worst one. ‘Best’ and ‘worst’ are defined in terms of rational choice: a rational agent always chooses so as to maximize the probability of the best outcome -- call this W -- and to minimize the probability of the worst outcome -- call this L. Now consider prizes intermediate between W and L. We find, for a set of outcomes containing such prizes, a lottery over them such that our agent is indifferent between that lottery and a lottery including only W and L. In our example, this would be a lottery having shooting and rockfall as its possible outcomes. Call this lottery T . We define a utility function q = u(T) such that if q is the expected prize in T , the agent is indifferent between winning T and winning a lottery in which W occurs with probability u(T) and L occurs with probability 1  u( T).

We now construct a compound lottery T* over the outcome set {W, L} such that the agent is indifferent between T and T*. A compound lottery is one in which the prize in the lottery is another lottery. This makes sense because, after all, it is still W and L that are at stake for our agent in both cases; so we can then analyze T* into a simple lottery over W and L. Call this lottery r. It follows from transitivity that T is equivalent to r. The rational agent will now choose the action that maximizes the probability of winning W. The mapping from the set of outcomes to u(r) is a von Neumann-Morgenstern utility function (VNMuf).

What exactly have we done here? We’ve simply given our agent choices over lotteries, instead of over prizes directly, and observed how much extra risk he’s willing to run to increase the chances of winning escape over snakebite relative to getting shot or clobbered with a rock. A VNMuf yields a cardinal, rather than an ordinal, measure of utility. Our choice of endpoint-values, W and L, is arbitrary, as before; but once these are fixed the values of the intermediate points are determined. Therefore, the VNMuf does measure the relative preference intensities of a single agent. However, since our assignment of utility values to W and L is arbitrary, we can’t use VNMufs to compare the cardinal preferences of one agent with those of another. Furthermore, since we are using a risk-metric as our measuring instrument, the construction of the new utility function depends on assuming that our agent’s attitude to risk itself stays constant from one comparison of lotteries to another.

Consequently, the utility function U:D (X) ® R itself is an ordinal utility function since any increasing transformation of U will preserve the ordering on the lotteries. In other words, if U represents preferences on lotteries, then so does V = ¦ (U) where ¦ is an increasing monotonic transformation of U, e.g. if U(p) is a representation of the preference ordering on lotteries D (X), then V(p) = [U(p)]2 = [å xp(x)u(x)]2 is also a representation of the preference ordering on lotteries. However, there is one sense in which U still carries an element of cardinality. Namely, given U(p) = å x p(x)u(x), then v: X ® R will generate an ordinally equivalent V(p) = å x p(x)v(x) if and only if v = bu + c for b > 0 - and thus V = bU + c. Thus, in V-M theory, we have a "cardinal utility which is ordinal".

    Home | About Crown Hill | Exchange Traded Products | Institutional Investors | Investment Process