The Mathematical Garden

 | Welcome  | Games  | Mathematical methods  | About us |
Games
Black and White
HEX
Magic
Mark Six
Recursion, Games and Strategy
The Towers of Hanoi
Sample game - tower of Hanoi
The Fibonacci Numbers
Finding the Maximum Element
A matches game
Sample game - matches
A knapsack problem
A gambler choice
SIM

A gambler choice

A gambler has $2. He is allowed to play a game of chance four times, and his goal is to maximize his probability of ending up with a least $6. If the gambler bets $ b dollars on a play of the game, then with probability 0.4, he wins the game and increase his capital position by $ b dollars with probability 0.6, he loses the game and decreases his capital position by $ b dollars. On any play of the game, the gambler may not bet more money than he has available. Determine a betting strategy that will maximize the gambler's probability of attaining a wealth of at least $ 6 dollars by the end of the fourth game. We assume that bets of zero dollars (that is, not betting) are permissible.

Define ft(d) to be the probability that by the end of game 4, the gambler will have at least $6, given that she acts optimally and has d dollars immediately before the game is played for the tth time.

If we give the gambler a reward of 1 when his ending wealth is at least $6 and a reward of 0 if it is less than, ft(d) will equal the maximum expected award that can be earned during games t, t+1, ... 4. if the gambler has d dollars immediately before the tth play of the game.

If the gambler is playing the game for the fourth and final time, his optimal strategy is clear: if he has $6 or more, don't bet anything, but if he has less than $6, bet enough money to ensure (if possible) that he will have $6 if he wins the last game. Note that if he begins game 4 with $0, $1, or $2, there is no way to win (no way to earn a reward of 1). This reasoning yield the following results:
f4(0) = f4(1) = f4(2) = 0

f4(3) = f4(4) = f4(5) = 0.4

f4(d) = 1     for d ³ 6.
For t £ 3, we can find a recursion for ft(d) by noting that if the gambler has d dollars, is about to about to play the game for the tth game, and bets b dollars, then the following summarizes what can occur:

  • with probability 0.4 win the game t, expected reward ft+1(d+b);

  • with probability 0.6 lose the game t, expected reward ft+1(d-b).

Thus if the gambler has d dollars at the beginning of game t and bets b dollars, the expected reward will be 0.4 ft+1(d+b) + 0.6 ft+1(d-b). This leads to the following recursion:
ft(d) =
max
0 £ b £ d 
max
{ 0.4 ft+1(d+b) + 0.6ft+1(d-b) }.
Compute f1(2) ?

Department of Mathematics, HKU, 2010