The Monty Hall problem or paradox is famous and well-studied. But what confused me about the description was an unstated assumption.

Suppose you're on a game show, and you're given the choice of three doors: behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, "Do you want to pick door No. 2?" Is it to your advantage to switch your choice?

The assumption is that the host of the show does not have a choice whether to offer the switch. In fact, Monty Hall himself, in response to Steve Selvin's original formulation of the problem, pointed out that as the host he did not always offer the switch.

Because the host knows what's behind the doors, it would be possible and to his advantage to offer a switch more often to contestants who guess correctly. If he only offered the switch to contestants who guess correctly, all contestants who accept the offer would lose. However, if he did this consistently, the public would learn not to accept the offer and soon all contestants who first guess correctly would win.

If, for instance, he gave the offer to one third of incorrect guessers and two thirds of correct guessers, 2/9 contestants would be given the offer and should not switch and 2/9 contestants would be given the offer and should, which would bring the chances of winning back to 1/2 whether one accepts the offer or not, instead of 1/3 or 2/3.

Is this a Nash equilibrium for the Monty Hall problem (or the iterated Monty Hall problem) as a two-player zero-sum game? And if not, what is it, or aren't there any?

  • 2,024
  • 1
  • 14
  • 28
Eric Mickelsen
  • 243
  • 1
  • 7

2 Answers2


The car probably doesn't come out of the host's salary, so he probably doesn't really want to minimize the payoff, he wants to maximize the show's ratings. But OK, let's suppose he did want to minimize the payoff, making this a zero-sum game. Then the optimal value of the game (in terms of the probability of winning the car) would be $1/3$. An optimal strategy for the contestant is to always refuse to switch, ensuring that the expected payoff is $1/3$. An optimal strategy for the host is never to offer a switch unless the contestant's first guess is correct, ensuring that the expected payoff is no more than $1/3$.

Robert Israel
  • 416,382
  • 24
  • 302
  • 603

Thanks for your answer, Robert. If the optimal value is 1/3 as you showed, then I suppose there must be infinitely many mixed strategies that the host could employ that would be in equilibrium. If, as I mentioned in the question, the host offers the switch to 2/3 of correct guessers and 1/3 of incorrect guessers, 1/9 contestants will guess correctly and not be offered a switch, winning immediately. Also, 2/9 will be correct guessers offered a switch and 2/9 will be incorrect guessers offered a switch. Therefore 4/9 will be offered a switch and 2/9 will win, whether they accept it or not, since their expected value will be 1/2 whether they accept the switch or not. The rest of the contestants lose immediately.

1/9 + 2/9 = 1/3

This means the host can offer any number between 0 and 2/3 of his contestants a switch without changing the expected value, as long as the number of correct guessers and incorrect guessers he offers a switch is the same. He can accomplish this easily with a mixed strategy of making correct guessers exactly twice as likely to receive an offer.

With any of this family of strategies, the host then cannot possibly ensure a better outcome than 1/3, which he has done. And the contestant will have the same expected value regardless of their strategy, so they cannot improve. So, these are also Nash equilibria. And you would have to agree, from a practical perspective, the host should employ one of these highly mixed strategies so that the game is more exciting, without harming his bottom line. I don't have any direct evidence, but I would hazard a guess that this is roughly what Monty Hall actually did.

The surprising thing about this is that the naive answer to the classic Monty Hall problem, "No, there is no benefit," (50/50) can be correct under reasonable assumptions.

Eric Mickelsen
  • 243
  • 1
  • 7