How To Make Sense Of Bertrand’s Box Paradox?

Bertrand's Box Paradox - Three boxes are placed side by side. Box 1 contains 2 gold coins. Box 2 contains one silver coin and one gold coin. Box 3 contains 2 silver coins.

Joseph Louis François Bertrand was a genius French mathematician who lived in the 19th century. By the time he was just nine years old, he had already lost his father. But even at that age, Bertrand could speak Latin fluently and was well-versed in mathematical concepts.

A black and white portrait of Joseph Louis François Bertrand - Image courtesy of Wikimedia Commons — Joseph Louis François Bertrand — Image courtesy of Wikimedia Commons

Between the ages of eleven and seventeen, he completed two bachelor’s degrees and a PhD in the mathematical theory of electricity. He went on to make significant contributions in the fields of number theory, probability theory, differential geometry, economics, and thermodynamics. Among many other things, he was the first to define real numbers using a notion that we now refer to as a Dedekind cut.

In 1889, Bertrand published his work titled “Calcul des Probabilités”, in which he wrote about a counterintuitive paradox involving probability. It starts with a simple puzzle.

You are presented with three identical boxes. Each box has two drawers, and inside each drawer, there is one coin. The first box contains two gold coins, one in each drawer. The second box contains one gold coin and one silver coin, one in each drawer. The third box contains two silver coins, one in each drawer.

Now, all the drawers are closed and the boxes are shuffled. You are presented with one box at random. When you open the first drawer at random (either the left-hand side drawer or the right-hand drawer), you find a gold coin inside. You take the coin out and keep it with you. Without replacing the coin that you just took out, what is the probability of finding another gold coin inside the same box (inside the second drawer)?

This is the fundamental scenario behind Bertrand’s box paradox. Why don’t you give some thought to this question? Once you have an answer in your mind, you will be able to appreciate what follows much more.

The Intuitive Answer to Bertrand’s Puzzle

To begin, we know for certain that the coin that you first drew was a gold coin. This means that the box that was presented at random was definitely not the one with two silver coins (Box 3). So, we can eliminate it from our consideration for what could be in the second drawer.

As a result, we are now left with two potential options: the second drawer could either contain a silver coin or a gold coin. In other words, the random chance leads us to only two equally probable outcomes. Based on this, it makes sense to wager that we have a 50–50 probability split.

However, Bertrand argued that this is objectively not the case. To be fair, a 50–50 split was my first consideration as well (such is the tricky nature of probability). Let us now move on to Bertrand’s reasoning behind his claim.

The Objective Answer to Bertrand’s Puzzle

The are two mistakes/fallacies in the intuitive approach we just considered:

1. We simply eliminate the box with two silver coins (Box 3) from the probability equation altogether.

2. After reducing the problem to two boxes only, we “assume” an equal probability split (50–50).

Bertrand’s point was to consider the probabilities right from the start. We know that you were presented with a box at random and that the coin you drew from the first drawer was a gold coin.

This essay is supported by Generatebg

Since the box you were presented was at random, we originally had three equally probable boxes:

1. Box 1 — with two gold coins.

2. Box 2 — with one silver coin and one gold coin.

3. Box 3 — with two silver coins.

The fact that you drew a gold coin from the first drawer does not alter the probabilities of choosing one of these boxes retroactively. Now, considering that we had three equally probable boxes originally, AND the fact that the first coin was gold, we need to consider the following three cases:

1. Case 1 — The second draw could be the remaining silver coin from Box 2

2. Case 2 — The second draw could be one of the two gold coins from Box 1

3. Case 3 — The second draw could be one of the two gold coins from Box 1

If you are wondering why Cases 2 and 3 both refer to one of the coins from Box 1, there are two possibilities here, depending on which gold coin you drew in the first place. This fact becomes clear if we choose to number the coins as follows:

The objective answer to Bertrand's box paradox: Three cases are presented side by side. In each case, the three boxes are represented vertically in one column. Box 2 (contains coins 3 and 4) is below Box 1 (contains coins 1 and 2) and Box 3 (contains coins 5 and 6) is below Box 2 in each case. In case 1, the first draw is a gold coin from Box 2 (the silver coin 4 remains). In case 2, first draw is 1-gold from Box 1 (2-gold remains). In case 2, first draw is 2-gold from Box 1 (1-gold remains). — The objective answer to Bertrand’s puzzle — Illustration created by the author (arrows indicate first draws)

We can now compute the final probability by simply counting the contribution to a gold-coin second draw in each (equally probable) case as follows:

Probability of drawing a second gold coin after first draw = 0/3 (Case 1) + 1/3 (Case 2) + 1/3 (Case 3)

= 2/3

Therefore, the probability is, in fact, 2/3 and NOT 1/2.

Final Comments

If you are still struggling to wrap your head around why the final probability is 2/3 (and not an equal split), let us go back to the first draw. If you draw a gold coin in your first draw, you are twice as likely to have drawn from the drawer with two gold coins (2/3) as opposed to the drawer with one gold coin and one silver coin (1/3). So, the assumption that it is a 50–50 split is incorrect.

Update post publishing:

After publishing this essay, I wrote a program to simulate Bertrand’s box experiment (you can find this below at the end of the essay). I realised that the probability split of drawing a gold coin in the first draw is actually 50–50 (there are three gold coins and three silver coins at the start).

But the probability of drawing a gold coin in the second draw having already drawn a gold coin in the first draw rises to 2/3. I made a mistake in the above paragraph when I stated that the probability of drawing a gold coin in the first draw is 2/3.

I apologise for this mistake and am leaving the above paragraph untouched to keep track of the error historically and for completeness.

Probability is tricky and Bertrand knew it, which is why he made the effort to discuss this (veridical) paradox in his work. The funny thing is, more than a century later, we are arguably none the better.

If you ask me, human beings are not genetically wired to be good at probability. Any probabilistic computation we are good at is not done by actively thinking but by our subconscious subsystems. So, in my opinion, paradoxes like Bertrand’s Box are here to stay.

We need to accept our nature and continue to seek objectivity whenever we are dealing with probabilities. We cannot simply trust our intuition!

Jupyter Notebook Code to Simulate Bertrand’s Box Experiment:

import numpy as np

# Define number of trials
num_trials = 1000000 # Change as per requirement

# Define three boxes: box0, box1, and box2
box = [0, 1, 2]

# Define the coins in each box
box0 = [1, 1] # Two gold coins
box1 = [1, 0] # One gold, one silver
box2 = [0, 0] # Two silver coins

# Initialize the counters for the first and second coins drawn
count_first_draw_gold = 0
count_second_draw_gold_given_first_draw_gold = 0

# Define the simulation loop
for i in range(num_trials):
    # Define variable for conditional probability.
    first_outcome_condition_gold = 1 # The first drawn coin must be gold
    
    # Select one of the three boxes randomly
    selected_box = np.random.choice(box)
    
    # Only box0 and box1 contain at least one gold coin.
    # If selected_box is box2, no need to check since no gold coin is present
    if selected_box == 2:
        continue

    #Draw the first coin from box0 or box1
    if selected_box == 0:
        box_outcome = np.random.choice(box0)
    else:
        box_outcome = np.random.choice(box1)

    # Check if the first draw results in a gold coin:
    if box_outcome == first_outcome_condition_gold:
        # increment count_first_draw_gold counter
        count_first_draw_gold += 1

        # Given, the first draw is gold, the second draw will ALWAYS be gold from box0
        # Given, the first draw is gold, the second draw will NEVER be gold from box1
        if selected_box == 0:
            count_second_draw_gold_given_first_draw_gold += 1

# Compute the frequentist outputs:
probability_first_draw_gold = count_first_draw_gold / num_trials
probability_second_draw_gold_given_first_draw_gold = count_second_draw_gold_given_first_draw_gold / count_first_draw_gold

print("Probability of getting a gold coin in the first draw: ", probability_first_draw_gold)
print("Probability of getting a gold coin in the second draw given a first gold draw: ", probability_second_draw_gold_given_first_draw_gold)

Probability of getting a gold coin in the first draw:  0.501006
Probability of getting a gold coin in the second draw given a first gold draw:  0.6668862249154701

If you’d d like to get notified when interesting content gets published here, consider subscribing.

Further reading that might interest you:

If you would like to support me as an author, consider contributing on Patreon

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_R5WSNS3HKS	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_131795354_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Duration	Description
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.

Cookie	Duration	Description
_learn_press_session_a7b7f6513d11f58744fec86fbc57b116	2 days	No description
_wordpress_lp_guest	1 hour	No description
GoogleAdServingTest	session	No description

How To Make Sense Of Bertrand’s Box Paradox?

The Intuitive Answer to Bertrand’s Puzzle

The Objective Answer to Bertrand’s Puzzle

Final Comments

Jupyter Notebook Code to Simulate Bertrand’s Box Experiment:

Explore humanity's most curious questions!

Sign up to receive more of our awesome content in your inbox!

Comments

Leave a Reply Cancel reply