How To Really Eliminate The Staircase Paradox?

The staircase paradox: A staircase with a base of 4 centimetres (cm) and height of 4 cm. There are two stairs, each with a length of 2 cm and height of 2 cm.

The staircase paradox has made me think a lot lately. Ever since I wrote the article “Can You Really Solve The Staircase Paradox?”, there have been quite a few hard technical discussions in the comments section.

When I originally wrote that article, I aimed to avoid a strict mathematical approach and keep things simple. By doing this, I thought it would make the topic accessible for anyone (which, in fairness, turned out to be the case). Having said this, as the comments and criticism grew more and more technical, I realized that the original article was ill-equipped to do justice to the mathematical finesse behind this paradox.

In this article, I aim to address the technical challenges behind this phenomenon. To start, I’ll briefly recap what this paradox is about. Then, I’ll report some of the common technical push-backs/criticisms that the original article received and give my take on them. Finally, I will employ a rigorous mathematical approach to demonstrate how to really eliminate this paradox.

In doing so, I hope to address all open questions and concerns. If this is not the case, consider this article a technical conversation starter. One way or another, science shall move forward.

This essay is supported by Generatebg

Recap — What is the Staircase Paradox?

The staircase paradox is a geometrical problem in which you are trying to reach the top of a staircase from a diagonally opposite starting point. The path you are to trace is up and along the stairs. It is important to note that we are interested in the distance covered along the stairs in both the horizontal direction as well as the vertical direction.

To aid convenient calculations later on, let us consider a staircase that is 4 Centimetres (cm) long and 4 cm high. Each equidistant step is (4/n) cm long and (4/n) cm high, where n is the total number of stairs. Now, we are interested in the total distance covered by the stairs (both horizontally and vertically). Henceforth, I’ll refer to this distance as the total length of the stairs.

The staircase paradox: The left-most figure features a staircase with 8 stairs. The middle figure features a staircase with 16 stairs. The right-most figure features a staircase with 32 stairs. In each of these cases, the total length of the stairs sums up to 8 cm. — Image created by the author

Here, you can see that the leftmost staircase has 8 stairs. The centre staircase has 16 stairs, and the rightmost staircase has 32 stairs. Although we have doubled the number of stairs twice from 8, we see that the total length of the stairs remains constant at 8 cm, which is explained by the following generalized equation for any number of stairs (n):

Total length of stairs = [Sum of horizontal components] + [Sum of vertical components] = [n*(4/n)] + [n*(4/n)] = 4 + 4 = 8 cm.

This equation also shows that the total length of the stairs is seemingly independent of n.

What Happens at The Limit of Infinity?

Something that you might have already taken note of is the fact that as n increases, the stairs start to resemble a line. It is important to note here that they only resemble a line as long as n is very large, yet finite. In fact, they are arguably a bad approximation of a line at this stage.

Although the total length of the stairs is seemingly independent of the number of stairs (n), something very peculiar happens when we enforce a limit of infinity. As n tends to infinity, the stairs seem to transform into a straight line, which is also a hypotenuse to the right-angled triangle that represents the staircase at this point.

The staircase paradox: a right angled triangle with a base of 4 cm and height of 4 cm. The hypoteneuse is calculated using the Pythagorean theorem as: √(4² + 4²) = 4√2 cm. — Image created by the author

When we calculate the length of the hypotenuse using the Pythagorean theorem, we get a length of 4√2 cm instead of 8 cm. How did we jump from 8 cm to 4√2 cm? This is essentially the problem statement of the paradox.

Total length of the stairs = √(4² + 4²) = √(16 + 16) = 4√2 cm.

Now that we have covered the basic problem statement, let’s jump directly into some of the criticisms that the previous article received from readers and math enthusiasts.

Comparing Apples and Oranges

alvaro ferreira suggested that this is a pseudo paradox because we are comparing apples with oranges. According to his argument, on the one hand, Curve A is the composition of n-segments of the length and the height (4/n and 3/n) of the triangle. On the other hand, Curve B is the union of n-segments of the hypotenuse (5/n) of the Pythagorean triangle. Please note that the numbers quoted here vary because different numbers were used in the original article.

“For sure (and naturally), the corresponding lengths ought to conduct to different values. QED”
— alvaro ferreira

Here is another argument along a similar vein:

“What paradox?

You have two measures of the length after n divisions. By one measure the length is n*7/n = 7, the other is n*5/n=5. These numbers don’t change as n goes to infinity. In other words

lim n->infinity (n*7/n)/(n*5/n) = 7/5, not 1.

Your paradox is akin to arguing that:

7×0=5×0 proves that 7=5.”

– Roeland Rengelink

My favourite critique came from brian vant-hull

“Your segments never approach a tangent to the line, unlike Archimedes approximation of the circumference of a circle.”
— brian vant-hull

This is my favourite critique because it knocks at the door of the core issue with the paradox. It is a great start that I would have liked to build further upon. brian vant-hull, however, concludes by saying that the fact that the stairs do not tangentially approach the hypotenuse is sufficient to say that the lengths will never converge. According to brian vant-hull, this can be considered a sufficient solution to the paradox.

Summarising Critique and Laying Foundations for Proof

All of these critiques can be summarized in three points:

1. The so-called staircase paradox compares two different measurement systems (metrics) that are bound to produce different lengths. — C1

2. The stair-based measurement system (metric) does not reduce the error term, that is, the total length of the stairs does not approach the length of the hypotenuse. — C2

3. The approximation of the hypotenuse only works accurately if the perimeter function approximating it does so tangentially to the hypotenuse. This way, the error term can reduce to zero as n tends to infinity. With stair-like sawtooth perimeter functions, this is not possible. — C3

Henceforth, I will be referring to these points as C1, C2, and C3 respectively. Furthermore, I will be referring to the stair based measurement system as M1 (metric-1) and the hypotenuse based measurement system as M2 (metric-2).

Laying the Foundations for Proof

In order to challenge all of the critique points listed above, I have to somehow manage to find a way for M1 to converge smoothly to M2 as n tends to infinity.

If I accomplish this, the following results would occur:

1. M1’s convergence to M2 as n tends to infinity would disprove the claim that the M1 is bound to produce a different length as compared to M2. That is, c1 would be disproved.

2. M1’s convergence to M2 as n tends to infinity would mean that the error term in approximation diminishes to zero. That is, C2 would be satisfied.

3. M1’s convergence to M2 as n tends to infinity would mean that tangential geometry is not necessary and sufficient for accurate approximation. That is, C3 would be disproved.

4. Finally, M1’s convergence to M2 as n tends to infinity would also establish a way to eliminate the staircase paradox — our main goal here!

Now that I have laid the foundations for potential proof, let us see where I get with this.

Flipping the Staircase Paradox on its Head

I’ll start by dissecting the geometrical anatomy of each stair in relation to the hypotenuse. To do this, let us first zoom into a section of the stairs and then rotate this section such that the hypotenuse appears horizontally to us.

The staircase paradox: A section of the stairs is zoomed in, and rotated such that the hypotenuse becomes horizontal, and the stairs appear like triangles with sections of the hypotenuse as their base. — Image created by the author

Now that we are all zoomed in, note that each stair forms 2 triangles with 3 measurable quantities each: the base (from the hypotenuse) — b, the height perpendicular to the hypotenuse — h, and the one remaining side of the triangle of length — l.

The staircase paradox: One of the triangles from the zoomed-in section is considered, and a perpendicular line from its tip is projected to the base (hypotenuse segment). Thus, this triangle is split into two. We now have two triangles, each with a base b, height h, and length l. — Image created by the author

The base of the triangle is given by the length of the hypotenuse divided by 2n. This is because we have split each stair into two equal triangles. There are 2n base segments in total.

b = (4√2)/2n cm

Using geometric symmetry, we can prove that the height of the smaller triangle is the same as its base.

The staircase paradox: Using geometric symmetry, each smaller triangle forms one quarter of a square. Since the diagonals of a square are equal in length, the height of each triangle is equal to is base. — Image created by the author

This is because, this triangle forms one quarter of a projected square. When reflected, the base and height form the diagonals of the square.

h = b = (4√2)/2n cm.

However, there is a subtle but important detail we need to note here. The base b is solely a function of M2 (the hypotenuse based measurement system) and n (total number of stairs). The height h is a function of M1 (the stair based measurement system), M2, and n. The weird thing about the total length of the stairs (henceforth denoted as M1_n) is that it need not be 8 cm long; 8 cm is just a special case. It can be both bigger than 8 cm and smaller than 8 cm (for any given n) depending upon what we choose as the M1 perimeter function.

The staircase paradox: An image that shows that M1_n can be greater than, or equal to, or lesser than 8 cm. It’s length depends upon the choice of the perimeter function. — Image created by the author

To take this effect into consideration, let us define ‘s’ as a variable (s is restricted to positive numbers only) to represent variable height (note that variable height here does not refer to ‘h’, but rather the fact that sum of all h terms for any given n is variable depending upon the choice of the perimeter function).

I take this step is to generalize the solution for any M1 stair-like sawtooth perimeter function we may choose. Consequently, we obtain the following height:

h = s/2n cm.

Now, using the Pythagorean theorem, we can calculate the length of the side l as follows:

l = √[((4√2)/2n)² + (s/2n)²] = (1/2n)* {√[(4√2)² + s²]} cm.

Computing M1_n — The Total Length of Stairs

Now, we know that the shortest length that connects the bottom end of the staircase to the top end is the hypotenuse (4√2 cm). Therefore, by logical deduction, the path along the stairs has to be longer than the hypotenuse. That is, M1_n has to be greater than M2_n, where M2_n denotes the length of the hypotenuse for n stairs.

In order to calculate the total length of the stairs (M1_n), we need to multiply the length ‘l’ we just calculated by 2n since we have 2n such segments.

M1_n = (2n)* (1/2n)*{√[(4√2)² + s²]} = √[(4√2)² + s²] cm.

If 0 < s < infinity, then 4√2 < M1_n < infinity

It turns out that the total length of the stairs, M1_n, does not depend on n. This is not a surprising result. The ‘s’ term in there can be considered the error term. For an ‘s’ value of 4√2 cm, we get the total length of the stairs for any n value as M1_n = 8 cm (the special case).

The fact that M1_n is not dependent on n goes to show that whenever n increases, the height h drops by the perfect proportion to keep the total length M1_n constant. This also seems to align with the argument as to why M1 and M2 never converge.

Bounded Area and Boundless Perimeter

We can summarise our mathematical findings so far into 3 points:

1. M1_n = √[(4√2)² + s²], where 0 < s < infinity and 4√2 < M1_n < infinity

2. M1_n does not depend on n.

3. Height of each stair, h = s/2n

The first significant point to notice here is that M1_n is greater than 4√2 for any n as long as ‘s’ is positive.

The second significant point is that the area under the stair-like M1-perimeter function would approach the area of the triangle as n increases.

The third significant point to note here is that M1_n can be as large as we want by defining an ‘s’ as large as we want (both have an upper bound of infinity).

Consequently, the area of the stair-like sawtooth perimeter function is bounded by the minimized error term plus the area of the staircase triangle (that is completed by the hypotenuse), whereas its perimeter is boundless (depending upon our choice of ‘s’). This is a vital point to consider when trying to eliminate this paradox.

How to Really Eliminate the Staircase Paradox?

We know that each stair’s height is given by h = s/2n. On the other hand, the total length of the stairs at n divisions (stairs) is given by M1_n = √[(4√2)² + s²].

Based on this, we know that M1_n would approach M2_n (4√2 cm) only when ‘s’ approaches 0. If s = 0, the height, h = s/2n = 0 as well. So, setting ‘s’ to zero is not really an option.

However, there is another subtle, yet important realization here. The choice of the variable ‘s’ matters. With h = s/2n, there is already an argument that with a very small choice of s, at the limit of infinity, h would tend towards zero, and thereby M1_n would smoothly converge to M2_n (4√2).

However, we can get even more rigorous here! Since l = (1/2n)* √[(4√2)² + s²], we have more freedom of choice when it comes to the variable ‘s’.

What if we consider non-linear sequences for s that decrease at a faster rate towards zero as n increases linearly towards infinity?

Final Stretch

Let this class of s-variable be represented by s_n (as we now choose ‘s’ as a function of n). Some examples of such sequences include 1/n!, 1/n², 1/2^n, etc. Let us consider that s_n = 1/n.

Then, the respective base, height, and length of the small triangle are calculated as follows:

b = 4√2/2n cm. h = s_n/2n = 1/2n² cm. l = √[(4√2/2n)² + (s_n/2n)²] = √[(4√2/2n)² + (1/2n²)²] cm. — Math illustrated by the author

Here are the respective dimensions represented graphically along with the triangle from each stair:

The staircase paradox: A graphical representation of the base, height, and length that we just calculated along with the triangle that belongs to each stair. — Image created by the author

Based on this, M1_n is calculated as follows:

M1_n = = (2n)* √[(4√2/2n)² + (1/2n²)²] = √[(4√2)² + (1/n²)] cm — Math illustrated by the author

When we apply the limit, that is, as n tends to infinity, 1/n² (the error term!) tends to zero. This version of M1_n then smoothly approaches M2_n.

lim n->infinity [1/n²] = 0. Therefore, lim n->infinity [M1_n] = 4√2 cm. — Math illustrated by the author

By proving this, I have established the following:

1. C1 is disproved — M1 is NOT necessarily bound to produce a different length as compared to M2.

2. C2 is satisfied — The error term diminishes to zero as n tends to infinity.

3. C3 is disproved — Tangential geometry is NOT necessary and sufficient for accurate approximation. Update post publishing: This statement is incorrect and is retracted, as proven by the argument from brian vant-hull (please refer to quote below).

4. The staircase paradox is eliminated!

Update Post Publishing — C3 is NOT Disproved!

“In showing an example of triangles for which the height decreases faster than 1/n, you’ve actually created triangles that approach the tangent as n increases! So you did not in fact demonstrate that tangential functions are not necessary.”
— brian vant-hull

The Mathematics Behind Eliminating the Staircase Paradox

Here is a summary of all the mathematical steps that I took to eliminate the staircase paradox:

1. I first conducted the geometrical anatomy of each stair.

2. I established that the base is solely a function of M2 and n, whereas the height and length of each stair are functions of M1, M2, and n. I also established the corresponding mathematical relations.

3. I established that choice of the stair-like sawtooth perimeter function via the variable ‘s’ matters. Consequently, the height (sum of all perpendicular distances from the tip of each stair to the hypotenuse) plays a significant role.

4. I then demonstrated that ‘s’ can be chosen as a function of n such that it diminishes to zero at a faster rate than the rate at which n increases (linearly) towards infinity.

Key Takeaway

The primary takeaway from this exercise is that whenever it comes to building approximation systems, minimizing the error term smoothly and establishing non-linear error-minimization (acceleration) are of paramount importance.

Final Remarks

It is important to note that with the original version of the stairs we started with, M1_n would never approach M2_n.

What I have shown here is that there exists a class of stair-like sawtooth functions that enable M1_n to smoothly approach M2_n as n increases. So, in my opinion, we have just figured out a way of eliminating this paradox and not solving it outright.

I do not claim that I am correct with this analysis. I have tried my best and am willing to be wrong. Normally, work like this would be scientifically peer-reviewed. In this case, it is not. I am the only person who has worked on it, and I have tried my best to verify everything. There could be mistakes. So, one could consider this article as an appeal to open science.

Constructive criticism is welcome. However, I implore mathematical rigour from anyone who wishes to challenge the contents of this article.

Credit: My approach to attacking this paradox was inspired by the work of Mr. Hassan Sedaghat.

I hope you found this article interesting and useful. If you’d like to get notified when interesting content gets published here, consider subscribing.

Further reading that might interest you: How To Solve The Dartboard Paradox? and Why Do You See Mirrors Flipping Words?

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_R5WSNS3HKS	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_131795354_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.

Cookie	Duration	Description
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.

Cookie	Duration	Description
_learn_press_session_a7b7f6513d11f58744fec86fbc57b116	2 days	No description
_wordpress_lp_guest	1 hour	No description
GoogleAdServingTest	session	No description