Limit associated with a recursion

Question

Update: a full solution to the recursion below has now been found, and it is discussed here.

If $z_n < 2y_n$ then

$y_{n+1} = 4y_n - 2z_n$
$z_{n+1} = 2z_n + 3$

Else

$y_{n+1} = 4y_n$
$z_{n+1} = 2 z_n - 1$

Consider the following limit:

$$\lim_{n\rightarrow\infty} \frac{1}{n}\left(z_{n+1} - \sum_{k=1}^n z_k\right) \tag{$\star$}$$

The limit may or may not exist, and it may or may not depend on the initial conditions. Let us assume that the initial values $y_1$ and $z_1$ are strictly positive integers.

Question: If $y_1 \neq 2 z_1$ and $2y_1 \neq z_1$, is it true that $(\star)$ is always $1$, and otherwise the limit is always $3$?

Now let's the fun begin...

Let $d_n=\frac14(z_n-2z_{n-1}+1)$. The sequence $d_n$ represents the binary digits of some unknown number $x$, a number that depends on the initial conditions. It turns out that if $y_1=2, z_1=5$ then that number is $x=\sqrt{2}$. You may ask: so what?

Here is where it becomes fascinating:

If you answer my question about the limit converging to $1$, say for $y_1=2, z_1=5$, then you proved that exactly 50% of the binary digits of $\sqrt{2}$ are zero.

An article about this recursion (with connection to the digits distribution) can be found here. An application to the gaming industry can be found in section 2.1 in the following article. The source code I used in my computations is accessible here (Perl code using the Bignum library for exact arithmetic.) I can produce a large number of $z_n$ on request, for those interested and not familiar with high precision computing.

Hints and expectations

The case with the limit converging towards $1$, I call it the standard case. At this point, the result about the limit is still a conjecture. It is based on the fact that for a number such as $\sqrt{2}$, the distribution of the binary digits is believed to be uniform on $\{0,1\}$. Generally speaking, the limit is equal to $4p-1$ where $p$ is the proportion of binary digits equal to one in the number $x$ in question: this fact is easy to prove, almost trivial, as the formula in the limit was built with that goal in mind.

Also note that in the standard case, you can never have $y_n = 2z_n$ or $2y_n = z_n$ at any iteration $n$, otherwise, it would force the convergence towards $3$ according to this conjecture, and imply that $x$ is rational. Also, if the limit, rather than being $1$ was (say) $1.2$, it would imply that 55% of the binary digits of $\sqrt{2}$ are equal to one. Even proving (in the standard case) that the limit is strictly above $-1$ or strictly below $3$ would be a spectacular discovery: it would imply that the proportion of digits of $\sqrt{2}$ equal to zero is strictly positive (that is, it does not tend to zero as you look at longer and longer sequences of successive digits.) To this day, whether this fact is true or not remains a mystery.

I don't know if proving my conjecture is easy, extremely difficult, or impossible to prove or disprove. But I will accept any answer that brings some light to help answer my question, even if it is just a reference to relevant, very interesting previous work. Also, insights about the case where $y_1$ or $z_1$ are not integers, are welcome: for the few cases I tested so far, the limit was equal to $1$, unless $y_1=2z_1$ or $2y_1 = z_1$, but convergence can be slow when we are close to $y_1=2z_1$ or $2y_1 = z_1$.

Below is a chart showing convergence to $1$ (relatively slow, of the order $1/\sqrt{n}$ just like for the central limit theorem - not a coincidence - and chaotic) using $y_1=2, z_1=5$. The X-axis represents $n$, the number of iterations. The convergence is much faster when the limit is $3$.

Suggested path to solve this problem

One way to handle this problem is to consider the sequence $d_n$ of digits of $x$ as a realization of an ergodic stochastic process. Such processes have an equilibrium distribution, called attractor in chaos theory, even though the dynamical system considered here is entirely deterministic. See my article on the theory of randomness, here, and my book Applied Stochastic Processes, Chaos Modeling, and Probabilistic Properties of Numeration Systems, here. The equilibrium distribution is solution to a stochastic integral equation. Here we are dealing with discrete values, and with a discrete stochastic equation that seems to only have two solutions (the standard case, and the other one.) Assume that the probability (at any iteration $n$) for $d_n$ to be equal to one, is $p$. At equilibrium, the probability for $d_{n+1}$ to be equal to one, must also be $p$. This yields a stochastic equation, based on the recurrence system mentioned in the introduction, and the only unknown is $p$. In order for the equilibrium to hold (that is, solving this equation with respect to $p$) must yield $p=1/2$ in the standard case.

Update: One thing that could also help is looking at what happens with non-integer initial values. For instance, $y_1 \rightarrow 2, z_1 \rightarrow 5$. I did notice during this experiment that $y_1=1.2, z=5.3$ leads to the non standard case, with the limit converging to $3$ and $x$ being a rational number. So the definition of the standard case needs some refinement (at least to handle non-integer initial values) and examples leading to the non-standard case (though rare) may be more numerous than initially thought.

Another interesting chart

Here I try to build a second order approximation for the limit. Let

$$L(n)= \frac{1}{n}\left(z_{n+1} - \sum_{k=1}^n z_k\right) \tag{$\star$}$$

I am interested in the error $E(n) = \sqrt{n}\cdot \Big(L(n)-1\Big)$. In a traditional problem, you would expect $E(n)$ to tend to a constant as $n \rightarrow\infty$. Not here, the error behaves like a Brownian motion, again, just like in the central limit theorem. Yet the system is fully deterministic here. Note that I used $y(1)=2, z(1)=5$ to produce the chart below.

While I did not test it, I would expect that the same Brownian behavior occurs regardless of the initial conditions, as long as we are dealing with the standard case.

Also, if you look closely at the above chart, I believe it is NOT a true Brownian motion. It is too regular, and most importantly, the error seems to be bounded. The (apparently) bounded error, together with the non-dependence on the initial conditions (yet to be verified) for the probabilistic behavior, makes me think that this problem, after all, might be solvable. And maybe another way to sole this problem is to get good enough asymptotic expansions for $y_n$ and $z_n$.

Note

The recursion mentioned here is identical to the one featured in section 2.1 in this article after the change of variable $z_n = 4x_n + 1$. An additional change of variables, $w_n=z_n - 2y_n$, could prove useful.

A different approach

If the goal is to prove that that binary digits of $\sqrt{2}$ are uniformly distributed, a different approach is as follows. Consider the sequence $q_k=2^{-\{ k \log_2 3 \}}$ where the brackets represent the fractional part function. The number $q_k$ is rational, it has a period of $2 \cdot 3^{k-1}$ in its binary expansion, and the proportion of digits equal to zero is always 50%. The median of $\{q_1, q_2, \cdots q_n\}$ tends to $\sqrt{2}/2$ and if $n$ is odd, it is equal to one of the $q_k (1\leq k\leq n$): it is the middle value when these numbers are ordered. Thus the proportion of zero in the median is always exactly 50% if $n$ is odd. But is this also true at the limit as $n\rightarrow \infty$? Not necessarily, this is not the case if you consider the minimum or the maximum, instead of the median. So it is more complicated: some $q_k$ (infinitely many) must be removed to guarantee this fact, and they must be chosen carefully so as to not change the value of the limiting median.

One way to do this successfully is as follows. Keep only those $q_k$ that satisfy $|p_m(q_k) - \frac{1}{2}| < \frac{C}{\log m}$ for all $m=2,3, \cdots, \lfloor \log k\rfloor$ where $p_m(\alpha)$ is the proportion of digits of $\alpha$ that are equal to 1 among the first $m$ digits, and $C$ is a constant. Would this eventually eliminate $\sqrt{2}/2$? Probably not. Would this impact the limiting value of the median? Probably not. But these are extremely challenging questions. It is probably not hard to compute the exact proportion of $q_k$ that you eliminated by doing so. Yet it is not impossible that $\sqrt{2}/2$ can't be reached anymore after applying this thinning process, but only a neighboring irrational that shares (say) the same first $10^{10,000}$ digits.

Update: If the $q_k$ being removed were evenly spread (they may not), we are left with the same limit distribution (that of $2^{-X}$ where $X$ is uniform on $[0, 1]$). Thus its median $\sqrt{2}/2$ would stay the same. Or perhaps, after removing the set $S$ all those $q_k$, the median $M_n$ computed on $\{q_1,\cdots,q_n\} \setminus S$ still satisfies $|M_n - \frac{\sqrt{2}}{2}|<\frac{D}{\log n}$, where $D$ is a constant. That would be enough to prove the main result.

If you are only interested in finding a classic math constant $\alpha$ with 50% zeros and 50% ones in its binary expansion, then replace the median $M_n$ by the closest number to $\alpha$ among $\{ q_1, \cdots, q_n\}$. An example is $\alpha = \frac{1}{2 \log 2}$. This number is the expectation of the distribution in question, instead of the median. One interesting fact related to this number is the following. The $m$-th digit of $q_k$ averaged over all $k=1,2,\cdots$ is denoted as $\mu_m$. Its exact value is known and discussed in a previous MST question (here). For $m>1$, the sequence $\mu_m$ is strictly increasing and converges exponentially fast to $\frac{1}{2}$. An immediate consequence of the definition of $\mu_m$ is that $$\sum_{m=1}^\infty \frac{\mu_m}{2^m} = \frac{1}{2\log 2}.$$

Also, less obvious to prove but not difficult: we have $\mu_1=1$ and for $m>1$, we have:

$$\mu_m = \frac{1}{\log 2}\cdot\log \frac{2^{2^{m-1}} \cdot (2^{m-1})!^3 }{(2^{m-2})!^2 \cdot (2^m)!} .$$

More on this in my next article.

Conclusions and next steps

It looks like $\sqrt{n}\cdot\Big(L(n) - 1\Big) = O(1)$. This is enough to prove the main result in this discussion, but this asymptotic relationship has yet to be proved (or disproved). A weaker result, possibly easier to prove, is the following: $(\log n) \cdot \Big(L(n) - 1\Big) = o(1)$. This is also enough to prove the main result. See chart below illustrating this conjecture.

Another approach discussed in the section A different approach, does not seem to be easier.

A related question is whether or not, for two irrational numbers $\alpha, \beta$ linearly independent on the set of rational numbers, the correlation between the digits of $\alpha$ and $\beta$ in base $b$ is zero. To prove this, it suffices to prove that the sequences $\{ b^n\alpha\}$ and $\{ b^n\beta\}$ are not correlated. It was proved that this was true for the sequences $\{ n\alpha\}$ and $\{ n\beta\}$, see here. Unfortunately, this does not help. Here the brackets represent the fractional part function. But this epitomizes the issue that we are dealing with here. If $\alpha$ is irrational, then $\{n\alpha\}$ (sometimes called $n \alpha \mbox{ modulo } 1$) is equidistributed, regardless of $\alpha$ (irrational). But $\{b^n\alpha\}$ is equidistributed only for almost all irrational numbers. The first sequence is sometimes referred to as a universally good averaging sequence, while the latter is termed a universally bad averaging sequence: see this Wikipedia entry.

A weaker conjecture is the following: infinitely many irrational numbers $\sqrt{r}$ with $r$ a rational number, have 50% zeros and 50% ones in their binary expansion. You might be able to prove it without being able to name a single of these numbers satisfying this property (I tried and failed so far; it is probably still a mystery today.)

Finally, if you are interested to see how chaotic the behavior of non-normal numbers is (contrasted with supposedly normal numbers such as $\sqrt{2}$), read my recent MSE discussion, here. It gives some nice insights about what makes a number such as $\sqrt{2}, \pi, e$ or $\log 2$ stand out.

Update

The relation $\sqrt{n}\cdot\Big(L(n) - 1\Big) = O(1)$ is a direct consequence of the Berry-Essen theorem, a refinement of the central limit theorem that applies in this case. It would be true here if the digits of (say) $\sqrt{2}$ were i.i.d. with a Bernouilli distribution of parameter $p=\frac{1}{2}$, as they appear to be. Thus proving this asymptotic result for $\sqrt{2}$ would be very difficult at best, impossible at worst. But a weaker result might be reachable. Along the same lines, see a new question (with answer) that I posted recently on MSE, here, and also here. It shows what would happen if $\sqrt{2}$ was a well-behaved but non-normal number, say with 75% of its digits equal to $1$.

Update #2

I wrote an article based on this question and some other related questions, You can read here.

*"If you answer my question about the limit converging to $1$, say for $y_1=2$, $z_1=5$, then you proved that exactly 50% of the binary digits of $\sqrt{2}$ are zero."* (What if I answer your question with "no"? :) This seems pretty profound. Are you expecting an answer, or are you just proposing a research topic for someone? — Blue, Aug 17 '19 at 18:36
@Blue: I haven't tried to prove that the limit is $1$, so I don't know if it is easy or incredibly difficult, or impossible. The answer could be no (very unlikely), it could be $1.2$ instead of $1$, which would mean in that case, that 60% of the binary digits of $\sqrt{2$ are one. I don't expect an answer, but will accept any efforts that bring some lights on this, even if it is just a very useful reference to some work done before. — Vincent Granville, Aug 17 '19 at 18:49
Interesting stuff. ... Since comments are easily overlooked (and could get hidden), I recommend that you clearly state your (non-)expectations in the body of the question itself, along with some (or all) of your other remarks. — Blue, Aug 17 '19 at 20:13
Perhaps relevant here: it is unknown whether or not $\sqrt{2}$ is a normal number in any base, let alone base 2. So I doubt it's possible to qualitatively answer this question. Maybe there's a quantitative argument though, along the lines of "most numbers have (or don't have) a limit" in the sense of Lebesgue measure. — Alex R., Aug 20 '19 at 17:32
@AlexR: But if the limit exists, most likely it is 50% zero, 50% one. Now the limit may not exist, see e.g. $x=0.100111100000000111111111111111100..$. Or even some deeper thought: it is one of those statements that you can't prove or disprove? That is, you could have two mathematical theories, one in which the limit is (say) 50% zero, and one in which the limit is not 50% zero, and both theories being compatible with the Zermalo-Fraenkel framework. — Vincent Granville, Aug 20 '19 at 19:22
@AlexR.: The assertion about the $\sqrt{2}$ limit is merely that there's a 50-50 split of $1$s and $0$s, not that the number is *normal* (which further requires equal distributions of *every string* of $1$s and $0$s). I don't know how this affects the plausibility of the limits, however. — Blue, Aug 20 '19 at 20:18
@Blue: Sorry, I meant "simple" ("simply?") normal, and I'm not sure that we even know whether $\sqrt{2}$ is or isn't. — Alex R., Aug 20 '19 at 21:11
Let $\alpha \in (0, 1)$ be irrational and $z_1 > 1$. Set $y_1 = (2\alpha - 1)\alpha + z_1 \alpha$. (In OP's primary example, we have $\alpha = \sqrt{2}-1$ and $z_1 = 5$. This gives $y_1 = 2$.) Then for $y_n$ and $z_n$ defined by OP's algorithm, it seems to me that $$ z_n = 1 + 2^{n-1}(z_1 - 1) + 4\lfloor 2^{n-1}\alpha \rfloor $$ and that $2y_n - z_n > 0$ if and only if the $n$-th binary digit of $\alpha$ is $1$. I checked this for various inputs, but currently has no time to delve into the proof. I will provide an update when available. — Sangchul Lee, Aug 20 '19 at 23:21
Looks like $y_1=1.2, z_1=5.3$ leads to the non-standard case. — Vincent Granville, Aug 21 '19 at 18:43

Sextus Empiricus · Accepted Answer · 2020-02-20T19:50:28.943

I doubt that this way of expression/equation is gonna bring you much further.

Based on the answer to your other question we can relate the values of $z_n$ to the numbers $p_n$, which express as a series of binary fractions with increasingly more digits (where $n$ referes to the number of binary digits). That is: $$p_n = \sum_{k=0}^n a_n 2^{-n}$$ where $a_n$ are the binary digits of the root that is being estimated.

Your $z_n$ can be related to this and have the value:

$$z_n - 1 = p_n 2^{n+1}$$

The term $z_n$ is more or less the estimate with increasingly more binary digits multiplied by an increasing power of two.

Then this sum of $z_n$ (in your limit) can be seen visually/intuitive by putting it into a triangular form (algebraically you can see it as a double sum like, $\sum_{i=1}^n \sum_{j=1}^i $ where you reverse the order/direction)

$$\begin{array}{} p_0 & = & a_0 \frac{1}{1} \\ p_1 & = & a_0 \frac{1}{1} &+& a_1 \frac{1}{2} \\ p_2 & = & a_0 \frac{1}{1} &+& a_1 \frac{1}{2} &+& a_2 \frac{1}{4}\\ p_3 & = & a_0 \frac{1}{1} &+& a_1 \frac{1}{2} &+& a_2 \frac{1}{4} &+& a_3 \frac{1}{8} \\ p_4 & = & a_0 \frac{1}{1} &+& a_1 \frac{1}{2} &+& a_2 \frac{1}{4} &+& a_3 \frac{1}{8} &+& a_4 \frac{1}{16}\\ \end{array}$$

in terms of $z_i-1$

$$\begin{array}{r} z_0 -1 & = & a_0 \frac{1}{1} \\ z_1 -1 & = & a_0 \frac{2}{1} &+& a_1 \frac{1}{2} \\ z_2 -1 & = & a_0 \frac{4}{1} &+& a_1 \frac{4}{2} &+& a_2 \frac{4}{4}\\ z_3 -1 & = & a_0 \frac{8}{1} &+& a_1 \frac{8}{2} &+& a_2 \frac{8}{4} &+& a_3 \frac{8}{8} \\ z_4 -1 & = & a_0 \frac{16}{1} &+& a_1 \frac{16}{2} &+& a_2 \frac{16}{4} &+& a_3 \frac{16}{8} &+& a_4 \frac{16}{16} & \\ \hline \sum_{k=0}^{4} z_k -1 & = & a_0 (32-1) &+& a_1 (16-1) &+& a_2 (8-1) &+& a_3 (4-1) &+& a_4 (2-1) \\ \end{array}$$ which compares with

$$z_5 - 1 = a_0 32 + a_1 16 + a_2 8 + a_3 4 + a_4 2 + a_5 $$

So this is a different way to see why the difference $(z_n -1) - \sum_{k=0}^{n-1} (z_k -1) = \sum_{k=0}^{n} a_k $

I have written it down in this graphical form. Instead of using summation terms which I sometimes find a bit awkward. I hope the idea is clear (possibly I may have made some mistakes with counting the multiples of two, but the basic idea is that your $z_n$ is a power of $p_n$ and you can arrange them like above).

Your search for the limit with some terms $z_n$ is very directly related to a more trivial computation of $p_n$ but just multiplied with a factor $2^n$.

You have to start from somewhere. Some proof of the prime number theorem also starts with some "triviality" of the same level (the Legendre formula, see http://bit.ly/38KpOIN) but that's the starting point. — Vincent Granville, Feb 20 '20 at 19:10
@VincentGranville yes, possibly it may help to use this sort of recursion, and yes you need to start somewhere. The point of my post here is to mostly demystify the recursion. In the beginning, I was a bit confused why this recursion did what it did. But now I see it as a modified version of a recursion formula for a series $p_0, p_1, p_2, ...$ that represents the number expressed as a binary fraction rounded down to an increasingly number of digits. Your value $z_n$ is only equal to this series multiplied with a power of two. — Sextus Empiricus, Feb 20 '20 at 19:20
The charts that you showed seem promising, but basically they seem to be related to the fraction of zeros:ones divided by $n$. It does not represent something special that the recursion formula teaches us or makes clear to us. — Sextus Empiricus, Feb 20 '20 at 19:25
A next step (and it may lead to nowhere) is to compare the digits of (say) $\sqrt{2}$ and $\sqrt{6}$ or $3\sqrt{2}$. Cross-correlation is expected to be zero, but it might reveal something interesting, and the recursions do not explicitly involve irrational numbers (just integers) which is the nice thing. These recursions are also very simple (but computationally inefficient, better ones do exist) which makes it attractive as a starting point. — Vincent Granville, Feb 20 '20 at 19:36
The Berry-Essen bounds are useful in the sense that they provide an explicit formula for 2nd order behavior of the digit distribution. Interestingly, it does not yield a Brownian motion, but instead some king of bounded Brownian motion. Another thing is that when looking at a sequence $x_n$ that matches the digits of $\sqrt{2}$ at infinity, you need only to focus at the first $\log_2 n$ digits of $x_n$. — Vincent Granville, Feb 20 '20 at 19:36

Limit associated with a recursion

1 Answers1

Linked