Introduction
The Collatz conjecture has the infamy of a black hole. Young, promising mathematicians are sucked into it, never to be seen again, because this conjecture has been unsolved for so long (and is quite difficult, as expected).
The statement of the Collatz conjecture is that and a function operating on it repeatedly, where the sequence of will eventually reach
Additionally, the aforementioned sequence is referred to as the Collatz sequence.
As can be seen, it is simple to state, and yet there are still no solutions. A good introductory video to the conjecture can be found here: https://www.youtube.com/watch?v=094y1Z2wpJg by Veritasium.
Intuitively, the Collatz Conjecture makes sense. The of growth for in the function is implying that the conjecture does eventually converge at (Veritasium).
One might think that the growth rate is implying that sequences will eventually diverge, since either increases by a factor of or decreases by a factor of Since both cases happen with roughly equal probability (half of all numbers are odd, the other half are even), we can say that the overall growth factor is likely to be about
However, for all odd numbers, yields an even number (as 3 times an odd number is still odd, adding 1 makes it even). This means that we know the next move will be to divide the new value by 2, and then the parity of the number is random. This means that we can instead consider
And thus the growth rate is, on average, which does imply an eventual convergence.
Interesting Facts
There are several interesting observations on the Collatz conjecture that are also pretty basic and serve as a nice introduction.
- The Collatz conjecture can never yield a number that is a multiple of 3 except at the very beginning of the sequence, since all odd terms become and all even terms cannot gain a factor of This means, for example, that when looking at the numbers in Collatz sequences, only of the natural numbers could be used in the sequence.
Cycle
The Collatz conjecture includes the following:
The only positive integer cycle (where the repeated application of reaches a value previously reached) is As is evident, and
An important theorem that initially guided cycle length calculation follows. Define as the number of operations, and as the number of operations. If there is a cycle besides the 4-2-1 cycle, name it’s lowest element as greater than some number Then,
(Sinisalo 2003).
We can continue with the proof!
Denote the Collatz sequence of values in the cycle such that each consecutive term is the result of applying the recursive function on the previous value. Denote this sequence Additionally, note that is the result of the Collatz function applied to
Now, let’s look at the ratios of the successive terms in the sequence. This would be Trivially, this telescoping series is equal to 1.
Then, note that every one of the operations where the previous value is divided by two has a ratio of This means that these values, multiplied, give For the series above to equal 1, the product of the ratios of successive elements in each of the elements must also equal
Then, note that the ratio of successive elements for each of the elements is also greater than and less than (since we are operating ) and is the first and smallest value without loss of generality.
This means that the product of each of these ratios is bounded by and
Thus, we can construct the following inequality:
Taking of each section of the expression, and applying the log rule we get that and dividing all parts by we get that
This can be used for the following theorem:
Let be the rational number with least possible denominator such that Then, the least possible cycle length for a new cycle (not 4-2-1) is (Sinisalo 2003). A rigorous proof requires math outside the scope of this article, but can be found here! https://www.probleme-syracuse.fr/cargo/Sinisalo\_collatz.pdf.
However, a rough outline of the proof comes from taknig the floor function on this theorem and coming up with integer bounds for the values of and as well as noting that and using this to prove the theorem!
Then, calculating the actual value of gives The total length of a cycle is thus greater than
Thus, we have a huge lower bound, which is helpful! We are likely to be able to push this bound up even higher. In fact, Tomas Oliveira e Silva confirmed with a computer that all Collatz sequences with initial values up to converge to the 4-2-1 cycle.
Another significant advancement in looking at cycles came in 2019 when Terrence Tao proved essentially the name of his paper “Almost all Collatz orbits attain almost bounded values.” He proved the Collatz Conjecture for almost all numbers, and with the following theorem:
For any function of positive integers such that the minimal value of the Collatz sequence must be less than or equal to for the starting integer in the Collatz sequence (Tao 2019).
A simple example of this is just ascending integers: . This theorem thus proves that the minimal value in a Collatz sequence starting with is always less than
Stopping Times
The stopping time of a number as the amount of iterations before a number reaches
A lot of work has been done for the Collatz conjecture, but there has not been as much analysis on the actual stopping times - namely, which values are most common for the stopping time of a function and how the stopping times are viewed.
Graphing the stopping times with transitions is rather unhelpful due to the density of the lines.
Removing the connections between adjacent points, we can see the following graph of stopping times.
I calculated these stopping times just by running the recursive function, and with dynamic programming to speed up the process. Still, it was extremely fast, and easily replicable.
{L}{0.7 \textwidth}
In fact, we can predict that this pattern will probably continue to a higher number of x-values, too. It is a pretty pattern, and there seems to be a few different values that have the highest amount of stopping times - clustered around 105, and around 25. A paper by Lagarias from 1985 presents similar results for the range just beyond 1000, with a continued cluster at 23 and at 80, both different numbers in terms of primality and value. It’s strange!
To confirm the continued pattern, we can look at 10,000 datapoints.
As shown in the larger figure, the pattern does continue. Taking the log of the values in the hopes that it could clarify the stopping times gives the following graph, but the graph does not have a clearer pattern.
{R}{0.5 \textwidth} Log of 10,000 stopping times}
However, it is pretty cool to look at!
The stopping times have been calculated for all as of 2022, and for all of these values, the Collatz conjecture reached 1.