Exponential digit complexity beyond the Bugeaud--Kim threshold

[Authors]

MSC 2020: 11A63, 11J82 (primary), 68R15 (secondary)

Keywords: subword complexity, Mahler function, Sturmian word, lacunary series, irrationality exponent

Abstract

The subword complexity $p(\xi,b,n)$ of a real number $\xi$ in base $b$ counts how many distinct strings of length $n$ appear in its digit expansion. By a classical result of Morse--Hedlund, every irrational number satisfies $p(\xi,b,n) \ge n+1$ for every $n$, but proving anything stronger for an explicit constant is notoriously difficult: the only previously known results require the irrationality exponent $\mu(\xi)$ to be less than $2.510\ldots$ (the Bugeaud--Kim threshold [BK19]), or the digit-producing dynamics to have long stretches of purely periodic behaviour (the Bailey--Crandall hot spot method [BC02]).

We introduce an epoch-expansion technique that bypasses both barriers, and use it to prove that a broad family of lacunary sums --- constants of the form $\xi=\sum a_k/(c^{g(k)}b^{f(k)})$ with $\gcd(b,c)=1$ and rapidly increasing exponents $f$ --- have much richer digit structure than irrationality alone guarantees. Concretely, the number of distinct length- $n$ digit strings satisfies $p(\xi,b,n)-n\to+\infty$ , with quantitative lower bounds that mirror the growth of the spacing $f$ : quadratic spacing yields at least quadratically many strings, exponential spacing yields exponentially many, and so on.

Our strongest application concerns the twisted Mahler constants $M_{d,b,c}=\sum_{k\ge 0}1/(c^k b^{d^k})$ . When $d\ge 3$ and $c>d^2$ , these constants have irrationality exponent $\mu\ge d>2.510$ (so Bugeaud--Kim fails) and epoch-to-lattice ratio tending to zero (so hot spots fail), yet we obtain exponential complexity: $p(M_{d,b,c},b,n)\ge d^{n/\alpha-C}$ where $\alpha=\log_b c$ . Further applications include partial theta function values, Tschakaloff function values, Fibonacci-exponent constants, and Rogers false theta function values. To our knowledge, these are the first complexity results for explicit constants beyond both known barriers.

1. Introduction

Motivation: measuring the complexity of digit expansions

Write a real number $\xi$ in an integer base $b\ge 2$ :

$\xi = 0.x_1 x_2 x_3 \cdots, \qquad x_n \in \{0,1,\dots,b-1\}.$

One of the simplest questions one can ask about this expansion is: how many distinct patterns appear? The subword complexity function $p(\xi,b,n)$ counts the number of distinct blocks of $n$ consecutive digits $x_{i+1}x_{i+2}\cdots x_{i+n}$ that occur as $i$ ranges over all positions. For a rational number, the digit expansion is eventually periodic, so $p(n)$ is bounded. For an irrational number, a classical theorem of Morse and Hedlund [MH38] gives $p(n)\ge n+1$ for every $n$ , and this bound is sharp: it is attained exactly by the Sturmian words, a family of aperiodic binary sequences that arise by coding an irrational rotation of the circle (see Lothaire [Lo02] for a comprehensive account). At the other extreme, a number is normal in base $b$ (in the sense of Borel [Bor09]) if every block of digits appears with the expected frequency $b^{-n}$ ; this forces $p(n)=b^n$ , the maximum possible value. Almost every real number is normal in every base, but proving normality---or even any growth of $p(n)$ beyond $n+1$ ---for a specific constant such as $\pi$ , $e$ , or $\sqrt{2}$ is extraordinarily difficult.
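The definition of $p(n)$ can be tried out on finite prefixes. The following Python sketch is illustrative only: the helper names `subword_complexity` and `fibonacci_word` are ours, and a finite prefix can only give a lower bound for the complexity of the infinite expansion. It contrasts a periodic expansion (the base-10 digits of $1/7$) with the Fibonacci word, a standard Sturmian example.

```python
def subword_complexity(word, n):
    """Number of distinct length-n blocks occurring in a finite word."""
    return len({tuple(word[i:i + n]) for i in range(len(word) - n + 1)})


def fibonacci_word(length):
    """Prefix of the Fibonacci word, the fixed point of 0 -> 01, 1 -> 0."""
    w = [0]
    while len(w) < length:
        w = [y for x in w for y in ((0, 1) if x == 0 else (0,))]
    return w[:length]


periodic = [1, 4, 2, 8, 5, 7] * 50   # base-10 digits of 1/7
sturmian = fibonacci_word(2000)

for n in range(1, 8):
    # periodic stays bounded; the Sturmian prefix shows n + 1 blocks
    print(n, subword_complexity(periodic, n), subword_complexity(sturmian, n))
```

On these prefixes the periodic word should show a bounded count while the Fibonacci word shows $n+1$ blocks, matching the two extremes described above.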

What is known

There are essentially two prior techniques for proving that the digits of a given constant are more complex than the Morse--Hedlund minimum.

The Diophantine method. Ferenczi and Mauduit [FM97] first connected low subword complexity to transcendence: they showed that a number whose base- $b$ expansion satisfies $p(n) = n+1$ for all $n$ (a Sturmian number) is necessarily transcendental. Adamczewski and Bugeaud [AB07], using the deep Schmidt Subspace Theorem from Diophantine approximation, proved a much stronger result: the digit expansion of every algebraic irrational number $\xi$ satisfies $p(\xi,b,n)/n\to\infty$ for every base $b$ . For transcendental constants, the key parameter governing what can be proved is the irrationality exponent

$\mu(\xi) = \sup\Bigl\{\mu : |\xi - p/q| < q^{-\mu} \text{ for infinitely many } p/q\in\mathbb{Q}\Bigr\},$

which measures how well $\xi$ can be approximated by rationals. Bugeaud and Kim [BK19] showed that if $\mu(\xi) < \mu_0 := (25+4\sqrt{10})/15 \approx 2.510$ , then $p(\xi,b,n) - n \to +\infty$ . The reasoning is: a small irrationality exponent forces the digit expansion to avoid long near-repetitions; this is incompatible with the rigid self-similar structure of quasi-Sturmian words (sequences with $p(n)\le n+C$ for a constant $C$ , characterized by Cassaigne [Ca97] as the image of a Sturmian word under a morphism); and the resulting structural contradiction yields complexity growth. The threshold $\mu_0=2.510\ldots$ is sharp: Bugeaud and Kim construct Sturmian numbers with $\mu = \mu_0$ and $p(n)=n+1$ for all $n$ .

Prior to the present work, this method was the only known technique for proving $p(n)-n\to\infty$ for explicit transcendental constants. It covers Euler's number $e$ and its relatives $e^{1/m}$ , $\tanh(1/m)$ , and certain Bessel function quotients---all of which have $\mu=2$ , well below the threshold.

The hot spot method. Bailey and Crandall [BC02] developed a completely different approach that proves a much stronger conclusion, full normality, for a special class of constants. The Stoneham numbers $\alpha_{b,c} = \sum_{k\ge 1} 1/(c^k b^{c^k})$, where $b$ and $c$ are coprime, have the property that successive nonzero terms are exponentially far apart. Between two consecutive terms, the digit-generating iteration $z_n = \{b \cdot z_{n-1}\}$ (fractional part of $b$ times the previous orbit point) evolves without perturbation, giving long "clean" stretches during which the orbit is algebraically controlled. Bailey and Crandall show that no short subinterval of $[0,1)$ is visited disproportionately often by this orbit (a "hot spot"), and this implies normality. The crucial structural requirement is that the epoch length (the number of unperturbed steps between consecutive terms of the series) is at least proportional to the lattice denominator of the orbit during that epoch.

The gap

Both methods leave large classes of constants untouched. The Diophantine method requires $\mu(\xi)<2.510$ , yet for most transcendental constants the irrationality exponent is either unknown or too large: for $\pi$ , the best bound is $\mu(\pi)\le 7.103$ (Zeilberger--Zudilin [ZZ20]), far above the threshold. The hot spot method requires the epoch-to-lattice ratio to remain bounded below by a positive constant, and this fails for series whose terms are more densely packed than in the Stoneham case.

Between these two regimes sits a large family of lacunary constants---sums whose nonzero terms are located at positions that grow faster than linearly but not exponentially (such as quadratic or polynomial positions)---for which no technique existed for proving anything about digit complexity beyond the trivial bound $p(n)\ge n+1$ .

Our contribution: the epoch-expansion method

In this paper we introduce the epoch-expansion method, a technique for proving lower bounds on subword complexity that requires no information about the irrationality exponent and is orthogonal to the hot spot approach.

The key observation is elementary. Consider a constant of the form $\xi = \sum a_k / (c^{g(k)} b^{f(k)})$, where $b$ and $c$ are coprime and the exponents $f(k)$ grow with increasing gaps $f(k\!+\!1)-f(k) \to \infty$. The digit expansion of $\xi$ is naturally partitioned into epochs: long runs of digit positions between the locations $f(k)$ and $f(k\!+\!1)$ of successive terms. Inside each epoch, the orbit point $z_n = \{b^n \xi\}$, whose base-$b$ digits determine the digit expansion of $\xi$ from position $n$ onward, decomposes as a rational number with denominator $c^K$ plus a negligibly small tail. The coprimality $\gcd(b,c)=1$ places the numerator of this rational part on a $c$-adic lattice, and a key arithmetic observation controls how long two orbit points can produce identical digits: if positions $n$ and $n+s$ lie in the same epoch, they share at most $v_c(b^s-1)\cdot \log_b c + O(1)$ leading digits, where $v_c$ denotes the $c$-adic valuation. This bound grows only logarithmically in the epoch index, while the epochs themselves grow without bound. Eventually the epochs are long enough to contain more pairwise distinguishable starting positions than any quasi-Sturmian word can accommodate, and the complexity $p(n)$ is forced to exceed $n+C$ for every constant $C$.

Results

We present our results in order of increasing generality, developing the technique first for the partial theta values

$\gamma_{b,c}:=\sum_{k\ge 1}\frac{1}{c^k b^{k^2}}, \qquad b\ge 2,\ c\ge 2,\ \gcd(b,c)=1,$

which serve as the motivating case. These constants are values of a partial Jacobi theta function (related to the $q$ -series $\Theta^+(q,z) = \sum_{k\ge 0} q^{k^2} z^k$ ); they escape both prior methods, since the epoch-to-lattice ratio tends to zero (defeating hot spots) and no bound on $\mu(\gamma_{b,c})$ is available (so the Diophantine method cannot be applied).

Theorem 1.1 (Qualitative complexity). For all coprime integers $b\ge 2$ and $c\ge 2$ ,

$p(\gamma_{b,c},b,n)-n\longrightarrow +\infty \qquad (n\to\infty).$

Theorem 1.2 (Quadratic lower bound). If $\alpha=\log_b c<2$ , then there exists $n_0=n_0(b,c)$ such that

$p(\gamma_{b,c},b,n)\ge \frac{(2-\alpha)^2}{16\alpha^2}\,n^2 \qquad\text{for all } n\ge n_0.$

(We write $\alpha = \log_b c$ throughout the paper.) For a more striking application, we turn to the twisted Mahler constants. For integers $d\ge 2$ , $b\ge 2$ , $c\ge 2$ with $\gcd(b,c)=1$ , define

$M_{d,b,c} \;:=\; \sum_{k\ge 0}\frac{1}{c^k\,b^{\,d^k}}.$

These arise from the Mahler function $h(x) = \sum_{k\ge 0} x^{d^k}$ , which satisfies $h(x) = x + h(x^d)$ ; see Nishioka [Ni96] for background on Mahler functions in transcendence theory. When $d\ge 3$ and $c > d^2$ , these constants lie beyond the reach of both prior approaches simultaneously. On the Diophantine side, $\mu(M_{d,b,c}) = d$ (proved via the formal continued fraction of the associated Mahler function; see Badziahin [Bad19] and Rajchert [Raj24]), so $\mu \ge 3 > 2.510$ and the Bugeaud--Kim threshold is exceeded. On the hot spot side, Bailey--Crandall [BC02] prove normality of $M_{d,b,c}$ only when $d > \sqrt{c}$ ; when $c>d^2$ this condition fails. Nevertheless:

Theorem 1.3 ( $\mu$ -bypass). For all $d\ge 2$ , $b\ge 2$ , $c\ge 2$ with $\gcd(b,c)=1$ ,

$p(M_{d,b,c},\,b,\,n)-n\;\longrightarrow\;+\infty \qquad(n\to\infty).$

When $d < c$ , the exponential growth of the epoch lengths $L_K = (d-1)d^K$ allows a much stronger, quantitative conclusion:

Theorem 1.4 (Exponential lower bound). For all $d\ge 2$ , $b\ge 2$ , $c\ge 2$ with $d<c$ and $\gcd(b,c)=1$ , there exists a constant $C=C(d,b,c)$ such that

$p(M_{d,b,c},\,b,\,n) \;\ge\; d^{\,n/\alpha\,-\,C}$

for all sufficiently large $n$ .

For comparison, the strongest quantitative result available from the Bugeaud--Kim method is $\limsup\, p(n)/n\ge 4/3$ [BKK25], which applies only to constants satisfying a stringent irrationality exponent condition. Theorem 1.4 gives exponential growth in a regime where that method is provably inapplicable (see Remark 8.1).

All of the above are special cases of a general theorem. For integers $b\ge 2$, $c\ge 2$ with $\gcd(b,c)=1$, let $f\colon\mathbb{N}\to\mathbb{N}$ be strictly increasing with $L_K:=f(K\!+\!1)-f(K)\to\infty$, let $g\colon\mathbb{N}\to\mathbb{N}$ be strictly increasing with $g(K)\to\infty$ and $g(K)=o(f(K))$, and let $(a_k)_{k\ge 1}$ be integers with $\gcd(a_k,c)=1$ for all $k$. Assume further that

$v_c(b^s-1)<g(K) \qquad\text{for all }1\le s\le L_K\text{ and all large }K. \tag{1}$

(A sufficient condition is $\log L_K = o(g(K))$ , since $v_c(b^s-1)=O(\log s)$ . When $f$ grows exponentially, (1) can instead be verified via multiplicative order estimates; see Corollary 7.2 and (8).) Define

$\xi\;=\;\sum_{k\ge 1}\frac{a_k}{c^{g(k)}\,b^{f(k)}}.$

Theorem 1.5 (General epoch-expansion theorem). Under the hypotheses above, $p(\xi,b,n)-n\to+\infty$ . Moreover, there exist constants $C_0=C_0(b,c)$ and $K_0=K_0(b,c,f,g)$ such that for all sufficiently large $n$ ,

$p(\xi,b,n)\;\ge\;\sum_{K=A_n}^{B_n}(L_K - n + 1),$

where $A_n=\min\{K\ge K_0: L_K\ge n\}$ and $B_n=\max\{K: g(K)\alpha + C_0 < n\}$. When $B_n < A_n$ (which can happen for slowly growing $f$ with $\alpha \ge 2$), the qualitative conclusion $p(\xi,b,n)-n\to+\infty$ still holds via the Cassaigne argument of Theorem 1.1, applied with $g(K)$ in place of $K$.

The quantitative bound mirrors the spacing function: quadratic spacing ( $f(k)=k^2$ ) gives $p(n)\ge c_0 n^2$ ; exponential spacing ( $f(k)=d^k$ ) gives $p(n)\ge d^{n/\alpha-C}$ . Theorems 1.1, 1.2, and 1.4 are corollaries of Theorem 1.5 (see Section 7); Theorem 1.3 requires an independent argument given in Section 8.
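To see how the sum in Theorem 1.5 produces a quadratic bound, one can evaluate it for quadratic spacing $f(k)=k^2$, $g(k)=k$, where $L_K=2K+1$. The sketch below is a numerical illustration only. The defaults `K0=1` and `C0=0` are placeholder assumptions, since the theorem asserts only that suitable constants exist, and the function name is ours.

```python
import math


def epoch_sum_lower_bound(n, b, c, K0=1, C0=0):
    """Evaluate sum_{K=A_n}^{B_n} (L_K - n + 1) for f(k)=k^2, g(k)=k.

    K0 and C0 are hypothetical stand-ins for the constants of Theorem 1.5.
    """
    alpha = math.log(c, b)
    # A_n = min{K >= K0 : 2K + 1 >= n}
    A = max(K0, math.ceil((n - 1) / 2))
    # B_n = max{K : K * alpha + C0 < n}
    B = math.floor((n - C0) / alpha)
    if B * alpha + C0 >= n:          # guard against a floating-point edge case
        B -= 1
    return sum(2 * K + 1 - n + 1 for K in range(A, B + 1))


b, c, n = 2, 3, 100
alpha = math.log(c, b)
print(epoch_sum_lower_bound(n, b, c))                 # epoch sum with C0 = 0
print((2 - alpha) ** 2 / (16 * alpha ** 2) * n ** 2)  # bound of Theorem 1.2
```

With these placeholder constants the epoch sum is asymptotically $(2-\alpha)^2 n^2/(4\alpha^2)$, so the constant $1/16$ in Theorem 1.2 leaves room for the unspecified $C_0$ and $K_0$.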

Proof architecture

We present the proofs for the partial theta values $\gamma_{b,c}$ independently of the general theorem, because they introduce the key ideas in a concrete setting. The architecture of the general proof is as follows.

All results rest on the epoch-expansion framework, whose applicability is determined by the structural properties of $\xi=\sum a_k/(c^{g(k)}b^{f(k)})$ : coprime denominator ( $\gcd(b,c)=1$ , giving a $c$ -adic lattice structure for the orbit); increasing gaps ( $L_K \to \infty$ , creating epochs of growing length); single-modulus lattice (all terms share the modulus $c$ , making $v_c(b^s-1)$ the sole obstruction to digit agreement); and valuation control (condition (1)).

Given these properties, the proof operates in two modes. For the qualitative conclusion ( $p(n)-n\to\infty$ ), we argue by contradiction: if $p(n)-n$ were bounded, Cassaigne's theorem [Ca97] would force the digit sequence to be quasi-Sturmian, producing long self-matches at certain gaps. We place such a matched pair inside a single large epoch and show that the within-epoch valuation bound limits the match length to $O(K\alpha)$ , contradicting the Sturmian match length $\gg q_{k+1}$ since $K=O(\log q_{k+1})$ . For the quantitative conclusion, we count distinct length- $n$ subwords directly: each epoch of length $L_K \ge n$ contributes $\sim L_K$ starting positions whose length- $n$ subwords are pairwise distinct (by the valuation bound within epochs, and by the lattice gap across epochs), and summing over eligible epochs yields the lower bound.

Further applications

Beyond partial theta values and twisted Mahler constants, Theorem 1.5 applies uniformly to every lacunary constant satisfying the hypotheses above:

(a) Tschakaloff function values $T_b(1/c)=\sum_{k\ge 1}1/(c^k b^{k(k+1)/2})$ ( $f(k)=k(k+1)/2$ , $L_K=K+1$ ; quadratic bound).

(b) Fibonacci-exponent constants $\sum_{k\ge 1}1/(c^k b^{F_k})$ ( $f(k)=F_k$ , $L_K=F_{K-1}$ ; exponential bound $p(n)\ge \varphi^{n/\alpha-C}$ when $\alpha$ is sufficiently small).

(c) Signed coefficients $a_k\in\{\pm 1\}$ ( $\gcd(\pm 1,c)=1$ always); this covers values of Rogers false theta functions at rational arguments.
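The epoch lengths $L_K=f(K+1)-f(K)$ quoted for these families are easy to confirm. A minimal Python sketch follows; the helper names are ours, and the Fibonacci numbers are indexed with $F_1=F_2=1$, which is an assumption about the paper's convention.

```python
def epoch_lengths(f, K_max):
    """Return [L_1, ..., L_{K_max}] with L_K = f(K+1) - f(K)."""
    return [f(K + 1) - f(K) for K in range(1, K_max + 1)]


def fib(k):
    """Fibonacci numbers with F_1 = F_2 = 1 (assumed convention), k >= 1."""
    a, b = 1, 1
    for _ in range(k - 1):
        a, b = b, a + b
    return a


d = 3
print(epoch_lengths(lambda k: k * k, 6))             # partial theta: 2K + 1
print(epoch_lengths(lambda k: k * (k + 1) // 2, 6))  # Tschakaloff: K + 1
print(epoch_lengths(fib, 8))                         # Fibonacci: F_{K-1} for K >= 2
print(epoch_lengths(lambda k: d ** k, 6))            # Mahler: (d - 1) d^K
```

Linear growth of $L_K$ corresponds to the quadratic bounds, and exponential growth to the exponential bounds.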

Organization

Section 2 collects notation and the Sturmian input. Section 3 gives the epoch decomposition of the orbit $\{b^n\gamma_{b,c}\}$. Section 4 proves the within-epoch and cross-epoch match bounds. Theorems 1.1 and 1.2 are proved in Sections 5 and 6. Section 7 states and proves the general epoch-expansion theorem (Theorem 1.5) and derives applications to Tschakaloff function values, Fibonacci-exponent constants, and signed-coefficient series. Section 8 treats twisted Mahler constants, proving Theorems 1.3 and 1.4; this includes the exponential bound in the $\mu$-bypass regime $d\ge 3$, $c>d^2$ where both prior methods are provably inapplicable. Section 9 discusses open questions.

2. Preliminaries

Throughout the paper, $b\ge 2$ and $c\ge 2$ are fixed coprime integers, and

$\alpha=\log_b c>0.$

2.1. Digits, orbit points, and complexity

For $x\in\mathbb{R}$, we write $\lfloor x \rfloor$ for the floor and $\{x\}=x-\lfloor x \rfloor$ for the fractional part.

We consider

$\gamma_{b,c}=\sum_{k\ge 1}\frac{1}{c^k b^{k^2}},$

and its orbit under multiplication by $b$ :

$z_n:=\{b^n\gamma_{b,c}\},\qquad n\ge 0.$

The base- $b$ digit sequence of $\gamma_{b,c}$ is then

$x_n=\lfloor b z_{n-1} \rfloor,\qquad n\ge 1,$

so that

$\gamma_{b,c}=\sum_{n\ge 1} x_n b^{-n}.$
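The orbit recursion $x_n=\lfloor b z_{n-1}\rfloor$ can be run in exact rational arithmetic. The sketch below is an illustration under one assumption: the series is truncated at an index $K$ with $K^2$ larger than the number of requested digits. The omitted terms only add to the small tail of the orbit point (the quantity called $T_n$ in Lemma 3.1), so they should not change the requested digits. The function names are ours.

```python
from fractions import Fraction
import math


def partial_sum(b, c, K):
    """S_K = sum_{k=1}^{K} 1 / (c^k b^(k^2)) as an exact fraction."""
    return sum(Fraction(1, c**k * b**(k * k)) for k in range(1, K + 1))


def partial_theta_digits(b, c, num_digits):
    """First num_digits base-b digits of gamma_{b,c}, via x_n = floor(b z_{n-1})."""
    K = math.isqrt(num_digits) + 1   # truncation index with K^2 > num_digits
    z = partial_sum(b, c, K)         # z_0 (truncated)
    digits = []
    for _ in range(num_digits):
        z *= b
        x = math.floor(z)            # next digit x_n
        digits.append(x)
        z -= x                       # new orbit point z_n = {b z_{n-1}}
    return digits


print(partial_theta_digits(2, 3, 16))
```

Exact fractions avoid the rounding drift that floating-point arithmetic would introduce after a few dozen doublings.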

Definition 2.1. For $n\ge 1$ , the subword complexity of $\gamma_{b,c}$ in base $b$ is

$p(\gamma_{b,c},b,n) = \#\bigl\{x_{i+1}x_{i+2}\cdots x_{i+n}: i\ge 0\bigr\}.$

When the parameters are clear, we abbreviate this to $p(n)$ .

2.2. Match length

Definition 2.2. Let $\mathbf{w}=(w_n)_{n\ge 1}$ be an infinite word over a finite alphabet. For $n\ge 1$ and $g\ge 1$ , the match length at position $n$ with gap $g$ is

$\operatorname{Match}_{\mathbf{w}}(n,g):=\sup\bigl\{L\ge 0 : w_{n+i}=w_{n+g+i}\ \text{for all } 0\le i<L\bigr\}.$

For the digit sequence $\mathbf{x}=(x_n)$ of $\gamma_{b,c}$, we write

$M(n,g):=\operatorname{Match}_{\mathbf{x}}(n+1,g), \qquad n\ge 0,\ g\ge 1.$

Equivalently, $M(n,g)$ is the largest $L\ge 0$ such that the first $L$ base- $b$ digits of $z_n$ and $z_{n+g}$ coincide.
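On a finite prefix the match length can be computed by direct comparison. The sketch below reads the definition as counting the symbols that agree when the word is compared with its shift by $g$, starting at position $n$; this reading is consistent with the description of $M(n,g)$ just given. The helper names are ours, and on a finite prefix the count stops at the end of the data.

```python
def match_length(w, n, g):
    """Match length at 1-indexed position n with gap g in a finite word w.

    Counts i >= 0 with w_{n+i} == w_{n+g+i}, stopping at the first mismatch
    or at the end of the available prefix.
    """
    L = 0
    while n + g + L <= len(w) and w[n - 1 + L] == w[n - 1 + g + L]:
        L += 1
    return L


def M(x, n, g):
    """M(n, g) = Match_x(n + 1, g): shared leading digits of z_n and z_{n+g}."""
    return match_length(x, n + 1, g)


w = [0, 1, 0, 1, 0, 0]
print(match_length(w, 1, 2))   # compares 0101.. with 0100..
```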

2.3. Square epochs

For $K\ge 1$ , define epoch $K$ to be the interval of positions

$[K^2,(K+1)^2).$

Its length is

$L_K=(K+1)^2-K^2=2K+1.$

If $n$ lies in epoch $K$ , we write

$e_n:=(K+1)^2-n, \qquad 1\le e_n\le 2K+1,$

for the distance from $n$ to the next epoch boundary.
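Locating a position inside the square epochs is a one-line computation with the integer square root. A small Python sketch, with a function name of our choosing:

```python
import math


def square_epoch(n):
    """Return (K, e_n) for a position n >= 1: K^2 <= n < (K+1)^2, e_n = (K+1)^2 - n."""
    K = math.isqrt(n)
    return K, (K + 1) ** 2 - n


for n in (9, 12, 15, 16):
    print(n, square_epoch(n))
```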

2.4. $c$ -adic valuation

Let

$c=p_1^{a_1}\cdots p_r^{a_r}$

be the prime factorization of $c$ . For a nonzero integer $m$ , define

$v_c(m):= \min_{1\le i\le r}\left\lfloor\frac{v_{p_i}(m)}{a_i}\right\rfloor,$

so that $v_c(m)$ is the largest $k\ge 0$ such that $c^k\mid m$ . We set $v_c(0)=+\infty$ .
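The two descriptions of $v_c$ (largest power of $c$ dividing $m$, and the minimum over the prime factors of $c$) can be compared numerically. The following Python sketch uses trial division to factor $c$, which is adequate only for small $c$; the function names are ours.

```python
import math


def v_c(m, c):
    """Largest k >= 0 with c^k | m; v_c(0) = +infinity."""
    if m == 0:
        return math.inf
    m, k = abs(m), 0
    while m % c == 0:
        m //= c
        k += 1
    return k


def factorize(c):
    """Prime factorization of c >= 2 as {p: a} by trial division."""
    factors, p = {}, 2
    while p * p <= c:
        while c % p == 0:
            factors[p] = factors.get(p, 0) + 1
            c //= p
        p += 1
    if c > 1:
        factors[c] = factors.get(c, 0) + 1
    return factors


def v_c_from_primes(m, c):
    """min over p_i | c of floor(v_{p_i}(m) / a_i)."""
    if m == 0:
        return math.inf
    return min(v_c(m, p) // a for p, a in factorize(c).items())


print(v_c(72, 12), v_c_from_primes(72, 12))
print([v_c(2**s - 1, 3) for s in range(1, 19)])   # valuations of b^s - 1
```

The second printed list shows how slowly $v_c(b^s-1)$ grows with $s$, which is the behaviour the match bounds in Section 4 rely on.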

2.5. Cassaigne's theorem

A Sturmian word is an aperiodic binary word of minimal complexity, namely $p(n)=n+1$ for all $n\ge 1$ . A word is quasi-Sturmian if its complexity is bounded above by $n+C$ for some constant $C$ .

We use the following theorem of Cassaigne.

Theorem 2.3 (Cassaigne [Ca97]). For a non-eventually-periodic infinite word $\mathbf{u}$ over a finite alphabet, the following are equivalent:

(i) $\liminf_{n\to\infty}(p_{\mathbf{u}}(n)-n)<\infty$ ;

(ii) $\mathbf{u}$ is eventually quasi-Sturmian, i.e. there exist a finite word $W$, a Sturmian word $\mathbf{s}$ over $\{0,1\}$, and a morphism

$\varphi:\{0,1\}\to \mathcal{A}^*$

such that

$\mathbf{u}=W\,\varphi(\mathbf{s}).$

2.6. Two Sturmian lemmas

The first lemma is the source of long local self-matches in a Sturmian word. Here $\operatorname{Match}_{\mathbf{s}}(n,g)$ denotes the match length of Definition 2.2 applied to the Sturmian word $\mathbf{s}$.

Lemma 2.4 (Strong Sturmian match). Let $\mathbf{s}$ be a Sturmian word of slope $\beta$ , and let $q_k$ be the denominators of the convergents of $\beta$ . For all sufficiently large $k$ , there exists

$n_k\in [q_{k+1}/2,\ q_{k+1})$

such that

$\operatorname{Match}_{\mathbf{s}}(n_k+1,q_k)\ge \lfloor q_{k+1}/16\rfloor.$

Proof. Let $\varepsilon_k=\|q_k\beta\|$ denote the distance from $q_k\beta$ to the nearest integer. By continued-fraction theory,

$\frac{1}{q_{k+1}+q_k}<\varepsilon_k<\frac{1}{q_{k+1}},$

hence $\varepsilon_k>1/(2q_{k+1})$ for large $k$ .

A mismatch at shift $q_k$ among the first $m$ symbols can occur only if the starting point of the coding orbit falls within distance $\varepsilon_k$ of one of the $m$ discontinuity preimages. Thus the set of bad starting points is covered by at most $m$ intervals of total length at most $2m\varepsilon_k$ .

Among the $q_{k+1}$ points $\{j\beta\}$, $0\le j<q_{k+1}$, the minimum spacing is at least $\varepsilon_k>1/(2q_{k+1})$. Therefore the number of bad starting points among these $q_{k+1}$ orbit points is at most

$2m\varepsilon_k\cdot 2q_{k+1}+m\le 5m.$

Take $m=\lfloor q_{k+1}/16 \rfloor$ . Then at least

$q_{k+1}-5m\ge \frac{11}{16}q_{k+1}$

starting points have match length at least $m$ . In particular, one such point lies in $[q_{k+1}/2,\ q_{k+1})$ . ∎

The second lemma is a standard recurrence property of Sturmian words.

Lemma 2.5 (Recurrence of Sturmian factors). Let $\mathbf{s}$ be a Sturmian word with convergent denominators $q_k$ . For $\ell \in [q_k, q_{k+1})$ , the recurrence function satisfies

$R(\ell) \;\le\; q_{k+1} + q_k - 1.$

In particular, every factor of length $\ell$ occurs in every block of length $q_{k+1} + q_k$ .

Proof. See [Lo02, Ch. 2, Prop. 2.2.22]. ∎

3. Epoch decomposition and irrationality

We begin with the basic decomposition of $z_n$ inside a square epoch.

Lemma 3.1 (Epoch decomposition). Let $n$ lie in epoch $K$ , so that $K^2\le n<(K+1)^2$ . Then

$z_n=\frac{P_n}{c^K}+T_n, \tag{2}$

where:

(a) $P_n$ is an integer with $0\le P_n<c^K$ and

$P_n\equiv b^{\,n-K^2}\pmod c,$

hence $\gcd(P_n,c)=1$ ;

(b)

$T_n=\sum_{t\ge 1}\frac{1}{c^{K+t} b^{u_{n,t}}}, \qquad u_{n,t}:=(K+t)^2-n =e_n+(t-1)(2K+t+1).$

In particular,

$0<T_n\le \frac{16}{15}\,\frac{1}{c^{K+1}b^{e_n}}, \tag{3}$

and

$0<T_n-\frac{1}{c^{K+1}b^{e_n}} \le \frac{16}{15}\,\frac{1}{c^{K+2}b^{e_n+2K+3}}. \tag{4}$

Proof. Write

$b^n\gamma_{b,c} = \sum_{j=1}^{K}\frac{b^{n-j^2}}{c^j} + \sum_{j>K}\frac{b^{n-j^2}}{c^j}.$

Set

$Q_n:=\sum_{j=1}^{K} c^{K-j} b^{n-j^2}\in \mathbb{Z}.$

Then

$b^n\gamma_{b,c}=\frac{Q_n}{c^K}+T_n, \qquad T_n:=\sum_{j>K}\frac{b^{n-j^2}}{c^j}.$

Let $P_n$ be the residue of $Q_n$ modulo $c^K$ in $[0,c^K)$ . By (3) below and $e_n \ge 1$ , $0 < T_n < 1/c^K$ . Since $0 \le P_n < c^K$ , we have $P_n/c^K + T_n < (c^K - 1)/c^K + 1/c^K = 1$ . Taking fractional parts therefore yields

$z_n=\frac{P_n}{c^K}+T_n.$

Modulo $c$ , every term in $Q_n$ with $j<K$ vanishes, while the term $j=K$ equals $b^{n-K^2}$ . Thus

$P_n\equiv Q_n\equiv b^{n-K^2}\pmod c,$

and since $\gcd(b,c)=1$ , we also have $\gcd(P_n,c)=1$ .

For the tail, write $j=K+t$ with $t\ge 1$ . Then

$T_n = \sum_{t\ge 1}\frac{1}{c^{K+t} b^{(K+t)^2-n}} = \sum_{t\ge 1}\frac{1}{c^{K+t} b^{u_{n,t}}}.$

Since

$u_{n,t}=e_n+(t-1)(2K+t+1)\ge e_n+(t-1)(2K+3),$

we get

$T_n \le \frac{1}{c^{K+1}b^{e_n}} \sum_{t\ge 0}\left(\frac{1}{cb^{2K+3}}\right)^t.$

As $cb^{2K+3}\ge 16$ , the geometric series is bounded by $16/15$ , which proves (3).

The same argument starting from $t=2$ gives

$T_n-\frac{1}{c^{K+1}b^{e_n}} \le \frac{1}{c^{K+2}b^{e_n+2K+3}} \sum_{u\ge 0}\left(\frac{1}{cb^{2K+5}}\right)^u \le \frac{16}{15}\,\frac{1}{c^{K+2}b^{e_n+2K+3}}.$

This proves (4). ∎
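Lemma 3.1 can be checked in exact arithmetic on a truncated series. The sketch below is illustrative only: it replaces $\gamma_{b,c}$ by the partial sum up to an index $J>K$ (the parameter `extra_terms` is our assumption), for which the decomposition (2), the congruence in (a), and the bound (3) should all hold exactly as stated. The function name is ours.

```python
from fractions import Fraction
import math


def check_epoch_decomposition(b, c, n, extra_terms=6):
    """Check (2), part (a) and bound (3) of Lemma 3.1 for a truncated series, n >= 1."""
    K = math.isqrt(n)                      # n lies in epoch K
    J = K + extra_terms                    # truncation index (assumption)
    gamma_J = sum(Fraction(1, c**j * b**(j * j)) for j in range(1, J + 1))
    x = b**n * gamma_J
    z = x - math.floor(x)                  # orbit point z_n (truncated series)
    Q = sum(c**(K - j) * b**(n - j * j) for j in range(1, K + 1))
    P = Q % c**K                           # lattice numerator P_n
    T = sum(Fraction(1, c**j * b**(j * j - n)) for j in range(K + 1, J + 1))
    e = (K + 1) ** 2 - n                   # distance to the next epoch boundary
    return {
        "decomposition": z == Fraction(P, c**K) + T,
        "residue": P % c == pow(b, n - K * K, c),
        "tail_bound": 0 < T <= Fraction(16, 15) / (c**(K + 1) * b**e),
    }


print(check_epoch_decomposition(2, 3, 10))
```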

Lemma 3.2. The number $\gamma_{b,c}$ is irrational.

Proof. Let

$S_K:=\sum_{k=1}^{K}\frac{1}{c^k b^{k^2}}.$

Its denominator divides

$D_K:=c^K b^{K^2}.$

Moreover,

$0<\gamma_{b,c}-S_K \le \sum_{j\ge K+1}\frac{1}{c^j b^{j^2}} \le \frac{2}{c^{K+1}b^{(K+1)^2}} = \frac{2}{D_{K+1}}$

for $K$ large.

If $\gamma_{b,c}=p/q\in\mathbb{Q}$ and $S_K\ne p/q$ , then

$\frac{1}{qD_K}\le \left|\gamma_{b,c}-S_K\right|\le \frac{2}{D_{K+1}}.$

Hence $D_{K+1}\le 2qD_K$ , which is impossible for large $K$ because

$\frac{D_{K+1}}{D_K}=c\,b^{2K+1}\to\infty.$

The remaining case $S_K=p/q$ cannot occur for two consecutive values of $K$, since $S_{K+1}>S_K$; hence $S_K\ne p/q$ for arbitrarily large $K$, and the previous argument applies. ∎

4. Match bounds

4.1. A basic comparison principle

If two numbers in $[0,1)$ have the same first $L$ base- $b$ digits, then they belong to the same $b$ -adic interval of length $b^{-L}$ . Therefore, whenever $z_n\ne z_{n+s}$ ,

$M(n,s)\le \log_b\!\bigl(1/|z_n-z_{n+s}|\bigr). \tag{5}$
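The comparison principle (5) can be illustrated with exact fractions: if two points of $[0,1)$ share $L$ leading base-$b$ digits, then $b^L\,|x-y|<1$. The sketch below uses a hypothetical helper `common_leading_digits`, with a cap `limit` to guarantee termination when $x=y$.

```python
from fractions import Fraction
import math


def common_leading_digits(x, y, b, limit=10_000):
    """Number of leading base-b digits shared by x and y in [0, 1)."""
    L = 0
    while L < limit:
        x, y = x * b, y * b
        dx, dy = math.floor(x), math.floor(y)
        if dx != dy:
            return L
        x, y = x - dx, y - dy
        L += 1
    return L


x = Fraction(1, 3)                      # 0.3333...
y = Fraction(1, 3) + Fraction(1, 1000)  # 0.3343...
L = common_leading_digits(x, y, 10)
print(L, float(abs(x - y) * 10**L))     # second value is below 1, as (5) requires
```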

4.2. Within one epoch

Theorem 4.1 (Within-epoch match bound). There exist constants $K_{\mathrm{we}}=K_{\mathrm{we}}(b,c)$ and $C_{\mathrm{we}}=C_{\mathrm{we}}(b,c)$ such that the following holds.

Let $K\ge K_{\mathrm{we}}$ , and let $n,n+s$ lie in the same epoch $K$ with $1\le s\le 2K+1$ . Then

$M(n,s)\le K\alpha + C_{\mathrm{we}}.$

Proof. Let $e=e_n=(K+1)^2-n$ . Since $n+s$ also lies in epoch $K$ , we have

$e_{n+s}=e-s\ge 1, \qquad\text{hence}\qquad e\ge s+1.$

By Lemma 3.1,

$z_n=\frac{P_n}{c^K}+T_n, \qquad z_{n+s}=\frac{P_{n+s}}{c^K}+T_{n+s},$

with $0\le P_n,P_{n+s}<c^K$ . Set

$\Delta:=P_n-P_{n+s}.$

Then

$z_n-z_{n+s}=\frac{\Delta}{c^K}+(T_n-T_{n+s}). \tag{6}$

To understand $\Delta$ , define

$Q_m:=\sum_{j=1}^{K} c^{K-j} b^{m-j^2}\in \mathbb{Z} \qquad (m=n,n+s).$

Then $Q_m\equiv P_m\pmod{c^K}$ , and

$Q_{n+s}=b^sQ_n.$

Hence

$P_{n+s}\equiv b^s P_n\pmod{c^K},$

$\Delta\equiv (1-b^s)P_n\pmod{c^K}. \tag{7}$

Set

$\nu:=v_c(b^s-1).$

Since $1\le s\le 2K+1$, one has $\nu=O(\log K)$. Indeed, if $p\mid c$ is a fixed prime factor of $c$ and $d_0=\operatorname{ord}_p(b)$, then the standard $p$-adic valuation formula $v_p(b^s-1)=v_p(b^{d_0}-1)+v_p(s/d_0)$ (when $d_0\mid s$; zero otherwise), valid for odd $p$ and holding up to a bounded additive correction when $p=2$, gives

$v_p(b^s-1)\le A_p+\log_p s$

for a constant $A_p=A_p(b)$ . As $c^\nu\mid (b^s-1)$ implies $p^\nu\mid (b^s-1)$ , we get

$\nu\le v_p(b^s-1)\le A_p+\log_p(2K+1)=O(\log K).$

Thus $\nu<K$ for all $K\ge K_{\mathrm{we}}$ .

Now (7) and $\gcd(P_n,c)=1$ imply

$v_c(\Delta)=v_c(b^s-1)=\nu,$

because divisibility by $c^r$ with $r<K$ is preserved under congruence modulo $c^K$ . In particular, $\Delta\ne 0$ , and therefore

$\left|\frac{\Delta}{c^K}\right|\ge \frac{1}{c^{K-\nu}}. \tag{*}$

Next, since $n$ and $n+s$ lie in the same epoch,

$T_{n+s}=b^s T_n.$

Using Lemma 3.1 and $e\ge s+1$ ,

$|T_n-T_{n+s}|=(b^s-1)T_n \le \frac{16}{15}\,\frac{b^s-1}{c^{K+1}b^e} \le \frac{16}{15}\,\frac{1}{c^{K+1}b}.$

Combining this with (6) and (*),

$|z_n-z_{n+s}| \ge \frac{1}{c^{K-\nu}}-\frac{16}{15}\,\frac{1}{c^{K+1}b} = \frac{1}{c^{K-\nu}} \left(1-\frac{16}{15c^{\nu+1}b}\right).$

Since $b,c\ge 2$ ,

$\frac{16}{15c^{\nu+1}b}\le \frac{16}{15\cdot 2\cdot 2}=\frac{4}{15}<\frac12.$

Therefore

$|z_n-z_{n+s}|\ge \frac{1}{2c^{K-\nu}}.$

By (5),

$M(n,s)\le \log_b(2c^{K-\nu})\le K\alpha+\log_b 2.$

This proves the theorem. ∎
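The two key steps of this proof, the identity $v_c(\Delta)=v_c(b^s-1)$ and the separation $|z_n-z_{n+s}|\ge 1/(2c^{K-\nu})$, can be tested exactly for small parameters. The sketch below is a sanity check under stated assumptions, not part of the proof: the series is truncated a few terms past the epoch (`extra_terms`, our choice), and pairs with $\nu\ge K$ are skipped because the argument requires $\nu<K$. All helper names are ours.

```python
from fractions import Fraction
import math


def v_c(m, c):
    """Largest k with c^k | m (infinity for m = 0)."""
    if m == 0:
        return math.inf
    m, k = abs(m), 0
    while m % c == 0:
        m //= c
        k += 1
    return k


def lattice_numerator(b, c, n, K):
    """P_n: residue of Q_n = sum_j c^(K-j) b^(n-j^2) modulo c^K."""
    Q = sum(c**(K - j) * b**(n - j * j) for j in range(1, K + 1))
    return Q % c**K


def orbit_point(b, c, n, J):
    """z_n = {b^n gamma} computed from the series truncated at index J."""
    x = b**n * sum(Fraction(1, c**j * b**(j * j)) for j in range(1, J + 1))
    return x - math.floor(x)


def check_within_epoch(b, c, K, extra_terms=6):
    """Check v_c(Delta) = nu and |z_n - z_{n+s}| >= 1/(2 c^(K-nu)) inside epoch K."""
    J = K + extra_terms
    for n in range(K * K, (K + 1) ** 2):
        for s in range(1, (K + 1) ** 2 - n):       # n + s stays in epoch K
            nu = v_c(b**s - 1, c)
            if nu >= K:                            # outside the range of the argument
                continue
            delta = lattice_numerator(b, c, n, K) - lattice_numerator(b, c, n + s, K)
            gap = abs(orbit_point(b, c, n, J) - orbit_point(b, c, n + s, J))
            if v_c(delta, c) != nu or gap < Fraction(1, 2 * c**(K - nu)):
                return False
    return True


print([check_within_epoch(2, 3, K) for K in range(1, 6)])
```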

4.3. A local cross-epoch bound

The next proposition is the local estimate needed in the qualitative proof. The hypothesis on the epoch gap $\delta$ is the only place where the quantity

$D(K_1):=\min\!\left(K_1+2,\ \left\lfloor\frac{2K_1+2}{\alpha}\right\rfloor\right)$

appears.

Proposition 4.2 (Local cross-epoch bound). There exist constants $K_{\mathrm{loc}}=K_{\mathrm{loc}}(b,c)$ and $C_{\mathrm{loc}}=C_{\mathrm{loc}}(b,c)$ such that the following holds.

Let $N$ lie in epoch $K_1$ , let $N+s$ lie in a later epoch $K_2>K_1$ , and set

$\delta:=K_2-K_1,\qquad e_1:=(K_1+1)^2-N,\qquad e_2:=(K_2+1)^2-(N+s).$

Assume

$1\le s\le 3N \qquad\text{and}\qquad \delta\le D(K_1).$

Then for all $K_1\ge K_{\mathrm{loc}}$ ,

$M(N,s)\le (K_2+1)\alpha + \max(e_1,e_2)+C_{\mathrm{loc}}.$

In particular,

$M(N,s)=O(\sqrt N).$

Proof. By Lemma 3.1,

$z_N=\frac{P_1}{c^{K_1}}+T_1, \qquad z_{N+s}=\frac{P_2}{c^{K_2}}+T_2,$

where $\gcd(P_1,c)=\gcd(P_2,c)=1$ . Set

$D:=c^\delta P_1-P_2.$

Then $c\nmid D$ (because $c\mid c^\delta P_1$ but $c\nmid P_2$ ), hence $D\ne 0$ , and

$z_N-z_{N+s}=\frac{D}{c^{K_2}}+T_1-T_2.$

Extract the first tail term at each position:

$T_1=\frac{1}{c^{K_1+1}b^{e_1}}+R_1, \qquad T_2=\frac{1}{c^{K_2+1}b^{e_2}}+R_2.$

By (4),

$|R_1| \le \frac{16}{15}\,\frac{1}{c^{K_1+2}b^{e_1+2K_1+3}}, \qquad |R_2| \le \frac{16}{15}\,\frac{1}{c^{K_2+2}b^{e_2+2K_2+3}}. \tag{**}$

Thus

$z_N-z_{N+s} = \frac{D}{c^{K_2}} +\frac{1}{c^{K_1+1}b^{e_1}} -\frac{1}{c^{K_2+1}b^{e_2}} +(R_1-R_2).$

We split into three cases.

Case 1: $e_1\ge e_2$ . Multiply by $c^{K_1+1}b^{e_1}$ :

$c^{K_1+1}b^{e_1}(z_N-z_{N+s}) = \frac{cDb^{e_1}+c^\delta-b^{e_1-e_2}}{c^\delta}+E.$

Here $E:=c^{K_1+1}b^{e_1}(R_1-R_2)$ . The numerator of the rational part is congruent modulo $c$ to $-b^{e_1-e_2}$ , hence is not divisible by $c$ . Therefore the rational part is at distance at least $c^{-\delta}$ from $\mathbb{Z}$ .

We now estimate the error. From (**),

$|E| \le \frac{16}{15}\,\frac{1}{cb^{2K_1+3}} +\frac{16}{15}\,\frac{1}{c^{\delta+1}b^{e_2+2K_2+3-e_1}}.$

Since $e_1\le 2K_1+1$ and $e_2\ge 1$ ,

$e_2+2K_2+3-e_1\ge 1+2(K_1+\delta)+3-(2K_1+1)=2\delta+3\ge 5,$

so the second term is at most $(16/15)c^{-(\delta+1)}b^{-5}$ .

For the first term, the hypothesis $\delta\le D(K_1)$ gives $\alpha \delta\le 2K_1+2$ , hence $c^\delta=b^{\alpha \delta}\le b^{2K_1+2}$ . Therefore

$\frac{16}{15}\,\frac{1}{cb^{2K_1+3}} \le \frac{16}{15}\,\frac{1}{cb\,c^\delta} = \frac{16}{15cb}\,c^{-\delta} \le \frac{4}{15}\,c^{-\delta}.$

Also

$\frac{16}{15}\,\frac{1}{c^{\delta+1}b^5} \le \frac{1}{15}\,c^{-\delta}.$

Thus $|E|\le \frac13 c^{-\delta}$ , and hence

$|z_N-z_{N+s}| \ge \frac{2}{3}\,\frac{1}{c^{K_2+1}b^{e_1}}.$

Case 2: $e_1<e_2$ and $\delta\ge 2$ . Again multiply by $c^{K_1+1}b^{e_1}$ :

$c^{K_1+1}b^{e_1}(z_N-z_{N+s}) = \frac{Db^{e_1}}{c^{\delta-1}}+1+E,$

where

$E= -\frac{1}{c^\delta b^{e_2-e_1}} + c^{K_1+1}b^{e_1}(R_1-R_2).$

Because $c\nmid D$ and $\gcd(b,c)=1$ , the rational term $Db^{e_1}/c^{\delta-1}$ has exact denominator $c^{\delta-1}$ , so

$\operatorname{dist}\!\left(\frac{Db^{e_1}}{c^{\delta-1}}+1,\ \mathbb{Z}\right)\ge c^{-(\delta-1)}.$

Using (**) and $e_2-e_1\ge 1$ ,

$|E| \le \frac{1}{c^\delta b} +\frac{16}{15}\,\frac{1}{cb^{2K_1+3}} +\frac{16}{15}\,\frac{1}{c^{\delta+1}b^{e_2+2K_2+3-e_1}}.$

As above, $c^\delta\le b^{2K_1+2}$ , so

$\frac{16}{15}\,\frac{1}{cb^{2K_1+3}} \le \frac{16}{15}\,\frac{1}{c^2 b}\,c^{-(\delta-1)} \le \frac{2}{15}\,c^{-(\delta-1)}.$

Also

$\frac{1}{c^\delta b}\le \frac{1}{cb}\,c^{-(\delta-1)}\le \frac14\,c^{-(\delta-1)}.$

Finally, since $e_2+2K_2+3-e_1\ge 2\delta+4\ge 8$ ,

$\frac{16}{15}\,\frac{1}{c^{\delta+1}b^{e_2+2K_2+3-e_1}} \le \frac{1}{15}\,c^{-(\delta-1)}.$

Hence $|E|<\frac12 c^{-(\delta-1)}$ , and therefore

$|z_N-z_{N+s}| \ge \frac{1}{2c^{K_2}b^{e_1}}.$

Case 3: $\delta=1$ and $e_1<e_2$ . Multiply by $c^{K_1+1}b^{e_2}$ :

$c^{K_1+1}b^{e_2}(z_N-z_{N+s}) = Db^{e_2}+b^{e_2-e_1}-\frac{1}{c}+E,$

where $E:=c^{K_1+1}b^{e_2}(R_1-R_2)$. Since $Db^{e_2}+b^{e_2-e_1}$ is an integer, the main term $Db^{e_2}+b^{e_2-e_1}-\frac{1}{c}$ is at distance exactly $1/c$ from $\mathbb{Z}$.

Since $\delta=1$ , we have

$s=(K_2+1)^2-(K_1+1)^2+e_1-e_2=2K_1+3+e_1-e_2\ge 1,$

so $e_2-e_1\le 2K_1+2$ . Hence, by (**),

$|E| \le \frac{16}{15}\,\frac{1}{cb^{\,2K_1+3-(e_2-e_1)}} +\frac{16}{15}\,\frac{1}{c^2 b^{2K_2+3}} \le \frac{16}{15}\,\frac{1}{cb} +\frac{1}{60c} < \frac{3}{5c}.$

Therefore

$|z_N-z_{N+s}| \ge \frac{2}{5}\,\frac{1}{c^{K_2+1}b^{e_2}}.$

In all three cases we have

$|z_N-z_{N+s}| \ge \frac{1}{C}\,\frac{1}{c^{K_2+1}b^{\max(e_1,e_2)}}$

for an absolute constant $C$ (one may take $C=5/2$, the worst of the three cases). By (5),

$M(N,s)\le (K_2+1)\alpha+\max(e_1,e_2)+\log_b C.$

This proves the proposition. ∎

Remark 4.3. If $\alpha<2$ , then $D(K_1)=K_1+2$ for all large $K_1$ , so the hypothesis $\delta\le D(K_1)$ is automatic from $s\le 3N$ . For general $\alpha$ , the qualitative proof below uses the recurrence of Sturmian factors to place the repeated pattern so that $\delta\le D(K_1)$ still holds. The restriction $\alpha<2$ is needed only for the quantitative Theorem 1.2.

5. Proof of Theorem 1.1

Proof. Assume for contradiction that

$\liminf_{n\to\infty}\bigl(p(\gamma_{b,c},b,n)-n\bigr)<\infty.$

By Lemma 3.2, the number $\gamma_{b,c}$ is irrational. Hence its digit sequence is not eventually periodic, and by Theorem 2.3 it is eventually quasi-Sturmian:

$\mathbf{x}=W\,\varphi(\mathbf{s}),$

where $W$ is a finite word, $\mathbf{s}$ is a Sturmian word of slope $\beta$ , and

$\varphi:\{0,1\}\to \{0,1,\dots,b-1\}^*$

is a morphism. Since $\gamma_{b,c}$ is irrational, $\varphi$ must be non-erasing: if $|\varphi(i)|=0$ for some $i\in\{0,1\}$, then $p_{\varphi(\mathbf{s})}(n)\le |\varphi(1-i)|$ for all $n\ge|\varphi(1-i)|$, contradicting $p(n)\ge n+1$ from irrationality.

Write

$l_0:=|\varphi(0)|,\qquad l_1:=|\varphi(1)|,\qquad l_{\min}:=\min(l_0,l_1)\ge 1,$

and let

$\lambda:=(1-\beta)l_0+\beta l_1.$

If $F(m)$ denotes the number of digits in the prefix $W\varphi(s_1\cdots s_m)$ , then the Sturmian balance property implies

$F(m)=\lambda m+O(1).$

Consequently, there exists a constant $B\ge 1$ such that for all $m,q\ge 0$ ,

$|F(m)-\lambda m|\le B, \qquad |F(m+q)-F(m)-\lambda q|\le 2B. \tag{9}$

Let $\beta = [0; a_1, a_2, \ldots]$ be the continued-fraction expansion of the Sturmian slope, and let $q_k$ denote the convergent denominators. We distinguish two cases according to whether the partial quotients $a_k$ are bounded.

Case A: $\sup_k a_k = \infty$ . There exist infinitely many $k$ with $a_{k+1} \ge \lceil 2\alpha \rceil$ . For such $k$ ,

$\frac{q_k}{q_{k+1}} \;\le\; \frac{1}{a_{k+1}} \;\le\; \frac{1}{2\alpha}.$

Lemma 2.4 gives $n_k \in [q_{k+1}/2,\; q_{k+1})$ with $\operatorname{Match}$ .

Set $N_k := F(n_k)$ and $G_k := F(n_k + q_k) - F(n_k)$ . By (9),

$N_k \;\le\; \lambda q_{k+1} + B, \qquad G_k \;\le\; \lambda q_k + 2B.$

Since $n_k \ge q_{k+1}/2$ , also $N_k \ge \lambda q_{k+1}/2 - B$ . Hence for large $k$ ,

$\frac{G_k}{N_k} \;\le\; \frac{\lambda q_k + 2B}{\lambda q_{k+1}/2 - B} \;\le\; (2+\varepsilon)\,\frac{q_k}{q_{k+1}} \;\le\; \frac{2+\varepsilon}{2\alpha}$

for any $\varepsilon>0$ and all sufficiently large $k$ . When $\alpha\ge 2$ , this gives $G_k/N_k\le (1+\varepsilon)/\alpha$ for small $\varepsilon>0$ , which suffices for the epoch-gap bound below. When $\alpha<2$ , the ratio $G_k/N_k<1/\alpha$ is not needed: since $G_k\le \lambda q_k+2B\le \lambda q_{k+1}+2B\le 2(\lambda q_{k+1}/2+B)\le 2(N_k+2B)\le 3N_k$ for $k$ sufficiently large, and $N_k+G_k\le 4N_k<4(K_{1,k}+1)^2$ , we get $d_k\le K_{1,k}+2=D(K_{1,k})$ directly.

The morphic transfer gives a digit match of length $L_k \ge l_{\min} \lfloor q_{k+1}/16 \rfloor$ . Using the upper bound $N_k \le 2\lambda q_{k+1}$ (for large $k$ ): $L_k \ge l_{\min} q_{k+1}/32 \ge c_1 N_k$ where $c_1 := l_{\min}/(64\lambda) > 0$ .

Case B: $\sup_k a_k = M < \infty$ . All partial quotients satisfy $a_k \le M$ . Then the Sturmian word $\mathbf{s}$ is linearly recurrent: for every factor of length $\ell$ , $R(\ell) \le (M\!+\!2)\,\ell$ (since $R(\ell) \le q_{j+1} + q_j$ for $\ell \in [q_j, q_{j+1})$ , and $q_{j+1}/q_j \le a_{j+1}\!+\!1 \le M\!+\!1$ ).
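The two ratio bounds on convergent denominators used in Cases A and B, namely $q_k/q_{k+1}\le 1/a_{k+1}$ and $q_{k+1}/q_k\le a_{k+1}+1$, follow from the recurrence $q_{k+1}=a_{k+1}q_k+q_{k-1}$. A minimal sketch of that check, with a hypothetical list of partial quotients chosen only for illustration:

```python
def convergent_denominators(partial_quotients):
    """Return [q_0, q_1, ..., q_n] for beta = [0; a_1, ..., a_n]."""
    q_prev, q_curr = 0, 1          # q_{-1} = 0, q_0 = 1
    qs = [q_curr]
    for a in partial_quotients:
        # Standard recurrence q_{k+1} = a_{k+1} * q_k + q_{k-1}.
        q_prev, q_curr = q_curr, a * q_curr + q_prev
        qs.append(q_curr)
    return qs

def check_ratio_bounds(partial_quotients):
    """Check a_{k+1} * q_k <= q_{k+1} <= (a_{k+1} + 1) * q_k for every k."""
    qs = convergent_denominators(partial_quotients)
    for k, a_next in enumerate(partial_quotients):
        q_k, q_next = qs[k], qs[k + 1]
        if not (a_next * q_k <= q_next <= (a_next + 1) * q_k):
            return False
    return True

quotients = [1, 2, 1, 7, 3, 1, 1, 4]   # hypothetical partial quotients, bounded by M = 7
print(convergent_denominators(quotients))
print(check_ratio_bounds(quotients))
```

The lower inequality is the one used in Case A, and the upper inequality is the one behind linear recurrence in Case B.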

Fix

$A := 2\alpha(M\!+\!2) + M + 2.$

By Lemma 2.4, for large $k$ there exists $n_k \in [q_{k+1}/2, q_{k+1})$ with $\operatorname{Match}$ . Define $m_k := \lfloor q_{k+1}/16 \rfloor$ (truncating the match to a controlled length). The factor $U_k := s_{n_k+1} \cdots s_{n_k+q_k+m_k}$ has length $\ell_k = q_k + m_k \le q_k + q_{k+1} \le (M\!+\!2)\,q_k$ (using $q_{k+1} \le (M\!+\!1)q_k$ ). By linear recurrence, $R(\ell_k) \le (M\!+\!2)\,\ell_k \le (M\!+\!2)^2 q_k$ . So there exists an occurrence of $U_k$ starting at some

$r_k \in [Aq_{k+1},\; Aq_{k+1} + (M\!+\!2)^2 q_k].$

Set $N_k := F(r_k)$ and $G_k := F(r_k + q_k) - F(r_k)$ . Since $r_k \le Aq_{k+1} + (M\!+\!2)^2 q_k \le (A + (M\!+\!2)^2) q_{k+1}$ :

$N_k \le \lambda(A + (M\!+\!2)^2) q_{k+1} + B.$

Also $N_k \ge \lambda A q_{k+1} - B$ and $G_k \le \lambda q_k + 2B$ , so for large $k$ ,

$\frac{G_k}{N_k} \;\le\; \frac{\lambda q_k + 2B}{\lambda A q_{k+1} - B} \;\le\; \frac{2}{A} \;<\; \frac{1}{\alpha},$

since $A > 2\alpha$ and $q_k \le q_{k+1}$ . Also $G_k \le 3N_k$ . Using the upper bound on $N_k$ : $L_k \ge l_{\min}\, m_k \ge l_{\min}\, q_{k+1}/32 \ge c_2 N_k$ where $c_2 := l_{\min}/(64\lambda(A + (M\!+\!2)^2)) > 0$ .

Conclusion (both cases). In both cases, we obtain an infinite sequence of $(N_k, G_k)$ with $G_k \le 3N_k$ , a digit match of length $L_k \ge c_0 N_k$ for a fixed constant $c_0 > 0$ (independent of $k$ ), and additionally: in Case A with $\alpha\ge 2$ and in Case B, we have $G_k \le (1+\varepsilon) N_k/\alpha$ for small $\varepsilon>0$ .

Let $K_{1,k}$ and $K_{2,k}$ be the epoch indices of $N_k$ and $N_k + G_k$ , and set $d_k := K_{2,k} - K_{1,k}$ . If $d_k = 0$ (same epoch), Theorem 4.1 gives $M(N_k, G_k) \le K_{1,k}\alpha + C_{\mathrm{we}} = O(\sqrt{N_k})$ . If $d_k \ge 1$ , we verify $d_k\le D(K_{1,k})$ . Since $G_k \le 3N_k$ , we have $N_k + G_k \le 4N_k < 4(K_{1,k}\!+\!1)^2$ , so $K_{2,k} < 2(K_{1,k}\!+\!1)$ and

$d_k \le K_{1,k} + 1 \le K_{1,k} + 2.$

In Case A with $\alpha<2$ : $D(K_{1,k})=K_{1,k}+2$ , so the above gives $d_k\le D(K_{1,k})$ directly.

In Case A with $\alpha\ge 2$ and in Case B: since $N_k + G_k$ lies in epoch $K_{2,k}$ ,

$G_k \;\ge\; K_{2,k}^2 - (K_{1,k}\!+\!1)^2 + 1 \;=\; 2K_{1,k}(d_k\!-\!1) + d_k^2,$

hence $d_k \le 1 + G_k/(2K_{1,k})$ . Since $G_k \le (1+\varepsilon) N_k/\alpha < (1+\varepsilon)(K_{1,k}\!+\!1)^2/\alpha$ ,

$d_k \;\le\; 1 + \frac{(1+\varepsilon)(K_{1,k}+1)^2}{2\alpha K_{1,k}} \;\le\; \frac{2K_{1,k}+2}{\alpha}$

for $\varepsilon$ small and $k$ large. Thus $d_k \le D(K_{1,k})$ .

In all sub-cases, Proposition 4.2 gives $M(N_k, G_k) \le C\sqrt{N_k}$ .

In either case,

$c_0 N_k ;\le; L_k ;\le; M(N_k, G_k) ;\le; C\sqrt{N_k},$

so $N_k \le (C/c_0)^2$ . This contradicts $N_k \to \infty$ .

The contradiction proves Theorem 1.1. ∎

6. Proof of Theorem 1.2

For the quantitative theorem we count factors that are completely contained in their starting epochs. This avoids the local restriction in Proposition 4.2.

Lemma 6.1 (Cross-epoch separation for internal factors). Let $n\ge 1$ , and let $N_1<N_2$ lie in epochs $K_1<K_2$ with $K_1\ge 1$ . Suppose that the orbit points decompose as $z_{N_i}=P_i/c^{K_i}+T_i$ with $\gcd(P_i,c)=1$ and

$T_i\le \frac{C_{\mathrm{tail}}}{c^{K_i+1}\,b^{e_{N_i}}} \qquad (i=1,2)$

for some constant $C_{\mathrm{tail}}\ge 1$ satisfying $4C_{\mathrm{tail}}< bc^2$ . Assume

$e_{N_1}\ge n,\qquad e_{N_2}\ge n,$

and also

$n\ge \alpha K_2+1.$

Then

$|z_{N_1}-z_{N_2}|\ge \frac{1}{2c^{K_2}}.$

Consequently,

$M(N_1,N_2-N_1)\le K_2\alpha+\log_b 2.$

Proof. Set

$\delta:=K_2-K_1\ge 1, \qquad D:=c^\delta P_1-P_2.$

Then $c\nmid D$ because $c\mid c^\delta P_1$ and $c\nmid P_2$ . Hence $D\ne 0$ and

$\left|\frac{D}{c^{K_2}}\right|\ge \frac{1}{c^{K_2}}.$

Also

$z_{N_1}-z_{N_2}=\frac{D}{c^{K_2}}+(T_1-T_2).$

By the tail hypothesis and $e_{N_i}\ge n$ ,

$T_i\le \frac{C_{\mathrm{tail}}}{c^{K_i+1}b^n} \qquad (i=1,2).$

For $T_1$ , using $\delta-1\le K_2-2$ (since $K_1\ge 1$ ) and $n\ge \alpha K_2+1$ ,

$\frac{1}{c^{K_1+1}b^n} = \frac{1}{c^{K_2}}\,b^{\alpha(\delta-1)-n} \le \frac{1}{c^{K_2}}\,b^{\alpha(K_2-2)-n} \le \frac{1}{bc^2}\,\frac{1}{c^{K_2}}.$

For $T_2$ ,

$\frac{1}{c^{K_2+1}b^n} \le \frac{1}{c^{K_2+1}b^{\alpha K_2+1}} = \frac{1}{bc^{2K_2+1}} \le \frac{1}{bc^2}\,\frac{1}{c^{K_2}}.$

Therefore

$|T_1-T_2| \le T_1+T_2 \le \frac{2C_{\mathrm{tail}}}{bc^2}\,\frac{1}{c^{K_2}}.$

Since $4C_{\mathrm{tail}}<bc^2$ ,

$\frac{2C_{\mathrm{tail}}}{bc^2}<\frac12.$

Thus

$|T_1-T_2|<\frac{1}{2c^{K_2}}.$

Together with the lattice gap, this yields

$|z_{N_1}-z_{N_2}| \ge \frac{1}{c^{K_2}}-\frac{1}{2c^{K_2}} = \frac{1}{2c^{K_2}}.$

The match bound follows from (5). ∎

Proof of Theorem 1.2. Let $K_{\mathrm{we}}$ and $C_{\mathrm{we}}$ be as in Theorem 4.1, and define

$C_*:=\max\{C_{\mathrm{we}},\log_b 2\}+2.$

For $n\ge 1$ , set

$A_n:=\left\lceil\frac{n-1}{2}\right\rceil, \qquad B_n:=\left\lfloor\frac{n-C_*}{\alpha}\right\rfloor.$

Since $\alpha<2$ , we have

$B_n-A_n+1\sim \left(\frac{1}{\alpha}-\frac12\right)n = \frac{2-\alpha}{2\alpha}\,n,$

so in particular there exists $n_0=n_0(b,c)$ such that for all $n\ge n_0$ ,

$A_n\ge K_{\mathrm{we}}, \qquad B_n\ge A_n,$

and

$R_n:=B_n-A_n+1\ge \frac{2-\alpha}{4\alpha}\,n.$

Fix $n\ge n_0$ . For each epoch $K\in [A_n,B_n]$ , let

$\mathcal{N}_K(n):= \{\,N\in \mathbb{Z} : K^2\le N\le (K+1)^2-n\,\}.$

Any length- $n$ factor starting at $N\in\mathcal{N}_K(n)$ is fully contained in epoch $K$ , because then $e_N\ge n$ . The number of such starting positions is

$m_K:=\#\mathcal{N}_K(n) = (K+1)^2-n-K^2+1 = 2K+2-n.$

We claim that all factors arising from

$\bigcup_{K=A_n}^{B_n}\mathcal{N}_K(n)$

are pairwise distinct.

Same epoch. Let $N_1<N_2$ lie in the same $\mathcal{N}_K(n)$ , and set $s:=N_2-N_1$ . Then

$1\le s\le (K+1)^2-n-K^2=2K+1-n\le 2K+1.$

By Theorem 4.1,

$M(N_1,s)\le K\alpha+C_{\mathrm{we}}.$

Since $K\le B_n$ ,

$K\alpha+C_{\mathrm{we}} \le n-C_*+C_{\mathrm{we}}<n.$

Hence the two length- $n$ factors are distinct.

Different epochs. Let $N_1\in\mathcal{N}_{K_1}(n)$ and $N_2\in\mathcal{N}_{K_2}(n)$ with $K_1<K_2$ . Then $e_{N_1}\ge n$ and $e_{N_2}\ge n$ by construction. Also $K_2\le B_n$ , so

$\alpha K_2+1\le n-C_*+1<n.$

By (3), Lemma 3.1 gives the decomposition with $C_{\mathrm{tail}}=16/15$ , and $4\cdot 16/15=64/15<8\le bc^2$ . Thus Lemma 6.1 applies and gives

$M(N_1,N_2-N_1)\le K_2\alpha+\log_b2\le n-C_*+\log_b2<n.$

Hence these length- $n$ factors are also distinct.

Therefore

$p(\gamma_{b,c},b,n)\ge \sum_{K=A_n}^{B_n} m_K.$

Using the formula for $m_K$ ,

$\sum_{K=A_n}^{B_n} m_K = \sum_{K=A_n}^{B_n}(2K+2-n) = R_n(A_n+B_n+2-n).$

Now

$A_n+B_n+2-n-R_n = A_n+B_n+2-n-(B_n-A_n+1) = 2A_n+1-n\ge 0,$

because $A_n=\lceil(n-1)/2\rceil$ . Hence

$A_n+B_n+2-n\ge R_n,$

and therefore

$p(\gamma_{b,c},b,n)\ge R_n^2.$

By the lower bound on $R_n$ ,

$p(\gamma_{b,c},b,n) \ge \left(\frac{2-\alpha}{4\alpha}\,n\right)^2 = \frac{(2-\alpha)^2}{16\alpha^2}\,n^2.$

This proves the theorem. ∎
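The counting step of the proof, $\sum_{K=A_n}^{B_n}(2K+2-n)=R_n(A_n+B_n+2-n)\ge R_n^2$, is elementary arithmetic and can be checked mechanically. The sketch below does so in floating point; the base pair $b=3$, $c=2$ (so $\alpha=\log_3 2<2$) and the value $C_*=5$ are hypothetical choices for illustration only, since the true $C_*$ depends on the constant of Theorem 4.1.

```python
from math import ceil, floor, log

def epoch_count_bound(n: int, b: int, c: int, c_star: float):
    """Return (A_n, B_n, sum of m_K, R_n) for the square-epoch count,
    or None when the epoch range [A_n, B_n] is empty."""
    alpha = log(c) / log(b)                 # alpha = log_b c
    a_n = ceil((n - 1) / 2)
    b_n = floor((n - c_star) / alpha)
    if b_n < a_n:
        return None
    # m_K = 2K + 2 - n internal starting positions in epoch K.
    total = sum(2 * k + 2 - n for k in range(a_n, b_n + 1))
    r_n = b_n - a_n + 1
    return a_n, b_n, total, r_n

a_n, b_n, total, r_n = epoch_count_bound(n=100, b=3, c=2, c_star=5.0)
print(a_n, b_n, total, r_n * r_n)
```

For $n=100$ under these assumptions one gets $A_n=50$, $B_n=150$ and $R_n=101$, so the sum $10302$ indeed dominates $R_n^2=10201$.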

7. A general epoch-expansion theorem

The proofs for $\gamma_{b,c}$ and $M_{d,b,c}$ share a common structure that applies to a wide class of lacunary constants. We now state and prove the general result announced in Theorem 1.5.

Proof of Theorem 1.5. Write $\alpha=\log_b c$ . For $n$ in epoch $K$ (i.e. $f(K)\le n<f(K\!+\!1)$ ), the orbit point $z_n=\{b^n\xi\}$ decomposes as

$z_n\;=\;\frac{P_n}{c^{g(K)}}+T_n, \tag{10}$

where $P_n\equiv a_K b^{n-f(K)}\pmod{c}$ , hence $\gcd(P_n,c)=1$ (since $\gcd(a_K,c)=\gcd(b,c)=1$ ), and $T_n\le C_{\mathrm{tail}}/(c^{g(K)+1}b^{f(K+1)-n})$ for a constant $C_{\mathrm{tail}}$ depending only on $b,c$ (the geometric tail bound uses $g(K\!+\!1)\ge g(K)+1$ , which follows from $g$ strictly increasing, and $f(K\!+\!2)-f(K\!+\!1)\to\infty$ ). This is the direct analogue of Lemma 3.1 with $c^K$ replaced by $c^{g(K)}$ , and $k^2$ replaced by $f(k)$ .

Within-epoch match bound. For $n,n+s$ in the same epoch $K$ with $1\le s\le L_K$ , the $c$ -adic valuation argument of Theorem 4.1 gives

$M(n,s)\;\le\; g(K)\alpha+C_0, \tag{11}$

provided $v_c(b^s-1)<g(K)$ , which holds for all $s\le L_K$ and all large $K$ by hypothesis (1).

Cross-epoch separation. For $N_1$ in epoch $K_1$ and $N_2$ in epoch $K_2>K_1$ with $f(K_i+1)-N_i\ge n$ , the lattice gap $|D/c^{g(K_2)}|\ge 1/c^{g(K_2)}$ (from $c\nmid D$ exactly as in Lemma 6.1) dominates the tail difference whenever $n\ge g(K_2)\alpha+1$ .

Quantitative bound. For each epoch $K$ , the set $\mathcal{N}_K(n)$ has $m_K=L_K-n+1$ elements whenever $L_K\ge n$ . All length- $n$ factors from starting positions in $\bigcup_{K=A_n}^{B_n}\mathcal{N}_K(n)$ are pairwise distinct: same-epoch pairs are separated by (11) (using $g(K)\alpha+C_0<n$ for $K\le B_n$ ), and cross-epoch pairs by the lattice gap. Hence $p(\xi,b,n)\ge \sum_{K=A_n}^{B_n}(L_K-n+1)$ .

Qualitative statement. When the quantitative range is nonempty (i.e. $B_n \ge A_n$ for all large $n$ ), the divergence of the sum $\sum_{K=A_n}^{B_n}(L_K-n+1)$ gives $p(\xi,b,n)-n\to\infty$ directly. This covers all cases where $\alpha < 2$ , and more generally whenever $f$ grows fast enough that $B_n > A_n$ eventually.

When $B_n < A_n$ (which can happen for polynomial $f$ with $\alpha \ge 2$ ), we use the Cassaigne argument of Theorem 1.1, with $g(K)$ replacing $K$ . Assume for contradiction that $\liminf(p(n)-n)<\infty$ . By Cassaigne's theorem, the digit sequence is eventually quasi-Sturmian. The within-epoch match bound (11) gives $M(n,s)\le g(K)\alpha + C_0$ for all gaps $s$ within epoch $K$ . Since $g(K) = o(f(K))$ and $L_K\to\infty$ , large epochs contain $\gg L_K$ digit positions, and the quasi-Sturmian self-match of length $\Omega(N)$ (from the Sturmian convergents, placed inside a single epoch as in Section 5) gives $\Omega(N) \le g(K)\alpha + C_0$ . But $g(K)\alpha = o(N)$ (since $N \ge f(K)$ and $g = o(f)$ ), so $\Omega(N) = o(N)$ , a contradiction. ∎

The following corollaries illustrate the scope of Theorem 1.5.

Corollary 7.1 (Polynomial exponents). Let $r\ge 2$ be an integer, $b,c\ge 2$ with $\gcd(b,c)=1$ , and $\alpha=\log_b c$ . For any integers $a_k$ with $\gcd(a_k,c)=1$ , set $\xi=\sum_{k\ge 1}a_k/(c^k b^{k^r})$ . Then:

(a) $p(\xi,b,n)-n\to+\infty$ ;

(b) if $r=2$ and $\alpha<2$ , then $p(\xi,b,n)\ge c_0\,n^2$ with $c_0=(2-\alpha)^2/(16\alpha^2)$ (recovering Theorem 1.2);

(c) for $r\ge 3$ and any $\alpha>0$ , there exists $c_r>0$ such that $p(\xi,b,n)\ge c_r\, n^r$ .

Proof. Take $f(k)=k^r$ , $g(k)=k$ . Then $f$ and $g$ are strictly increasing, $L_K=(K\!+\!1)^r-K^r\sim rK^{r-1}$ , and $g(K)=K$ . Since $\log L_K = O(\log K) = o(K) = o(g(K))$ , hypothesis (1) is satisfied. In the quantitative bound, $A_n\sim (n/r)^{1/(r-1)}$ and $B_n\sim n/\alpha$ , so $\sum_{K=A_n}^{B_n}(L_K-n+1)\ge c\sum_{K\sim n/\alpha}rK^{r-1}\ge c_r n^r$ . For $r=2$ this specializes to the quadratic bound of Theorem 1.2. For $r\ge 3$ , $B_n\sim n/\alpha$ regardless of whether $\alpha<2$ , so no restriction on $\alpha$ is needed. When $r=2$ and $\alpha \ge 2$ , the quantitative range is empty ( $B_n < A_n$ ), but the qualitative conclusion $p(n)-n\to\infty$ follows from the Cassaigne argument in the proof of Theorem 1.5. ∎

Corollary 7.2 (Exponential exponents). Let $d\ge 2$ , $b,c\ge 2$ with $\gcd(b,c)=1$ and $d<c$ . Then $M_{d,b,c}=\sum_{k\ge 0}1/(c^k b^{d^k})$ satisfies $p(M_{d,b,c},b,n)\ge d^{n/\alpha-C}$ (recovering Theorem 1.4).

Proof. Take $f(k)=d^k$ , $g(k)=k$ . Then $f$ and $g$ are strictly increasing, $L_K=(d\!-\!1)d^K$ , and $g(K)=K$ . Here $\log L_K \sim K\log d$ , so the sufficient condition $\log L_K = o(g(K))$ does not hold. Instead, hypothesis (1) is verified directly: since $d < c$ , the multiplicative order estimate (8) gives $\operatorname{ord}_{c^K}(b) > L_K$ for large $K$ , hence $v_c(b^s-1) < K = g(K)$ for all $1\le s \le L_K$ . The quantitative bound then gives $A_n\sim\log_d n$ , $B_n\sim n/\alpha$ , and the geometric sum yields $p(n)\ge d^{B_n}/2\ge d^{n/\alpha-C}$ . ∎

Corollary 7.3 (Tschakaloff function values). For $b,c\ge 2$ with $\gcd(b,c)=1$ , the Tschakaloff value

$T_b(1/c)\;=\;\sum_{k\ge 1}\frac{1}{c^k\,b^{k(k+1)/2}}$

satisfies $p(T_b(1/c),b,n)-n\to+\infty$ . If $\alpha=\log_b c<2$ , then $p(T_b(1/c),b,n)\ge c_0\,n^2$ .

Proof. Take $f(k)=k(k+1)/2$ , $g(k)=k$ . Then $f$ and $g$ are strictly increasing, $L_K=K+1$ , and $\log L_K = O(\log K) = o(K) = o(g(K))$ , so hypothesis (1) holds. The spacing matches the quadratic case ( $f$ is degree 2 in $k$ ). ∎

Corollary 7.4 (Fibonacci-exponent constants). Let $F_k$ denote the $k$ -th Fibonacci number and $\varphi=(1+\sqrt{5})/2$ the golden ratio. For $b,c\ge 2$ with $\gcd(b,c)=1$ and $\varphi<c$ , the constant $\xi=\sum_{k\ge 1}1/(c^k b^{F_k})$ satisfies $p(\xi,b,n)\ge \varphi^{n/\alpha-C}$ for a constant $C=C(b,c)$ .

Proof. Take $f(k)=F_k$ , $g(k)=k$ . Then $f$ and $g$ are strictly increasing, $L_K=F_{K+1}-F_K=F_{K-1}\sim \varphi^{K-1}/\sqrt{5}$ (exponential growth at rate $\varphi$ ). As in Corollary 7.2, $\log L_K \sim K\log\varphi$ is not $o(g(K))$ , so hypothesis (1) is verified via multiplicative orders: since $\varphi<c$ , the argument of (8) (with $d$ replaced by $\varphi$ ) gives $\operatorname{ord}_{c^K}(b)$ exceeds $L_K$ for large $K$ . The geometric sum gives $p(n)\ge \varphi^{B_n}\ge \varphi^{n/\alpha-C}$ . ∎
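The epoch lengths $L_K=f(K+1)-f(K)$ quoted in Corollaries 7.1 to 7.4 can be recomputed directly from the exponent functions. The following sketch does this for the four families; the Fibonacci indexing $F_0=0$, $F_1=1$ is an assumption (the text does not fix it), and the helper names are illustrative.

```python
def epoch_length(f, K: int) -> int:
    """L_K = f(K+1) - f(K) for an exponent function f."""
    return f(K + 1) - f(K)

def fib(k: int) -> int:
    """k-th Fibonacci number with the assumed indexing F_0 = 0, F_1 = 1."""
    a, b = 0, 1
    for _ in range(k):
        a, b = b, a + b
    return a

def cube(k: int) -> int:
    return k ** 3                    # polynomial exponents, r = 3

def power_of_three(k: int) -> int:
    return 3 ** k                    # exponential exponents, d = 3

def triangular(k: int) -> int:
    return k * (k + 1) // 2          # Tschakaloff exponents

for K in range(1, 8):
    print(K,
          epoch_length(cube, K),             # (K+1)^3 - K^3
          epoch_length(power_of_three, K),   # (d-1) d^K with d = 3
          epoch_length(triangular, K),       # K + 1
          epoch_length(fib, K))              # F_{K-1}
```

The printed columns make the contrast visible: the triangular epochs grow only linearly, while the exponential and Fibonacci epochs grow geometrically, which is why hypothesis (1) has to be verified through multiplicative orders in those two cases.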

Corollary 7.5 (Signed coefficients). Theorem 1.5 holds with $a_k\in\{+1,-1\}$ . In particular, it covers values of the Rogers false theta function

$F(x,y)\;=\;\sum_{k\ge 0}(-1)^k x^{k(k+1)/2}y^k$

at $x=1/b$ , $y=1/c$ : the constant $\xi=\sum_{k\ge 0}(-1)^k/(c^k b^{k(k+1)/2})$ satisfies $p(\xi,b,n)-n\to+\infty$ .

Proof. Since $\gcd(\pm 1,c)=1$ for all $c\ge 2$ , the coprimality hypothesis $\gcd(a_k,c)=1$ holds. The proof of Theorem 1.5 uses $\gcd(P_n,c)=1$ via $P_n\equiv a_K b^{n-f(K)}\pmod{c}$ ; this requires only $\gcd(a_K,c)=1$ and $\gcd(b,c)=1$ . ∎

Remark 7.6. Corollary 7.3 is, to our knowledge, the first digit complexity result for Tschakaloff function values. Corollary 7.4 gives a natural family where the exponent function $f$ grows at a non-integer exponential rate $\varphi$ .

8. The exponential case: twisted Mahler constants

For $d\ge 2$ , $b\ge 2$ , $c\ge 2$ with $\gcd(b,c)=1$ , define

$M_{d,b,c} \;:=\; \sum_{k\ge 0}\frac{1}{c^k\, b^{\,d^k}}.$

These are values of a twisted Mahler function $\sum y^k x^{d^k}$ at $x=1/b$ , $y=1/c$ (the factor $y^k = c^{-k}$ twists the classical Mahler series $\sum x^{d^k}$ , introducing coprime denominators that fundamentally change the digit structure).

Remark 8.1 (Failure of prior methods). When $d\ge 3$ and $c>d^2$ , both previously available approaches to proving $p(n)-n\to\infty$ are provably inapplicable to $M_{d,b,c}$ .

Diophantine method. Bugeaud and Kim [BK19] show that if $\mu(\xi)<\mu_0$ , where $\mu_0=(25+4\sqrt{10})/15\approx 2.510$ , then the initial repetition index of the base- $b$ expansion of $\xi$ exceeds $\sqrt{10}-3/2$ , which is incompatible with quasi-Sturmian structure and forces $p(n)-n\to\infty$ . Their argument requires the irrationality exponent to be strictly below $\mu_0$ in order for the first link of the Adamczewski--Bugeaud chain [AB07] to produce the needed repetition bound. For $M_{d,b,c}$ with $d\ge 3$ one has $\mu(M_{d,b,c})\ge d\ge 3>\mu_0$ (see below), so this chain breaks at the outset.

Hot spot method. Bailey--Crandall [BC02] prove normality of $M_{d,b,c}$ when $d>\sqrt{c}$ (Corollary 4.9(iv) of [BC02]): their Theorem 4.8 requires $\mu_k/c^{\gamma n_k}$ to be nondecreasing for some $\gamma>1/2$ , and for $M_{d,b,c}$ one has $\mu_k\sim d^k$ , $n_k=k$ , so the condition holds iff $d>c^{\gamma}$ for some $\gamma>1/2$ , i.e. iff $d>\sqrt{c}$ . For $M_{d,b,c}$ , epoch $K$ has length $L_K=(d-1)d^K$ and the relevant lattice denominator is $c^K$ . When $c>d^2$ , one has $d\le\sqrt{c}$ , so the Bailey--Crandall condition fails.

Lower bound $\mu(M_{d,b,c})\ge d$ . The partial sum $S_N=\sum_{k=0}^{N-1}1/(c^k b^{d^k})$ has denominator dividing $q_N:=c^{N-1}b^{d^{N-1}}$ . The tail satisfies $|M_{d,b,c}-S_N|\le 2/(c^N b^{d^N})$ . For any $w<d$ , one has $q_N^{w}=c^{w(N-1)}b^{wd^{N-1}}$ , while $1/|M_{d,b,c}-S_N|\ge c^N b^{d^N}/2$ . Since $d^N-wd^{N-1}=(d-w)d^{N-1}$ grows exponentially in $N$ whereas $\log_b c^{w(N-1)}$ grows only linearly, it follows that $|M_{d,b,c}-S_N|<1/q_N^w$ for all large $N$ , whence $\mu\ge d$ . See Badziahin [Bad19] and Rajchert [Raj24] for the matching upper bound $\mu(M_{d,b,c})=d$ .
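The quality of the rational approximations $S_N$ can be examined with exact arithmetic. The sketch below computes the ratio $\log(1/|M_{d,b,c}-S_N|)/\log q_N$, with $M_{d,b,c}$ replaced by a longer partial sum; the parameters $d=3$, $b=2$, $c=11$ (so that $c>d^2$ and $\gcd(b,c)=1$) are a hypothetical example, and the output is a numerical illustration rather than a proof of the lower bound.

```python
from fractions import Fraction
from math import log

def partial_sum(d: int, b: int, c: int, terms: int) -> Fraction:
    """S_N = sum_{k=0}^{N-1} 1 / (c^k * b^(d^k)) as an exact fraction, N = terms."""
    return sum((Fraction(1, c**k * b**(d**k)) for k in range(terms)), Fraction(0))

def approximation_exponents(d: int, b: int, c: int, n_max: int):
    """For N = 2..n_max return log(1/|M - S_N|) / log(q_N),
    with M replaced by the longer partial sum S_{n_max + 2}."""
    reference = partial_sum(d, b, c, n_max + 2)
    ratios = []
    for n in range(2, n_max + 1):
        q_n = c**(n - 1) * b**(d**(n - 1))
        err = reference - partial_sum(d, b, c, n)
        # Take logs of the big integers separately to avoid float underflow.
        log_inv_err = log(err.denominator) - log(err.numerator)
        ratios.append(log_inv_err / log(q_n))
    return ratios

ratios = approximation_exponents(d=3, b=2, c=11, n_max=6)
for n, r in zip(range(2, 7), ratios):
    print(n, round(r, 3))
```

In this example the ratios stay below $d=3$ and approach it slowly, because the factor $c^{N}$ contributes a lower-order term; by $N=6$ the ratio already exceeds the Bugeaud--Kim threshold $2.510$.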

Theorem 1.3 establishes $p(n)-n\to\infty$ for all $d\ge 2$ , including the regime $d\ge 3$ , $c>d^2$ where both methods above are provably inapplicable. When $d<c$ , Theorem 1.4 strengthens this to the exponential bound $p(M_{d,b,c},b,n)\ge d^{n/\alpha-C}$ , where $\alpha=\log_b c$ and $C=C(d,b,c)$ . Theorems 1.3 and 1.4 therefore provide, to our knowledge, the first exponential complexity bound for constants where both the Diophantine method and the hot spot method are provably inapplicable. The lower bound $d^{n/\alpha}$ is genuinely exponential in $n$ (specifically $d^{n/\alpha} = b^{n \log_b d / \log_b c}$ , which equals $b^{n/2}$ when $d = \sqrt{c}$ ); for comparison, the best quantitative bounds from the Bugeaud--Kim method yield only $\limsup\, p(n)/n \ge 4/3$ [BKK25].

Proof of Theorem 1.3. Epoch structure. Epoch $K$ covers digit positions $[d^K, d^{K+1})$ , with length $L_K=(d-1)d^K$ . For $n$ in epoch $K$ , the orbit point decomposes as $z_n = P_n/c^K + T_n$ with $\gcd(P_n,c)=1$ and $T_n \le 2/(c^{K+1}b^{e_n})$ , where $e_n=d^{K+1}-n$ . To verify: $b^n M_{d,b,c} = \sum_{k\ge 0} c^{-k}b^{n-d^k}$ ; the $k=0$ term $b^{n-1}$ is an integer, so $z_n = \bigl\{\sum_{k\ge 1} c^{-k}b^{n-d^k}\bigr\}$ . The finite part is $Q_n/c^K$ with $Q_n = \sum_{k=1}^{K} c^{K-k}\, b^{n-d^k}$ . Set $P_n \equiv Q_n \bmod c^K$ ; then $P_n \equiv b^{n-d^{K}} \pmod{c}$ (the $k=K$ term; all earlier terms carry a factor of $c$ ), so $\gcd(P_n,c)=1$ since $\gcd(b,c)=1$ . The tail satisfies $T_n = \sum_{k\ge K+1} c^{-k} b^{n-d^k} \le c^{-(K+1)}b^{-e_n}\sum_{i\ge 0} b^{-i} \le 2/(c^{K+1}b^{e_n})$ .

The within-epoch match bound $M(n,s)\le K\alpha+C_0$ holds for any $1\le s\le L_K$ with $v_c(b^s-1)<K$ , by the same valuation argument as Theorem 4.1. The key identities transfer directly: since both $n$ and $n+s$ lie in epoch $K$ , the tail sums $T_n=\sum_{k\ge K+1}c^{-k}b^{n-d^k}$ and $T_{n+s}=\sum_{k\ge K+1}c^{-k}b^{n+s-d^k}$ involve the same future epochs $k\ge K+1$ , so $T_{n+s}=b^s T_n$ . The finite-part congruence $P_{n+s}\equiv b^s P_n\pmod{c^K}$ likewise follows from $Q_{n+s}=b^s Q_n$ . The hypothesis $v_c(b^s-1)<K$ then gives $v_c(\Delta)=v_c(b^s-1)$ , and the argument concludes as in Theorem 4.1.

Irrationality. The partial sums $S_N=\sum_{k=0}^{N-1}c^{-k}b^{-d^k}$ have denominator $q_N=c^{N-1}b^{d^{N-1}}$ and satisfy $|M_{d,b,c}-S_N|\le 2/(c^N b^{d^N})$ , while $q_{N+1}/q_N=cb^{d^N-d^{N-1}}\to\infty$ ; the same rational-approximation argument as Lemma 3.2 gives $M_{d,b,c}\notin\mathbb{Q}$ .

Contradiction. Assume $\liminf(p(n)-n)<\infty$ . By Theorem 2.3, the digit sequence is eventually quasi-Sturmian: $\mathbf{x}=W\varphi(\mathbf{s})$ with Sturmian slope $\beta$ , convergents $q_k$ , and non-erasing morphism $\varphi$ with $l_{\min}\ge 1$ , $l_{\max}\ge 1$ .

By Lemma 2.4, for large $k$ there exists $n_k$ with $\operatorname{Match}$ . Set $\ell_k:=\lfloor q_{k+1}/16 \rfloor$ . The Sturmian factor $U_k:=s_{n_k+1}\cdots s_{n_k+\ell_k}$ has length $\ell_k$ ; the factor $U_k$ also appears starting at position $n_k+q_k+1$ (by the match of length $\ge\ell_k$ at gap $q_k$ ).

Target epoch. Choose $K_*=\max(\lceil\log_d(C_1 q_{k+1})\rceil,\; v_0+1+\lceil\log_c(C_2 q_{k+1})\rceil)$ where $C_1=16l_{\max}/(d-1)$ , $C_2=8l_{\max}$ , $d_0=\operatorname{ord}_c(b)$ , $v_0=v_c(b^{d_0}-1)$ . Then $K_*=O(\log q_{k+1})$ , the epoch length satisfies $(d-1)d^{K_*}\ge 16l_{\max}q_{k+1}$ , and $\operatorname{ord}_{c^{K_*}}(b)\ge 8l_{\max}q_{k+1}$ . The order bound follows from the lifting-the-exponent lemma applied to each prime power $p^a\,\|\,c$ , combined via the Chinese remainder theorem: $\operatorname{ord}_{c^{K_*}}(b)\ge c^{K_*-v_0}/d_0\ge C_2 q_{k+1}=8l_{\max}q_{k+1}$ by the choice of $K_*$ (using $d_0<c$ ); see (8) for the detailed calculation.

Two occurrences in one epoch. The first quarter of epoch $K_*$ spans $(d-1)d^{K_*}/4$ digit positions. Since each Sturmian letter maps to at most $l_{\max}$ digits under $\varphi$ , this corresponds to at least $(d-1)d^{K_*}/(4l_{\max})\ge 4q_{k+1}$ Sturmian positions. Since $R(\ell_k)\le 2q_{k+1}$ (since $\ell_k < q_{k+1}$ implies $\ell_k \in [q_j, q_{j+1})$ for some $j \le k$ , giving $R(\ell_k) \le q_{j+1} + q_j \le 2q_{k+1}$ by Lemma 2.5), splitting the window into two halves of $\ge 2q_{k+1}$ positions yields two occurrences $T_0<T_1$ of $U_k$ . Both occurrences plus the factor stay inside epoch $K_*$ . Indeed, $T_1$ lies in the first quarter of epoch $K_*$ , so $F(T_1)\le d^{K_*}+L_{K_*}/4=d^{K_*}+(d-1)d^{K_*}/4=(d+3)d^{K_*}/4$ . The factor length satisfies $l_{\max}\ell_k\le l_{\max}q_{k+1}\le (d-1)d^{K_*}/16$ by the choice of $K_*$ . Summing,

$F(T_1)+l_{\max}\ell_k \;\le\; \frac{(d+3)}{4}\,d^{K_*}+\frac{(d-1)}{16}\,d^{K_*} \;=\; \frac{4(d+3)+(d-1)}{16}\,d^{K_*} \;=\; \frac{(5d+11)}{16}\,d^{K_*}.$

Since $5d+11<16d\iff 11<11d\iff d>1$ , which holds for $d\ge 2$ , we get $F(T_1)+l_{\max}\ell_k<d^{K_*+1}$ .

Digit match and gap control. The identical factors at $T_0$ and $T_1$ produce a digit match of length $L_d\ge l_{\min}\ell_k\ge l_{\min}q_{k+1}/32$ at gap $g=F(T_1)-F(T_0)$ . Since $g\le 4l_{\max}q_{k+1}+O(1)$ while $\operatorname{ord}_{c^{K_*}}(b)\ge 8l_{\max}q_{k+1}$ by the choice of $K_*$ , we have $g<\operatorname{ord}_{c^{K_*}}(b)$ once $q_{k+1}$ exceeds a constant depending only on $b,c,d,l_{\max}$ . Hence $v_c(b^g-1)<K_*$ , and the within-epoch bound gives

$\frac{l_{\min}q_{k+1}}{32} \;\le\; K_*\alpha+C_0 \;=\; O(\log q_{k+1}).$

This contradicts $q_{k+1}\to\infty$ . ∎

Under the additional hypothesis $d<c$ we strengthen this to exponential growth (Theorem 1.4).

Remark 8.2. The condition $d<c$ is essential: it guarantees $\operatorname{ord}_{c^K}(b)>L_K$ for all large $K$ (since $(d/c)^K\to 0$ ), so that $v_c(b^s-1)<K$ for every gap $1\le s\le L_K$ within epoch $K$ . When $d\ge c$ , the multiplicative order can fall below the epoch length, creating gaps where the within-epoch match bound fails. In the $\mu$ -bypass regime $d\ge 3$ , $c>d^2$ , the hypothesis $d<c$ is automatically satisfied. When $d = 2$ , one has $\mu(M_{2,b,c}) = 2 < \mu_0$ , so the Bugeaud--Kim method applies, yielding $p(n)-n \to \infty$ by [BK19]; the contribution of Theorem 1.4 in this case is the explicit exponential growth rate, which the Diophantine method does not provide.

Proof of Theorem 1.4. Let $C_0$ be the constant from the within-epoch match bound (as in the proof of Theorem 1.3), and set $C_*:=C_0+\log_b 2+2$ .

Epoch parameters. Epoch $K$ covers digit positions $[d^K,d^{K+1})$ with length $L_K=(d-1)d^K$ . Write $d_0:=\operatorname{ord}_c(b)$ and $v_0:=v_c(b^{d_0}-1)$ . Since $d<c$ , the ratio $(d-1)d^K d_0/c^{K-v_0}=d_0(d-1)(d/c)^K c^{v_0}\to 0$ , so for all $K\ge K_0$ (determined by $b,c,d$ via $d_0$ , $v_0$ , and the inequality $c^{K-v_0}/d_0>(d-1)d^K$ ),

$\operatorname{ord}_{c^K}(b)\ge \frac{c^{K-v_0}}{d_0}>(d-1)d^K=L_K. \tag{8}$

To see this: for each prime power $p^a\,\|\,c$ with $d_p:=\operatorname{ord}_p(b)$ and $w_p:=v_p(b^{d_p}-1)$ , the lifting-the-exponent lemma gives $\operatorname{ord}_{p^{aK}}(b)=d_p\cdot p^{a(K-1)-w_p+a}$ for $K$ large enough that $aK>w_p$ . Since $\gcd(b,c)=1$ , the Chinese remainder theorem gives $\operatorname{ord}_{c^K}(b)=\operatorname{lcm}_{p^a\|c}\operatorname{ord}_{p^{aK}}(b)\ge c^{K-v_0}/d_0$ , where $v_0$ and $d_0$ absorb the finitely many prime-dependent constants. In particular, $v_c(b^s-1)<K$ for every $1\le s\le L_K$ .
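For small parameters the order condition can be verified by brute force. The sketch below is an illustration under stated assumptions: it takes $d=3$, $b=2$, $c=11$ as a hypothetical example, and it reads $v_c(x)$ as the largest $v$ with $c^v\mid x$, which is assumed to agree with the valuation of Section 2.4.

```python
def multiplicative_order(b: int, m: int) -> int:
    """Smallest t >= 1 with b^t = 1 (mod m); assumes m >= 2 and gcd(b, m) = 1."""
    t, power = 1, b % m
    while power != 1:
        power = (power * b) % m
        t += 1
    return t

def valuation(c: int, x: int) -> int:
    """Largest v with c^v dividing x (assumed reading of v_c); requires x != 0."""
    v = 0
    while x % c == 0:
        x //= c
        v += 1
    return v

def check_epoch(d: int, b: int, c: int, K: int) -> bool:
    """True if v_c(b^s - 1) < K for every gap 1 <= s <= L_K = (d-1) * d^K."""
    L_K = (d - 1) * d**K
    return all(valuation(c, b**s - 1) < K for s in range(1, L_K + 1))

d, b, c = 3, 2, 11
for K in range(1, 5):
    order = multiplicative_order(b, c**K)
    print(K, order, (d - 1) * d**K, check_epoch(d, b, c, K))
```

Here $\operatorname{ord}_{11}(2)=10$ and $v_{11}(2^{10}-1)=1$, so the orders are $10\cdot 11^{K-1}$ and exceed $L_K=2\cdot 3^K$ for each $K$ tested.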

Usable epoch range. For a given factor length $n$ , define

$K_{\min}:=\left\lceil\log_d\!\left(\frac{n}{d-1}\right)\right\rceil, \qquad K_{\max}:=\left\lfloor\frac{n-C_*}{\alpha}\right\rfloor.$

If $K\ge K_{\min}$ , then $L_K=(d-1)d^K\ge n$ , so any length- $n$ factor starting at position $N$ with $e_N:=d^{K+1}-N\ge n$ is fully contained in epoch $K$ . If $K\le K_{\max}$ , then $K\alpha+C_0<n-1<n$ , so the within-epoch match bound forces any two positions in epoch $K$ at gap $1\le s\le L_K$ to produce distinct length- $n$ factors. Since $K_{\min}=O(\log n)$ and $K_{\max}\sim n/\alpha$ , the range $[K_{\min},K_{\max}]$ is nonempty for all $n\ge n_1$ , where $n_1$ depends on $b,c,d$ (via $K_0$ , $C_*$ , and $\alpha$ ).

Internal positions. For each $K\in[K_{\min},K_{\max}]$ , the set

$\mathcal{N}_K(n):=\{N\in\mathbb{Z}: d^K\le N\le d^{K+1}-n\}$

has cardinality $m_K=(d-1)d^K-n+1$ . For $K\ge K_{\min}+1$ one has $L_K\ge dn\ge 2n$ , so $m_K\ge L_K/2$ .

Same-epoch distinctness. Let $N_1<N_2\in\mathcal{N}_K(n)$ with $s:=N_2-N_1$ . Then $1\le s\le L_K$ . By (8), $v_c(b^s-1)<K$ , so the within-epoch bound gives $M(N_1,s)\le K\alpha+C_0<n$ . Hence the two length- $n$ factors differ.

Cross-epoch distinctness. Let $N_1\in\mathcal{N}_{K_1}(n)$ and $N_2\in\mathcal{N}_{K_2}(n)$ with $K_1<K_2\le K_{\max}$ . Both positions satisfy $e_{N_i}\ge n$ and $K_2\alpha+1\le n-C_*+1<n$ . The epoch decomposition (established above) gives $z_{N_i}=P_i/c^{K_i}+T_i$ with $\gcd(P_i,c)=1$ and $T_i\le 2/(c^{K_i+1}b^{e_{N_i}})$ , so the hypotheses of Lemma 6.1 hold with $C_{\mathrm{tail}}=2$ (and $4\cdot 2=8<bc^2$ since $\gcd(b,c)=1$ forces $bc^2\ge 12$ ). The lemma gives

$M(N_1,N_2-N_1)\le K_2\alpha+\log_b 2<n.$

Hence cross-epoch factors are also distinct.
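The separation $|z_{N_1}-z_{N_2}|\ge 1/(2c^{K_2})$ supplied by Lemma 6.1 can be observed with exact rational arithmetic. The sketch below uses the hypothetical example $d=2$, $b=2$, $c=3$, epochs $K_1=4<K_2=5$ and $n=9$ (so that $n\ge\alpha K_2+1\approx 8.92$), and replaces $M_{d,b,c}$ by a nine-term partial sum, whose orbit points differ from the true ones by far less than the gap being tested.

```python
from fractions import Fraction

def mahler_partial(d: int, b: int, c: int, terms: int) -> Fraction:
    """Partial sum of M_{d,b,c} over k = 0..terms-1, as an exact fraction."""
    return sum((Fraction(1, c**k * b**(d**k)) for k in range(terms)), Fraction(0))

def orbit_point(x: Fraction, b: int, n: int) -> Fraction:
    """Fractional part of b^n * x for x >= 0."""
    y = x * b**n
    return y - (y.numerator // y.denominator)

def min_cross_epoch_gap(d, b, c, K1, K2, n, terms=9) -> Fraction:
    """Smallest |z_{N1} - z_{N2}| over internal positions of epochs K1 < K2."""
    x = mahler_partial(d, b, c, terms)   # truncated series standing in for M_{d,b,c}
    # Internal positions: d^K <= N <= d^(K+1) - n, so a length-n factor stays in epoch K.
    pos1 = range(d**K1, d**(K1 + 1) - n + 1)
    pos2 = range(d**K2, d**(K2 + 1) - n + 1)
    return min(abs(orbit_point(x, b, n1) - orbit_point(x, b, n2))
               for n1 in pos1 for n2 in pos2)

gap = min_cross_epoch_gap(d=2, b=2, c=3, K1=4, K2=5, n=9)
print(float(gap), float(Fraction(1, 2 * 3**5)))
```

The first printed value is the observed minimal gap and the second is the threshold $1/(2c^{K_2})$ from the lemma.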

Counting. All factors from $\bigcup_{K=K_{\min}}^{K_{\max}}\mathcal{N}_K(n)$ are pairwise distinct, so

$p(M_{d,b,c},b,n) \;\ge\; \sum_{K=K_{\min}}^{K_{\max}} m_K \;\ge\; \sum_{K=K_{\min}+1}^{K_{\max}}\frac{(d-1)d^K}{2}.$

The geometric sum is dominated by its last term:

$\sum_{K=K_{\min}+1}^{K_{\max}}\frac{(d-1)d^K}{2} \;=\; \frac{d^{K_{\max}+1}-d^{K_{\min}+1}}{2} \;\ge\; \frac{d^{K_{\max}}}{2}.$

Since $K_{\max}\ge n/\alpha-C_*/\alpha-1$ , we obtain

$p(M_{d,b,c},b,n) \;\ge\; \frac{1}{2}\,d^{\,n/\alpha-C_*/\alpha-1} \;\ge\; d^{\,n/\alpha-C}$

for all large $n$ , where $C:=C_*/\alpha + 1 + \log_d 2$ depends only on $d,b,c$ . ∎
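As an empirical companion to the theorem, one can count distinct factors in a certified prefix of the base- $b$ expansion. The sketch below is illustrative only: a finite prefix gives a lower bound for $p(M_{d,b,c},b,n)$ and says nothing about the asymptotic rate. The parameters $d=2$, $b=2$, $c=3$ and the prefix length $200$ are hypothetical choices, and the digits are certified by sandwiching $M_{d,b,c}$ between a partial sum and the same sum plus the tail bound $2/(c^N b^{d^N})$.

```python
from fractions import Fraction

def certified_digits(d: int, b: int, c: int, terms: int, length: int):
    """First `length` base-b digits of M_{d,b,c} (which lies in (0, 1)),
    or None if the truncation cannot certify them."""
    lo = sum((Fraction(1, c**k * b**(d**k)) for k in range(terms)), Fraction(0))
    hi = lo + Fraction(2, c**terms * b**(d**terms))   # lo <= M <= hi by the tail bound
    scale = b**length
    n_lo = (lo * scale).numerator // (lo * scale).denominator
    n_hi = (hi * scale).numerator // (hi * scale).denominator
    if n_lo != n_hi:
        return None                  # a carry could change the digits
    digits = []
    for _ in range(length):
        n_lo, r = divmod(n_lo, b)
        digits.append(r)
    return digits[::-1]              # most significant digit first

def factor_counts(digits, n_max: int):
    """Number of distinct length-n factors in the finite word, for n = 1..n_max."""
    return [len({tuple(digits[i:i + n]) for i in range(len(digits) - n + 1)})
            for n in range(1, n_max + 1)]

digits = certified_digits(d=2, b=2, c=3, terms=8, length=200)
counts = factor_counts(digits, n_max=10)
print(counts)
```

Each entry of `counts` is a lower bound for the corresponding value of $p(M_{2,2,3},2,n)$, since every factor of a prefix is a factor of the full expansion.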

9. Remarks and open questions

Remark 9.1 (Irrationality exponents). The arguments in this paper do not use the irrationality exponents of $\gamma_{b,c}$ and $M_{d,b,c}$ . By contrast, the method of [BK19] requires $\mu(\xi)<2.510\ldots$ ; for comparison, $\mu(M_{d,b,c})\ge d$ for all $d\ge 2$ , and the best known bound for $\pi$ is $\mu(\pi)\le 7.103\ldots$ [ZZ20].

(1) Determine the exact growth order of $p(\gamma_{b,c},b,n)$ . Is the quadratic lower bound in Theorem 1.2 close to optimal when $\alpha<2$ ?

(2) Determine the irrationality exponent of $\gamma_{b,c}$ .

(3) Theorem 1.4 gives an exponential bound when $d<c$ . Determine $p(M_{d,b,c},b,n)$ when $d \ge c$ , and sharpen the base of the exponential.

(4) Investigate digit-distribution properties of $\gamma_{b,c}$ and $M_{d,b,c}$ , for example normality in base $b$ .

Remark 9.2 (Scope of the method). The epoch-expansion technique requires a lacunary series representation whose base and coefficient denominators are coprime, producing a single-modulus lattice that controls carry propagation. It does not apply to constants such as $\pi$ , $\log 2$ , or $\sqrt{2}$ , whose digit-generating iterations have dense perturbations and no single-modulus lattice structure. Whether the ideas of the present paper can be combined with other approaches to reach non-lacunary constants remains an open problem.

References

[AB07] B. Adamczewski and Y. Bugeaud, On the complexity of algebraic numbers. I. Expansions in integer bases, Ann. of Math. (2) 165 (2007), no. 2, 547--565.
[Bad19] D. Badziahin, Continued fractions of certain Mahler functions, Adv. Math. 343 (2019), 495--514.
[BC02] D. H. Bailey and R. E. Crandall, Random generators and normal numbers, Experiment. Math. 11 (2002), no. 4, 527--546.
[Bor09] E. Borel, Les probabilités dénombrables et leurs applications arithmétiques, Rend. Circ. Mat. Palermo 27 (1909), 247--271.
[BK19] Y. Bugeaud and D. H. Kim, A new complexity function, repetitions in Sturmian words, and irrationality exponents of Sturmian numbers, Trans. Amer. Math. Soc. 371 (2019), no. 5, 3281--3308.
[BKK25] Y. Bugeaud, H. Kaneko, and D. H. Kim, On the irrationality exponent of real numbers with low complexity expansion, Preprint, 2025. arXiv:2510.17177.
[Ca97] J. Cassaigne, Sequences with grouped factors, in: S. Bozapalidis (Ed.), Developments in Language Theory III, Aristotle University of Thessaloniki, 1998, pp. 211--222.
[FM97] S. Ferenczi and C. Mauduit, Transcendence of numbers with a low complexity expansion, J. Number Theory 67 (1997), no. 2, 146--161.
[Lo02] M. Lothaire, Algebraic Combinatorics on Words, Encyclopedia of Mathematics and its Applications, vol. 90, Cambridge University Press, Cambridge, 2002.
[MH38] M. Morse and G. A. Hedlund, Symbolic dynamics, Amer. J. Math. 60 (1938), no. 4, 815--866.
[Ni96] K. Nishioka, Mahler Functions and Transcendence, Lecture Notes in Mathematics, vol. 1631, Springer-Verlag, Berlin, 1996.
[Raj24] A. Rajchert, On the irrationality exponents of Mahler numbers, Honours thesis, University of Sydney, 2021. arXiv:2411.10733.
[ZZ20] D. Zeilberger and W. Zudilin, The irrationality measure of $\pi$ is at most $7.103205334137\ldots$ , Moscow J. Combin. Number Theory 9 (2020), no. 4, 407--419.

