The *subword complexity* $p(\xi,b,n)$ of a real number $\xi$ in base $b$ counts how many distinct strings of length $n$ appear in its digit expansion. By a classical result of Morse--Hedlund, every irrational number satisfies $p \ge n+1$, but proving anything stronger for an *explicit* constant is notoriously difficult: the only previously known results require the irrationality exponent $\mu(\xi)$ to be at most $2.510$ (the Bugeaud--Kim threshold [BK19]), or the digit-producing dynamics to have long stretches of purely periodic behaviour (the Bailey--Crandall hot spot method [BC02]).
We introduce an *epoch-expansion* technique that bypasses both barriers, and use it to prove that a broad family of lacunary sums
Exponential digit complexity beyond the Bugeaud--Kim threshold
The subword complexityp(ξ,b,n) of a real number ξ in base b counts how many distinct strings of length n appear in its digit expansion. By a classical result of Morse--Hedlund, every irrational number satisfies p≥n+1, but proving anything stronger for an explicit constant is notoriously difficult: the only previously known results require the irrationality exponent μ(ξ) to be at most 2.510 (the Bugeaud--Kim threshold [BK19]), or the digit-producing dynamics to have long stretches of purely periodic behaviour (the Bailey--Crandall hot spot method [BC02]).
We introduce an epoch-expansion technique that bypasses both barriers, and use it to prove that a broad family of lacunary sums --- constants of the form ξ=∑ak/(cg(k)bf(k)) with gcd(b,c)=1 and rapidly increasing exponents f --- have much richer digit structure than irrationality alone guarantees. Concretely, the number of distinct length-n digit strings satisfies p(ξ,b,n)−n→+∞, with quantitative lower bounds that mirror the growth of the spacing f: quadratic spacing yields at least quadratically many strings, exponential spacing yields exponentially many, and so on.
Our strongest application concerns the twisted Mahler constantsMd,b,c=∑k≥01/(ckbdk). When d≥3 and c>d2, these constants have irrationality exponent μ≥d>2.510 (so Bugeaud--Kim fails) and epoch-to-lattice ratio tending to zero (so hot spots fail), yet we obtain exponential complexity: p(Md,b,c,b,n)≥dn/α−C where α=logbc. Further applications include partial theta function values, Tschakaloff function values, Fibonacci-exponent constants, and Rogers false theta function values. To our knowledge, these are the first complexity results for explicit constants beyond both known barriers.
1. Introduction
Motivation: measuring the complexity of digit expansions
Write a real number ξ in an integer base b≥2:
ξ=0.x1x2x3⋯,xn∈{0,1,…,b−1}.
One of the simplest questions one can ask about this expansion is: how many distinct patterns appear? The subword complexity functionp(ξ,b,n) counts the number of distinct blocks of n consecutive digits xi+1xi+2⋯xi+n that occur as i ranges over all positions. For a rational number, the digit expansion is eventually periodic, so p(n) is bounded. For an irrational number, a classical theorem of Morse and Hedlund [MH38] gives p(n)≥n+1 for every n, and this bound is sharp: it is attained exactly by the Sturmian words, a family of aperiodic binary sequences that arise by coding an irrational rotation of the circle (see Lothaire [Lo02] for a comprehensive account). At the other extreme, a number is normal in base b (in the sense of Borel [Bor09]) if every block of digits appears with the expected frequency b−n; this forces p(n)=bn, the maximum possible value. Almost every real number is normal in every base, but proving normality---or even any growth of p(n) beyond n+1---for a specific constant such as π, e, or 2 is extraordinarily difficult.
What is known
There are essentially two prior techniques for proving that the digits of a given constant are more complex than the Morse--Hedlund minimum.
The Diophantine method. Ferenczi and Mauduit [FM97] first connected low subword complexity to transcendence: they showed that a number whose base-b expansion satisfies p(n)=n+1 for all n (a Sturmian number) is necessarily transcendental. Adamczewski and Bugeaud [AB07], using the deep Schmidt Subspace Theorem from Diophantine approximation, proved a much stronger result: the digit expansion of every algebraic irrational number ξ satisfies p(ξ,b,n)/n→∞ for every base b. For transcendental constants, the key parameter governing what can be proved is the irrationality exponent
μ(ξ)=sup{μ:∣ξ−p/q∣<q−μ for infinitely many p/q∈Q},
which measures how well ξ can be approximated by rationals. Bugeaud and Kim [BK19] showed that if μ(ξ)<μ0:=(25+410)/15≈2.510, then p(ξ,b,n)−n→+∞. The reasoning is: a small irrationality exponent forces the digit expansion to avoid long near-repetitions; this is incompatible with the rigid self-similar structure of quasi-Sturmian words (sequences with p(n)≤n+C for a constant C, characterized by Cassaigne [Ca97] as the image of a Sturmian word under a morphism); and the resulting structural contradiction yields complexity growth. The threshold μ0=2.510… is sharp: Bugeaud and Kim construct Sturmian numbers with μ=μ0 and p(n)=n+1 for all n.
Prior to the present work, this method was the only known technique for proving p(n)−n→∞ for explicit transcendental constants. It covers Euler's number e and its relatives e1/m, tanh(1/m), and certain Bessel function quotients---all of which have μ=2, well below the threshold.
The hot spot method. Bailey and Crandall [BC02] developed a completely different approach that proves a much stronger conclusion---full normality---for a special class of constants. The Stoneham numbersαb,c=∑k≥11/(ckbck), where b and c are coprime, have the property that successive nonzero terms are exponentially far apart. Between two consecutive terms, the digit-generating iteration zn={b⋅zn−1} (fractional part of b times the previous orbit point) evolves without perturbation, giving long "clean" stretches during which the orbit is algebraically controlled. Bailey and Crandall show that no short subinterval of [0,1) is visited disproportionately often by this orbit (a "hot spot"), and this implies normality. The crucial structural requirement is that the epoch length (the number of unperturbed steps between consecutive terms of the series) is at least proportional to the lattice denominator of the orbit during that epoch.
The gap
Both methods leave large classes of constants untouched. The Diophantine method requires μ(ξ)<2.510, yet for most transcendental constants the irrationality exponent is either unknown or too large: for π, the best bound is μ(π)≤7.103 (Zeilberger--Zudilin [ZZ20]), far above the threshold. The hot spot method requires the epoch-to-lattice ratio to remain bounded below by a positive constant, and this fails for series whose terms are more densely packed than in the Stoneham case.
Between these two regimes sits a large family of lacunary constants---sums whose nonzero terms are located at positions that grow faster than linearly but not exponentially (such as quadratic or polynomial positions)---for which no technique existed for proving anything about digit complexity beyond the trivial bound p(n)≥n+1.
Our contribution: the epoch-expansion method
In this paper we introduce the epoch-expansion method, a technique for proving lower bounds on subword complexity that requires no information about the irrationality exponent and is orthogonal to the hot spot approach.
The key observation is elementary. Consider a constant of the form ξ=∑ak/(cg(k)bf(k)), where b and c are coprime and the exponents f(k) grow with increasing gaps f(k+1)−f(k)→∞. The digit expansion of ξ is naturally partitioned into epochs: long runs of digit positions between the locations f(k) and f(k+1) of successive terms. Inside each epoch, the orbit point zn={bnξ}---whose base-b digits determine the digit expansion of ξ from position n onward---decomposes as a rational number with denominator cK plus a negligibly small tail. The coprimality gcd(b,c)=1 places the numerator of this rational part on a c-adic lattice, and a key arithmetic observation controls how long two orbit points can produce identical digits: if positions n and n+s lie in the same epoch, they share at most vc(bs−1)⋅logbc+O(1) leading digits, where vc denotes the c-adic valuation. This bound grows only logarithmically in the epoch index, while the epochs themselves grow without bound. Eventually the epochs are long enough to contain more pairwise distinguishable starting positions than any quasi-Sturmian word can accommodate, and the complexity p(n) is forced to exceed n+C for every constant C.
Results
We present our results in order of increasing generality, developing the technique first for the partial theta values
γb,c:=k≥1∑ckbk21,b≥2,c≥2,gcd(b,c)=1,
which serve as the motivating case. These constants are values of a partial Jacobi theta function (related to the q-series Θ+(q,z)=∑k≥0qk2zk); they escape both prior methods, since the epoch-to-lattice ratio tends to zero (defeating hot spots) and no bound on μ(γb,c) is available (so the Diophantine method cannot be applied).
Theorem 1.1 (Qualitative complexity).For all coprime integers b≥2 and c≥2,
p(γb,c,b,n)−n⟶+∞(n→∞).
Theorem 1.2 (Quadratic lower bound).If α=logbc<2, then there exists n0=n0(b,c) such that
p(γb,c,b,n)≥16α2(2−α)2n2for all n≥n0.
(We write α=logbc throughout the paper.) For a more striking application, we turn to the twisted Mahler constants. For integers d≥2, b≥2, c≥2 with gcd(b,c)=1, define
Md,b,c:=k≥0∑ckbdk1.
These arise from the Mahler function h(x)=∑k≥0xdk, which satisfies h(x)=x+h(xd); see Nishioka [Ni96] for background on Mahler functions in transcendence theory. When d≥3 and c>d2, these constants lie beyond the reach of both prior approaches simultaneously. On the Diophantine side, μ(Md,b,c)=d (proved via the formal continued fraction of the associated Mahler function; see Badziahin [Bad19] and Rajchert [Raj24]), so μ≥3>2.510 and the Bugeaud--Kim threshold is exceeded. On the hot spot side, Bailey--Crandall [BC02] prove normality of Md,b,c only when d>c; when c>d2 this condition fails. Nevertheless:
Theorem 1.3 (μ-bypass).For all d≥2, b≥2, c≥2 with gcd(b,c)=1,
p(Md,b,c,b,n)−n⟶+∞(n→∞).
When d<c, the exponential growth of the epoch lengths LK=(d−1)dK allows a much stronger, quantitative conclusion:
Theorem 1.4 (Exponential lower bound).For all d≥2, b≥2, c≥2 with d<c and gcd(b,c)=1, there exists a constant C=C(d,b,c) such that
p(Md,b,c,b,n)≥dn/α−C
for all sufficiently large n.
For comparison, the strongest quantitative result available from the Bugeaud--Kim method is limsupp(n)/n≥4/3 [BKK25], which applies only to constants satisfying a stringent irrationality exponent condition. Theorem 1.4 gives exponential growth in a regime where that method is provably inapplicable (see Remark 8.1).
All of the above are special cases of a general theorem. For integers b≥2, c≥2 with gcd(b,c)=1, let f:N→N be strictly increasing with LK:=f(K+1)−f(K)→∞, let g:N→N be strictly increasing with g(K)→∞ and g(K)=o(f(K)), and let (ak)k≥1 be integers with gcd(ak,c)=1 for all k. Assume further that
vc(bs−1)<g(K)for all 1≤s≤LK and all large K.(1)
(A sufficient condition is logLK=o(g(K)), since vc(bs−1)=O(logs). When f grows exponentially, (1) can instead be verified via multiplicative order estimates; see Corollary 7.2 and (8).) Define
ξ=k≥1∑cg(k)bf(k)ak.
Theorem 1.5 (General epoch-expansion theorem).Under the hypotheses above, p(ξ,b,n)−n→+∞. Moreover, there exist constants C0=C0(b,c) and K0=K0(b,c,f,g) such that for all sufficiently large n,
p(ξ,b,n)≥K=An∑Bn(LK−n+1),
where An=min{K≥K0:LK≥n} and Bn=max{K:g(K)α+C0<n}. When Bn<An (which can happen for slowly growing f with α≥2), the qualitative conclusion p(ξ,b,n)−n→+∞ still holds via the Cassaigne argument of Theorem 1.1, applied with g(K) in place of K.
The quantitative bound mirrors the spacing function: quadratic spacing (f(k)=k2) gives p(n)≥c0n2; exponential spacing (f(k)=dk) gives p(n)≥dn/α−C. Theorems 1.1, 1.2, and 1.4 are corollaries of Theorem 1.5 (see Section 7); Theorem 1.3 requires an independent argument given in Section 8.
Proof architecture
We present the proofs for the partial theta values γb,c independently of the general theorem, because they introduce the key ideas in a concrete setting. The architecture of the general proof is as follows.
All results rest on the epoch-expansion framework, whose applicability is determined by the structural properties of ξ=∑ak/(cg(k)bf(k)): coprime denominator (gcd(b,c)=1, giving a c-adic lattice structure for the orbit); increasing gaps (LK→∞, creating epochs of growing length); single-modulus lattice (all terms share the modulus c, making vc(bs−1) the sole obstruction to digit agreement); and valuation control (condition (1)).
Given these properties, the proof operates in two modes. For the qualitative conclusion (p(n)−n→∞), we argue by contradiction: if p(n)−n were bounded, Cassaigne's theorem [Ca97] would force the digit sequence to be quasi-Sturmian, producing long self-matches at certain gaps. We place such a matched pair inside a single large epoch and show that the within-epoch valuation bound limits the match length to O(Kα), contradicting the Sturmian match length ≫qk+1 since K=O(logqk+1). For the quantitative conclusion, we count distinct length-n subwords directly: each epoch of length LK≥n contributes ∼LK starting positions whose length-n subwords are pairwise distinct (by the valuation bound within epochs, and by the lattice gap across epochs), and summing over eligible epochs yields the lower bound.
Further applications
Beyond partial theta values and twisted Mahler constants, Theorem 1.5 applies uniformly to every lacunary constant satisfying the hypotheses above:
(a) Tschakaloff function valuesTb(1/c)=∑k≥11/(ckbk(k+1)/2) (f(k)=k(k+1)/2, LK=K+1; quadratic bound).
(b) Fibonacci-exponent constants∑k≥11/(ckbFk) (f(k)=Fk, LK=FK−1; exponential bound p(n)≥φn/α−C when α is sufficiently small).
(c) Signed coefficientsak∈{±1} (gcd(±1,c)=1 always); this covers values of Rogers false theta functions at rational arguments.
Organization
Section 2 collects notation and the Sturmian input. Section 3 gives the epoch decomposition of the orbit {bnγb,c}. Section 4 proves the within-epoch and cross-epoch match bounds. Theorems 1.1 and 1.2 are proved in Sections 5 and 6. Section 7 states and proves the general epoch-expansion theorem (Theorem 1.5) and derives applications to Tschakaloff function values, Fibonacci-exponent constants, and signed-coefficient series. Section 8 treats twisted Mahler constants, proving Theorems 1.3 and 1.4; this includes the exponential bound in the μ-bypass regime d≥3, c>d2 where both prior methods are provably inapplicable. Section 9 discusses open questions.
2. Preliminaries
Throughout the paper, b≥2 and c≥2 are fixed coprime integers, and
α=logbc>0.
2.1. Digits, orbit points, and complexity
For x∈R, we write ⌊x⌋ for the floor and {x}=x−⌊x⌋ for the fractional part.
We consider
γb,c=k≥1∑ckbk21,
and its orbit under multiplication by b:
zn:={bnγb,c},n≥0.
The base-b digit sequence of γb,c is then
xn=⌊bzn−1⌋,n≥1,
so that
γb,c=n≥1∑xnb−n.
Definition 2.1. For n≥1, the subword complexity of γb,c in base b is
p(γb,c,b,n)=#{xi+1xi+2⋯xi+n:i≥0}.
When the parameters are clear, we abbreviate this to p(n).
2.2. Match length
Definition 2.2. Let w=(wn)n≥1 be an infinite word over a finite alphabet. For n≥1 and g≥1, the match length at position n with gap g is
{\mathbf{w}}(n,g) = \max\bigl{L\ge 0 : w{n+j}=w_{n+g+j}\ \text{for all }0\le j<L\bigr}.Matchw(n,g)=max{L≥0:wn+j=wn+g+jfor all 0≤j<L}.
For the digit sequence {n\ge 1}x=(xn)n≥1 of {b,c}γb,c, we write
M(n,g):=Matchx(n+1,g),n≥0,g≥1.
Equivalently, M(n,g) is the largest L≥0 such that the first L base-b digits of zn and zn+g coincide.
2.3. Square epochs
For K≥1, define epoch K to be the interval of positions
[K2,(K+1)2).
Its length is
LK=(K+1)2−K2=2K+1.
If n lies in epoch K, we write
en:=(K+1)2−n,1≤en≤2K+1,
for the distance from n to the next epoch boundary.
2.4. c-adic valuation
Let
c=p1a1⋯prar
be the prime factorization of c. For a nonzero integer m, define
vc(m):=1≤i≤rmin⌊aivpi(m)⌋,
so that vc(m) is the largest k≥0 such that ck∣m. We set vc(0)=+∞.
2.5. Cassaigne's theorem
A Sturmian word is an aperiodic binary word of minimal complexity, namely p(n)=n+1 for all n≥1. A word is quasi-Sturmian if its complexity is bounded above by n+C for some constant C.
We use the following theorem of Cassaigne.
Theorem 2.3 (Cassaigne [Ca97]).For a non-eventually-periodic infinite word u over a finite alphabet, the following are equivalent:
(i) liminfn→∞(pu(n)−n)<∞;
(ii) u is eventually quasi-Sturmian, i.e. there exist a finite word W, a Sturmian word s over {0,1}, and a morphism
φ:{0,1}→A∗
such that
u=Wφ(s).
2.6. Two Sturmian lemmas
The first lemma is the source of long local self-matches in a Sturmian word. Here Matchs(n,g) denotes the match length of Section 2 applied to the Sturmian word s.
Lemma 2.4 (Strong Sturmian match).Let s be a Sturmian word of slope β, and let qk be the denominators of the convergents of β. For all sufficiently large k, there exists
Proof. Let εk=∥qkβ∥. By continued-fraction theory,
qk+1+qk1<εk<qk+11,
hence εk>1/(2qk+1) for large k.
A mismatch at shift qk among the first m symbols can occur only if the starting point of the coding orbit falls within distance εk of one of the m discontinuity preimages. Thus the set of bad starting points is covered by at most m intervals of total length at most 2mεk.
Among the qk+1 points {jβ}, 0≤j<qk+1, the minimum spacing is at least εk>1/(2qk+1). Therefore the number of bad starting points among these qk+1 orbit points is at most
2mεk⋅2qk+1+m≤5m.
Take m=⌊qk+1/16⌋. Then at least
qk+1−5m≥1611qk+1
starting points have match length at least m. In particular, one such point lies in [qk+1/2,qk+1). ∎
The second lemma is a standard recurrence property of Sturmian words.
Lemma 2.5 (Recurrence of Sturmian factors).Let s be a Sturmian word with convergent denominators qk. For ℓ∈[qk,qk+1), the recurrence function satisfies
R(ℓ)≤qk+1+qk−1.
In particular, every factor of length ℓ occurs in every block of length qk+1+qk.
Proof. See [Lo02, Ch. 2, Prop. 2.2.22]. ∎
3. Epoch decomposition and irrationality
We begin with the basic decomposition of zn inside a square epoch.
Lemma 3.1 (Epoch decomposition).Let n lie in epoch K, so that K2≤n<(K+1)2. Then
Let Pn be the residue of Qn modulo cK in [0,cK). By (3) below and en≥1, 0<Tn<1/cK. Since 0≤Pn<cK, we have Pn/cK+Tn<(cK−1)/cK+1/cK=1. Taking fractional parts therefore yields
zn=cKPn+Tn.
Modulo c, every term in Qn with j<K vanishes, while the term j=K equals bn−K2. Thus
Pn≡Qn≡bn−K2(modc),
and since gcd(b,c)=1, we also have gcd(Pn,c)=1.
For the tail, write j=K+t with t≥1. Then
Tn=t≥1∑cK+tb(K+t)2−n1=t≥1∑cK+tbun,t1.
Since
un,t=en+(t−1)(2K+t+1)≥en+(t−1)(2K+3),
we get
Tn≤cK+1ben1t≥0∑(cb2K+31)t.
As cb2K+3≥16, the geometric series is bounded by 16/15, which proves (3).
Hence DK+1≤2qDK, which is impossible for large K because
DKDK+1=cb2K+1→∞.
If instead SK=p/q for all large K, then SK+1>SK, again impossible. ∎
4. Match bounds
4.1. A basic comparison principle
If two numbers in [0,1) have the same first L base-b digits, then they belong to the same b-adic interval of length b−L. Therefore, whenever zn=zn+s,
M(n,s)≤logb(1/∣zn−zn+s∣).(5)
4.2. Within one epoch
Theorem 4.1 (Within-epoch match bound).There exist constants Kwe=Kwe(b,c) and Cwe=Cwe(b,c) such that the following holds.
Let K≥Kwe, and let n,n+s lie in the same epoch K with 1≤s≤2K+1. Then
M(n,s)≤Kα+Cwe.
Proof. Let e=en=(K+1)2−n. Since n+s also lies in epoch K, we have
en+s=e−s≥1,hencee≥s+1.
By Lemma 3.1,
zn=cKPn+Tn,zn+s=cKPn+s+Tn+s,
with 0≤Pn,Pn+s<cK. Set
Δ:=Pn−Pn+s.
Then
zn−zn+s=cKΔ+(Tn−Tn+s).(6)
To understand Δ, define
Qm:=j=1∑KcK−jbm−j2∈Z(m=n,n+s).
Then Qm≡Pm(modcK), and
Qn+s=bsQn.
Hence
Pn+s≡bsPn(modcK),
so
Δ≡(1−bs)Pn(modcK).(7)
Set
ν:=vc(bs−1).
Since 1≤s≤2K+1, one has ν=O(logK). Indeed, if p∣c is a fixed prime factor of c and d0=ordp(b), then the standard p-adic valuation formula vp(bs−1)=vp(bd0−1)+vp(s/d0) (when d0∣s; zero otherwise) gives
vp(bs−1)≤Ap+logps
for a constant Ap=Ap(b). As cν∣(bs−1) implies pν∣(bs−1), we get
ν≤vp(bs−1)≤Ap+logp(2K+1)=O(logK).
Thus ν<K for all K≥Kwe.
Now (7) and gcd(Pn,c)=1 imply
vc(Δ)=vc(bs−1)=ν,
because divisibility by cr with r<K is preserved under congruence modulo cK. In particular, Δ=0, and therefore
Here E:=cK1+1be1(R1−R2). The numerator of the rational part is congruent modulo c to −be1−e2, hence is not divisible by c. Therefore the rational part is at distance at least c−δ from Z.
We now estimate the error. From (**),
∣E∣≤1516cb2K1+31+1516cδ+1be2+2K2+3−e11.
Since e1≤2K1+1 and e2≥1,
e2+2K2+3−e1≥1+2(K1+δ)+3−(2K1+1)=2δ+3≥5,
so the second term is at most (16/15)c−(δ+1)b−5.
For the first term, the hypothesis δ≤D(K1) gives αδ≤2K1+2, hence cδ=bαδ≤b2K1+2. Therefore
1516cb2K1+31≤1516cbcδ1=15cb16c−δ≤154c−δ.
Also
1516cδ+1b51≤151c−δ.
Thus ∣E∣≤31c−δ, and hence
∣zN−zN+s∣≥32cK2+1be11.
Case 2: e1<e2 and δ≥2. Again multiply by cK1+1be1:
cK1+1be1(zN−zN+s)=cδ−1Dbe1+1+E,
where
E=−cδbe2−e11+cK1+1be1(R1−R2).
Because c∤D and gcd(b,c)=1, the rational term Dbe1/cδ−1 has exact denominator cδ−1, so
Remark 4.3. If α<2, then D(K1)=K1+2 for all large K1, so the hypothesis δ≤D(K1) is automatic from s≤3N. For general α, the qualitative proof below uses the recurrence of Sturmian factors to place the repeated pattern so that δ≤D(K1) still holds. The restriction α<2 is needed only for the quantitative Theorem 1.2.
5. Proof of Theorem 1.1
Proof. Assume for contradiction that
n→∞liminf(p(γb,c,b,n)−n)<∞.
By Lemma 3.2, the number γb,c is irrational. Hence its digit sequence is not eventually periodic, and by Theorem 2.3 it is eventually quasi-Sturmian:
x=Wφ(s),
where W is a finite word, s is a Sturmian word of slope β, and
φ:{0,1}→{0,1,…,b−1}∗
is a morphism. Since γb,c is irrational, φ must be non-erasing: if ∣φ(i)∣=0 for some i∈{0,1}, then pφ(s)(n)≤∣φ(1−i)∣ for all n≥∣φ(1−i)∣, contradicting p(n)≥n+1 from irrationality.
Write
l0:=∣φ(0)∣,l1:=∣φ(1)∣,lmin:=min(l0,l1)≥1,
and let
λ:=(1−β)l0+βl1.
If F(m) denotes the number of digits in the prefix Wφ(s1⋯sm), then the Sturmian balance property implies
F(m)=λm+O(1).
Consequently, there exists a constant B≥1 such that for all m,q≥0,
∣F(m)−λm∣≤B,∣F(m+q)−F(m)−λq∣≤2B.(9)
Let β=[0;a1,a2,…] be the continued-fraction expansion of the Sturmian slope, and let qk denote the convergent denominators. We distinguish two cases according to whether the partial quotients ak are bounded.
Case A: supkak=∞. There exist infinitely many k with ak+1≥⌈2α⌉. For such k,
qk+1qk≤ak+11≤2α1.
Lemma 2.4 gives nk∈[qk+1/2,qk+1) with {\mathbf{s}}(n_k!+!1, q_k) \ge q{k+1}/16Matchs(nk+1,qk)≥qk+1/16.
Set Nk:=F(nk) and Gk:=F(nk+qk)−F(nk). By (9),
Nk≤λqk+1+B,Gk≤λqk+2B.
Since nk≥qk+1/2, also Nk≥λqk+1/2−B. Hence for large k,
NkGk≤λqk+1/2−Bλqk+2B≤(2+ε)qk+1qk≤2α2+ε
for any ε>0 and all sufficiently large k. When α≥2, this gives Gk/Nk≤(1+ε)/α for small ε>0, which suffices for the epoch-gap bound below. When α<2, the ratio Gk/Nk<1/α is not needed: since Gk≤λqk+2B≤λqk+1+2B≤2(λqk+1/2+B)≤2(Nk+2B)≤3Nk for k sufficiently large, and Nk+Gk≤4Nk<4(K1,k+1)2, we get dk≤K1,k+2=D(K1,k) directly.
The morphic transfer gives a digit match of length Lk≥lmin⌊qk+1/16⌋. Using the upper bound Nk≤2λqk+1 (for large k): Lk≥lminqk+1/32≥c1Nk where c1:=lmin/(64λ)>0.
Case B: supkak=M<∞. All partial quotients satisfy ak≤M. Then the Sturmian word s is linearly recurrent: for every factor of length ℓ, R(ℓ)≤(M+2)ℓ (since R(ℓ)≤qj+1+qj for ℓ∈[qj,qj+1), and qj+1/qj≤aj+1+1≤M+1).
Fix
A:=2α(M+2)+M+2.
By Lemma 2.4, for large k there exists nk∈[qk+1/2,qk+1) with {\mathbf{s}}(n_k!+!1, q_k) \ge q{k+1}/16Matchs(nk+1,qk)≥qk+1/16. Define mk:=⌊qk+1/16⌋ (truncating the match to a controlled length). The factor Uk:=snk+1⋯snk+qk+mk has length ℓk=qk+mk≤qk+qk+1≤(M+2)qk (using qk+1≤(M+1)qk). By linear recurrence, R(ℓk)≤(M+2)ℓk≤(M+2)2qk. So there exists an occurrence of Uk starting at some
rk∈[Aqk+1,Aqk+1+(M+2)2qk].
Set Nk:=F(rk) and Gk:=F(rk+qk)−F(rk). Since rk≤Aqk+1+(M+2)2qk≤(A+(M+2)2)qk+1:
Nk≤λ(A+(M+2)2)qk+1+B.
Also Nk≥λAqk+1−B and Gk≤λqk+2B, so for large k,
NkGk≤λAqk+1−Bλqk+2B≤A2<α1,
since A>2α and qk≤qk+1. Also Gk≤3Nk. Using the upper bound on Nk: Lk≥lminmk≥lminqk+1/32≥c2Nk where c2:=lmin/(64λ(A+(M+2)2))>0.
Conclusion (both cases). In both cases, we obtain an infinite sequence of (Nk,Gk) with Gk≤3Nk, a digit match of length Lk≥c0Nk for a fixed constant c0>0 (independent of k), and additionally: in Case A with α≥2 and in Case B, we have Gk≤(1+ε)Nk/α for small ε>0.
Let K1,k and K2,k be the epoch indices of Nk and Nk+Gk, and set dk:=K2,k−K1,k. If dk=0 (same epoch), Theorem 4.1 gives M(Nk,Gk)≤K1,kα+Cwe=O(Nk). If dk≥1, we verify dk≤D(K1,k). Since Gk≤3Nk, we have Nk+Gk≤4Nk<4(K1,k+1)2, so K2,k<2(K1,k+1) and
dk≤K1,k+1≤K1,k+2.
In Case A with α<2: D(K1,k)=K1,k+2, so the above gives dk≤D(K1,k) directly.
In Case A with α≥2 and in Case B: since Nk+Gk lies in epoch K2,k,
Gk≥K2,k2−(K1,k+1)2+1=2K1,k(dk−1)+dk2,
hence dk≤1+Gk/(2K1,k). Since Gk≤(1+ε)Nk/α<(1+ε)(K1,k+1)2/α,
dk≤1+2αK1,k(1+ε)(K1,k+1)2≤α2K1,k+2
for ε small and k large. Thus dk≤D(K1,k).
In all sub-cases, Proposition 4.2 gives M(Nk,Gk)≤CNk.
In either case,
c0Nk≤Lk≤M(Nk,Gk)≤CNk,
so Nk≤(C/c0)2. This contradicts Nk→∞.
The contradiction proves Theorem 1.1. ∎
6. Proof of Theorem 1.2
For the quantitative theorem we count factors that are completely contained in their starting epochs. This avoids the local restriction in Proposition 4.2.
Lemma 6.1 (Cross-epoch separation for internal factors).Let n≥1, and let N1<N2 lie in epochs K1<K2 with K1≥1. Suppose that the orbit points decompose as zNi=Pi/cKi+Ti with gcd(Pi,c)=1 and
Ti≤cKi+1beNiCtail(i=1,2)
for some constant Ctail≥1 satisfying 4Ctail<bc2. Assume
eN1≥n,eN2≥n,
and also
n≥αK2+1.
Then
∣zN1−zN2∣≥2cK21.
Consequently,
M(N1,N2−N1)≤K2α+logb2.
Proof. Set
δ:=K2−K1≥1,D:=cδP1−P2.
Then c∤D because c∣cδP1 and c∤P2. Hence D=0 and
cK2D≥cK21.
Also
zN1−zN2=cK2D+(T1−T2).
By the tail hypothesis and eNi≥n,
Ti≤cKi+1bnCtail(i=1,2).
For T1, using δ−1≤K2−2 (since K1≥1) and n≥αK2+1,
The proofs for γb,c and Md,b,c share a common structure that applies to a wide class of lacunary constants. We now state and prove the general result announced in Theorem 1.5.
Proof of Theorem 1.5. Write α=logbc. For n in epoch K (i.e. f(K)≤n<f(K+1)), the orbit point zn={bnξ} decomposes as
zn=cg(K)Pn+Tn,(10)
where Pn≡aKbn−f(K)(modc), hence gcd(Pn,c)=1 (since gcd(aK,c)=gcd(b,c)=1), and Tn≤Ctail/(cg(K)+1bf(K+1)−n) for a constant Ctail depending only on b,c (the geometric tail bound uses g(K+1)≥g(K)+1, which follows from g strictly increasing, and f(K+2)−f(K+1)→∞). This is the direct analogue of Lemma 3.1 with cK replaced by cg(K), and k2 replaced by f(k).
Within-epoch match bound. For n,n+s in the same epoch K with 1≤s≤LK, the c-adic valuation argument of Theorem 4.1 gives
M(n,s)≤g(K)α+C0,(11)
provided vc(bs−1)<g(K), which holds for all s≤LK and all large K by hypothesis (1).
Cross-epoch separation. For N1 in epoch K1 and N2 in epoch K2>K1 with f(Ki+1)−Ni≥n, the lattice gap ∣D/cg(K2)∣≥1/cg(K2) (from c∤D exactly as in Lemma 6.1) dominates the tail difference whenever n≥g(K2)α+1.
Quantitative bound. For each epoch K, the set K(n)={N: f(K)\le N\le f(K!+!1)-n}NK(n)={N:f(K)≤N≤f(K+1)−n} has mK=LK−n+1 elements whenever LK≥n. All length-n factors from starting positions in {K=A_n}^{B_n}\mathcal{N}K(n)⋃K=AnBnNK(n) are pairwise distinct: same-epoch pairs are separated by (11) (using g(K)α+C0<n for K≤Bn), and cross-epoch pairs by the lattice gap. Hence {K=A_n}^{B_n}(L_K-n+1)p(ξ,b,n)≥∑K=AnBn(LK−n+1).
Qualitative statement. When the quantitative range is nonempty (i.e. Bn≥An for all large n), the divergence of the sum ∑K=AnBn(LK−n+1) gives p(ξ,b,n)−n→∞ directly. This covers all cases where α<2, and more generally whenever f grows fast enough that Bn>An eventually.
When Bn<An (which can happen for polynomial f with α≥2), we use the Cassaigne argument of Theorem 1.1, with g(K) replacing K. Assume for contradiction that liminf(p(n)−n)<∞. By Cassaigne's theorem, the digit sequence is eventually quasi-Sturmian. The within-epoch match bound (11) gives M(n,s)≤g(K)α+C0 for all gaps s within epoch K. Since g(K)=o(f(K)) and LK→∞, large epochs contain ≫LK digit positions, and the quasi-Sturmian self-match of length Ω(N) (from the Sturmian convergents, placed inside a single epoch as in Section 5) gives Ω(N)≤g(K)α+C0. But g(K)α=o(N) (since N≥f(K) and g=o(f)), so Ω(N)=o(N), a contradiction. ∎
The following corollaries illustrate the scope of Theorem 1.5.
Corollary 7.1 (Polynomial exponents).Let r≥2 be an integer, b,c≥2 with gcd(b,c)=1, and α=logbc. For any integers ak with gcd(ak,c)=1, set ξ=∑k≥1ak/(ckbkr). Then:
(a) p(ξ,b,n)−n→+∞;
(b) if r=2 and α<2, then p(ξ,b,n)≥c0n2 with c0=(2−α)2/(16α2) (recovering Theorem 1.2);
(c) for r≥3 and any α>0, there exists cr>0 such that p(ξ,b,n)≥crnr.
Proof. Take f(k)=kr, g(k)=k. Then f and g are strictly increasing, LK=(K+1)r−Kr∼rKr−1, and g(K)=K. Since logLK=O(logK)=o(K)=o(g(K)), hypothesis (1) is satisfied. In the quantitative bound, An∼(n/r)1/(r−1) and Bn∼n/α, so ∑K=AnBn(LK−n+1)≥c∑K∼n/αrKr−1≥crnr. For r=2 this specializes to the quadratic bound of Theorem 1.2. For r≥3, Bn∼n/α regardless of whether α<2, so no restriction on α is needed. When r=2 and α≥2, the quantitative range is empty (Bn<An), but the qualitative conclusion p(n)−n→∞ follows from the Cassaigne argument in the proof of Theorem 1.5. ∎
Corollary 7.2 (Exponential exponents).Let d≥2, b,c≥2 with gcd(b,c)=1 and d<c. Then Md,b,c=∑k≥01/(ckbdk) satisfies p(Md,b,c,b,n)≥dn/α−C (recovering Theorem 1.4).
Proof. Take f(k)=dk, g(k)=k. Then f and g are strictly increasing, LK=(d−1)dK, and g(K)=K. Here logLK∼Klogd, so the sufficient condition logLK=o(g(K)) does not hold. Instead, hypothesis (1) is verified directly: since d<c, the multiplicative order estimate (8) gives ordcK(b)>LK for large K, hence vc(bs−1)<K=g(K) for all 1≤s≤LK. The quantitative bound then gives An∼logdn, Bn∼n/α, and the geometric sum yields p(n)≥dBn/2≥dn/α−C. ∎
Corollary 7.3 (Tschakaloff function values).For b,c≥2 with gcd(b,c)=1, the Tschakaloff value
Tb(1/c)=k≥1∑ckbk(k+1)/21
satisfies p(Tb(1/c),b,n)−n→+∞. If α=logbc<2, then p(Tb(1/c),b,n)≥c0n2.
Proof. Take f(k)=k(k+1)/2, g(k)=k. Then f and g are strictly increasing, LK=K+1, and logLK=O(logK)=o(K)=o(g(K)), so hypothesis (1) holds. The spacing matches the quadratic case (f is degree 2 in k). ∎
Corollary 7.4 (Fibonacci-exponent constants).Let Fk denote the k-th Fibonacci number and φ=(1+5)/2 the golden ratio. For b,c≥2 with gcd(b,c)=1 and φ<c, the constant ξ=∑k≥11/(ckbFk) satisfies p(ξ,b,n)≥φn/α−C for a constant C=C(b,c).
Proof. Take f(k)=Fk, g(k)=k. Then f and g are strictly increasing, LK=FK+1−FK=FK−1∼φK−1/5 (exponential growth at rate φ). As in Corollary 7.2, logLK∼Klogφ is not o(g(K)), so hypothesis (1) is verified via multiplicative orders: since φ<c, the argument of (8) (with d replaced by φ) gives ordcK(b) exceeds LK for large K. The geometric sum gives p(n)≥φBn≥φn/α−C. ∎
Corollary 7.5 (Signed coefficients).Theorem 1.5 holds with ak∈{+1,−1}. In particular, it covers values of the Rogers false theta function
F(x,y)=k≥0∑(−1)kxk(k+1)/2yk
at x=1/b, y=1/c: the constant ξ=∑k≥0(−1)k/(ckbk(k+1)/2) satisfies p(ξ,b,n)−n→+∞.
Proof. Since gcd(±1,c)=1 for all c≥2, the coprimality hypothesis gcd(ak,c)=1 holds. The proof of Theorem 1.5 uses gcd(Pn,c)=1 via Pn≡aKbn−f(K)(modc); this requires only gcd(aK,c)=1 and gcd(b,c)=1. ∎
Remark 7.6. Corollary 7.3 is, to our knowledge, the first digit complexity result for Tschakaloff function values. Corollary 7.4 gives a natural family where the exponent function f grows at a non-integer exponential rate φ.
8. The exponential case: twisted Mahler constants
For d≥2, b≥2, c≥2 with gcd(b,c)=1, define
Md,b,c:=k≥0∑ckbdk1.
These are values of a twisted Mahler function∑ykxdk at x=1/b, y=1/c (the factor yk=c−k twists the classical Mahler series ∑xdk, introducing coprime denominators that fundamentally change the digit structure).
Remark 8.1 (Failure of prior methods). When d≥3 and c>d2, both previously available approaches to proving p(n)−n→∞ are provably inapplicable to Md,b,c.
Diophantine method. Bugeaud and Kim [BK19] show that if μ(ξ)<μ0, where μ0=(25+410)/15≈2.510, then the initial repetition index of the base-b expansion of ξ exceeds 10−3/2, which is incompatible with quasi-Sturmian structure and forces p(n)−n→∞. Their argument requires the irrationality exponent to be strictly below μ0 in order for the first link of the Adamczewski--Bugeaud chain [AB07] to produce the needed repetition bound. For Md,b,c with d≥3 one has μ(Md,b,c)≥d≥3>μ0 (see below), so this chain breaks at the outset.
Hot spot method. Bailey--Crandall [BC02] prove normality of Md,b,c when d>c (Corollary 4.9(iv) of [BC02]): their Theorem 4.8 requires μk/cγnk to be nondecreasing for some γ>1/2, and for Md,b,c one has μk∼dk, nk=k, so the condition holds iff d>cγ for some γ>1/2, i.e. iff d>c. For Md,b,c, epoch K has length LK=(d−1)dK and the relevant lattice denominator is cK. When c>d2, one has d≤c, so the Bailey--Crandall condition fails.
Lower bound μ(Md,b,c)≥d. The partial sum SN=∑k=0N−11/(ckbdk) has denominator dividing qN:=cN−1bdN−1. The tail satisfies ∣Md,b,c−SN∣≤2/(cNbdN). For any w<d, one has qNw−1=O(b(w−1)dN−1), while 1/∣Md,b,c−SN∣=Ω(bdN). Since (w−1)dN−1<dN for w<d, it follows that ∣Md,b,c−SN∣<1/qNw for all large N, whence μ≥d. See Badziahin [Bad19] and Rajchert [Raj24] for the matching upper bound μ(Md,b,c)=d.
Theorem 1.3 establishes p(n)−n→∞ for all d≥2, including the regime d≥3, c>d2 where both methods above are provably inapplicable. When d<c, Theorem 1.4 strengthens this to the exponential bound p(Md,b,c,b,n)≥dn/α−C, where α=logbc and C=C(d,b,c). Theorems 1.3 and 1.4 therefore provide, to our knowledge, the first exponential complexity bound for constants where both the Diophantine method and the hot spot method are provably inapplicable. The exponential growth rate dn/α is exponential in n (specifically dn/α=bnlogbd/logbc, which equals bn/2 when d=c); for comparison, the best quantitative bounds from the Bugeaud--Kim method yield only limsupp(n)/n≥4/3 [BKK25].
Proof of Theorem 1.3.Epoch structure. Epoch K covers digit positions [dK,dK+1), with length LK=(d−1)dK. For n in epoch K, the orbit point decomposes as zn=Pn/cK+Tn with gcd(Pn,c)=1 and Tn≤2/(cK+1ben), where en=dK+1−n. To verify: bnMd,b,c=∑k≥0c−kbn−dk; the k=0 term bn−1 is an integer, so zn={∑k≥1c−kbn−dk}. The finite part is Qn/cK with Qn=∑k=1KcK−kbn−dk. Set Pn≡QnmodcK; then Pn≡bn−dK(modc) (the k=K term; all earlier terms carry a factor of c), so gcd(Pn,c)=1 since gcd(b,c)=1. The tail satisfies Tn=∑k≥K+1c−kbn−dk≤c−(K+1)b−en∑i≥0b−i≤2/(cK+1ben). The within-epoch match bound M(n,s)≤Kα+C0 holds for any 1≤s≤LK with vc(bs−1)<K, by the same valuation argument as Theorem 4.1. The key identities transfer directly: since both n and n+s lie in epoch K, the tail sums Tn=∑k≥K+1c−kbn−dk and Tn+s=∑k≥K+1c−kbn+s−dk involve the same future epochs k≥K+1, so Tn+s=bsTn. The finite-part congruence Pn+s≡bsPn(modcK) likewise follows from Qn+s=bsQn. The hypothesis vc(bs−1)<K then gives vc(Δ)=vc(bs−1), and the argument concludes as in Theorem 4.1.
Irrationality. The partial sums SN=∑k=0N−1c−kb−dk have denominator qN=cN−1bdN−1 and satisfy ∣Md,b,c−SN∣≤2/(cNbdN), while qN+1/qN=cbdN−dN−1→∞; the same rational-approximation argument as Lemma 3.2 gives Md,b,c∈/Q.
Contradiction. Assume liminf(p(n)−n)<∞. By Theorem 2.3, the digit sequence is eventually quasi-Sturmian: x=Wφ(s) with Sturmian slope β, convergents qk, and non-erasing morphism φ with lmin≥1, lmax≥1.
By Lemma 2.4, for large k there exists nk with {\mathbf{s}}(n_k!+!1,q_k)\ge q{k+1}/16Matchs(nk+1,qk)≥qk+1/16. Set ℓk:=⌊qk+1/16⌋. The Sturmian factor Uk:=snk+1⋯snk+ℓk has length ℓk; the factor Uk also appears starting at position nk+qk+1 (by the match of length ≥ℓk at gap qk).
Target epoch. Choose K∗=max(⌈logd(C1qk+1)⌉,v0+1+⌈logc(C2qk+1)⌉) where C1=16lmax/(d−1), C2=8lmax/d0, c(b)d0=ordc(b), v0=vc(bd0−1). Then =O(\log q_{k+1})K∗=O(logqk+1), the epoch length satisfies }\ge 16l_{\max}q_{k+1}(d−1)dK∗≥16lmaxqk+1, and {c^{K}}(b)\ge 8l_{\max}q_{k+1}ordcK∗(b)≥8lmaxqk+1. The order bound follows from the lifting-the-exponent lemma applied to each prime power pa∥c, combined via the Chinese remainder theorem: {c^{K}}(b)\ge c^{K_*-v_0}/d_0\ge C_2 q_{k+1}=8l_{\max}q_{k+1}ordcK∗(b)≥cK∗−v0/d0≥C2qk+1=8lmaxqk+1 by the choice of K∗; see (8) for the detailed calculation.
Two occurrences in one epoch. The first quarter of epoch K∗ spans (d−1)dK∗/4 digit positions. Since each Sturmian letter maps to at most lmax digits under φ, this corresponds to at least (d−1)dK∗/(4lmax)≥4qk+1 Sturmian positions. Since R(ℓk)≤2qk+1 (since ℓk<qk+1 implies ℓk∈[qj,qj+1) for some j≤k, giving R(ℓk)≤qj+1+qj≤2qk+1 by Lemma 2.5), splitting the window into two halves of ≥2qk+1 positions yields two occurrences T0<T1 of Uk. Both occurrences plus the factor stay inside epoch K∗. Indeed, T1 lies in the first quarter of epoch K∗, so F(T1)≤dK∗+LK∗/4=dK∗+(d−1)dK∗/4=(d+3)dK∗/4. The factor length satisfies lmaxℓk≤lmaxqk+1≤(d−1)dK∗/16 by the choice of K∗. Summing,
Since 5d+11<16d⟺11<11d⟺d>1, which holds for d≥2, we get F(T1)+lmaxℓk<dK∗+1.
Digit match and gap control. The identical factors at T0 and T1 produce a digit match of length Ld≥lminℓk≥lminqk+1/32 at gap g=F(T1)−F(T0). Since g≤4lmaxqk+1+O(1) while {c^{K}}(b)\ge 8l_{\max}q_{k+1}ordcK∗(b)≥8lmaxqk+1 by the choice of K∗, we have {c^{K}}(b)g<ordcK∗(b) once qk+1 exceeds a constant depending only on b,c,d,lmax. Hence vc(bg−1)<K∗, and the within-epoch bound gives
32lminqk+1≤K∗α+C0=O(logqk+1).
This contradicts qk+1→∞. ∎
Under the additional hypothesis d<c we strengthen this to exponential growth (Theorem 1.4).
Remark 8.2. The condition d<c is essential: it guarantees {c^K}(b)\ge c^{K-v_0}/d_0>L_K=(d-1)d^KordcK(b)≥cK−v0/d0>LK=(d−1)dK for all large K (since (d/c)K→0), so that vc(bs−1)<K for every gap 1≤s≤LK within epoch K. When d≥c, the multiplicative order can fall below the epoch length, creating gaps where the within-epoch match bound fails. In the μ-bypass regime d≥3, c>d2, the hypothesis d<c is automatically satisfied. When d=2, one has {2,b,c}) = 2 < \mu_0μ(M2,b,c)=2<μ0, so the Bugeaud--Kim method applies, yielding p(n)−n→∞ by [BK19]; the contribution of Theorem 1.4 in this case is the explicit exponential growth rate, which the Diophantine method does not provide.
Proof of Theorem 1.4. Let C0 be the constant from the within-epoch match bound (as in the proof of Theorem 1.3), and set C∗:=C0+logb2+2.
Epoch parameters. Epoch K covers digit positions [dK,dK+1) with length LK=(d−1)dK. Write d0:=ordc(b) and v0:=vc(bd0−1). Since d<c, the ratio (d−1)dKd0/cK−v0=d0(d−1)(d/c)Kcv0→0, so for all K≥K0 (determined by b,c,d via d0, v0, and the inequality cK−v0/d0>(d−1)dK),
ordcK(b)≥d0cK−v0>(d−1)dK=LK.(8)
To see this: for each prime power pa∥c with p(b)dp:=ordp(b) and wp:=vp(bdp−1), the lifting-the-exponent lemma gives {p^{aK}}(b)=d_p\cdot p^{a(K-1)-w_p+a}ordpaK(b)=dp⋅pa(K−1)−wp+a for K large enough that aK>wp. Since gcd(b,c)=1, the Chinese remainder theorem gives p(\operatorname{ord}{p^{aK}}(b)) \ge c^{K-v_0}/d_0ordcK(b)=lcmp(ordpaK(b))≥cK−v0/d0, where v0 and d0 absorb the finitely many prime-dependent constants. In particular, vc(bs−1)<K for every 1≤s≤LK.
Usable epoch range. For a given factor length n, define
Kmin:=⌈logd(d−1n)⌉,Kmax:=⌊αn−C∗⌋.
If K≥Kmin, then LK=(d−1)dK≥n, so any length-n factor starting at position N with eN:=dK+1−N≥n is fully contained in epoch K. If K≤Kmax, then Kα+C0<n−1<n, so the within-epoch match bound forces any two positions in epoch K at gap 1≤s≤LK to produce distinct length-n factors. Since Kmin=O(logn) and Kmax∼n/α, the range [Kmin,Kmax] is nonempty for all n≥n1, where n1 depends on b,c,d (via K0, C∗, and α).
Internal positions. For each K∈[Kmin,Kmax], the set
NK(n):={N∈Z:dK≤N≤dK+1−n}
has cardinality mK=(d−1)dK−n+1. For K≥Kmin+1 one has LK≥dn≥2n, so mK≥LK/2.
Same-epoch distinctness. Let N1<N2∈NK(n) with s:=N2−N1. Then 1≤s≤LK. By (8), vc(bs−1)<K, so the within-epoch bound gives M(N1,s)≤Kα+C0<n. Hence the two length-n factors differ.
Cross-epoch distinctness. Let {K_1}(n)N1∈NK1(n) and {K_2}(n)N2∈NK2(n) with K1<K2≤Kmax. Both positions satisfy eNi≥n and K2α+1≤n−C∗+1<n. The epoch decomposition (established above) gives zNi=Pi/cKi+Ti with gcd(Pi,c)=1 and Ti≤2/(cKi+1beNi), so the hypotheses of Lemma 6.1 hold with Ctail=2 (and 4⋅2=8<bc2 since gcd(b,c)=1 forces bc2≥12). The lemma gives
M(N1,N2−N1)≤K2α+logb2<n.
Hence cross-epoch factors are also distinct.
Counting. All factors from ⋃K=KminKmaxNK(n) are pairwise distinct, so
for all large n, where C:=C∗/α+1+logd2 depends only on d,b,c. ∎
9. Remarks and open questions
Remark 9.1 (Irrationality exponents). The arguments in this paper are completely independent of the irrationality exponents of γb,c and Md,b,c. The method of [BK19] requires μ(ξ)<2.510…; for comparison, μ(Md,b,c)≥d for all d≥2, and the best known bound for π is μ(π)≤7.103… [ZZ20]. Our proofs avoid irrationality exponents entirely.
(1) Determine the exact growth order of p(γb,c,b,n). Is the quadratic lower bound in Theorem 1.2 close to optimal when α<2?
(2) Determine the irrationality exponent of γb,c.
(3) Theorem 1.4 gives an exponential bound when d<c. Determine p(Md,b,c,b,n) when d≥c, and sharpen the base of the exponential.
(4) Investigate digit-distribution properties of γb,c and Md,b,c, for example normality in base b.
Remark 9.2 (Scope of the method). The epoch-expansion technique requires a lacunary series representation whose base and coefficient denominators are coprime, producing a single-modulus lattice that controls carry propagation. It does not apply to constants such as π, log2, or 2, whose digit-generating iterations have dense perturbations and no single-modulus lattice structure. Whether the ideas of the present paper can be combined with other approaches to reach non-lacunary constants remains an open problem.
References
[AB07] B. Adamczewski and Y. Bugeaud, On the complexity of algebraic numbers. I. Expansions in integer bases, Ann. of Math. (2) 165 (2007), no. 2, 547--565.
[Bad19] D. Badziahin, Continued fractions of certain Mahler functions, Adv. Math. 343 (2019), 495--514.
[BC02] D. H. Bailey and R. E. Crandall, Random generators and normal numbers, Experiment. Math. 11 (2002), no. 4, 527--546.
[Bor09] E. Borel, Les probabilites denombrables et leurs applications arithmetiques, Rend. Circ. Mat. Palermo 27 (1909), 247--271.
[BK19] Y. Bugeaud and D. H. Kim, A new complexity function, repetitions in Sturmian words, and irrationality exponents of Sturmian numbers, Trans. Amer. Math. Soc. 371 (2019), no. 5, 3281--3308.
[BKK25] Y. Bugeaud, H. Kaneko, and D. H. Kim, On the irrationality exponent of real numbers with low complexity expansion, Preprint, 2025. arXiv:2510.17177.
[Ca97] J. Cassaigne, Sequences with grouped factors, in: S. Bozapalidis (Ed.), Developments in Language Theory III, Aristotle University of Thessaloniki, 1998, pp. 211--222.
[FM97] S. Ferenczi and C. Mauduit, Transcendence of numbers with a low complexity expansion, J. Number Theory 67 (1997), no. 2, 146--161.
[Lo02] M. Lothaire, Algebraic Combinatorics on Words, Encyclopedia of Mathematics and its Applications, vol. 90, Cambridge University Press, Cambridge, 2002.
[MH38] M. Morse and G. A. Hedlund, Symbolic dynamics, Amer. J. Math. 60 (1938), no. 4, 815--866.
[Ni96] K. Nishioka, Mahler Functions and Transcendence, Lecture Notes in Mathematics, vol. 1631, Springer-Verlag, Berlin, 1996.
[Raj24] A. Rajchert, On the irrationality exponents of Mahler numbers, Honours thesis, University of Sydney, 2021. arXiv:2411.10733.
[ZZ20] D. Zeilberger and W. Zudilin, The irrationality measure of π is at most 7.103205334137…, Moscow J. Combin. Number Theory 9 (2020), no. 4, 407--419.
Discussion (0)
to join the discussion.
No comments yet. Be the first to discuss this paper.