1 Introduction
In this paper we prove a Bombieri–Vinogradov type theorem for general multiplicative functions supported on smooth numbers, with a fixed member of the residue class. Given a multiplicative function
$f$
, we define, whenever
$(a,q)=1$
,

We wish to prove that, for an arbitrary fixed
$A>0$
,

where, here and henceforth, “
$q\sim Q$
” denotes the set of integers
$q$
in the range
$Q<q\leqslant 2Q$
, for as large values of
$Q$
as possible. Let

for Re
$(s)>1$
. Following [Reference Granville, Harper and Soundararajan5], we restrict attention to the class
${\mathcal{C}}$
of multiplicative functions
$f$
for which

This includes most
$1$
-bounded multiplicative functions of interest, including all
$1$
-bounded completely multiplicative functions. Two key observations are that if
$f\in {\mathcal{C}}$
then each
$|f(n)|\leqslant 1$
, and if
$f\in {\mathcal{C}}$
and
$F(s)G(s)=1$
then
$g\in {\mathcal{C}}$
.
In [Reference Granville and Shao6] the last two authors showed that there are two different reasons that the sum in (1.1) might be
$\gg x/\!\log x$
. First
$f$
might be a character of small conductor (for example
$f(n)=(n/3)$
), or might “correlate” with such a character; secondly
$f$
might have been selected so that
$f(p)$
works against us for most primes
$p$
in the range
$x/2<p\leqslant x$
. We handled these potential pretentious problems as follows.
To avoid issues with the values
$f(p)$
at the large primes
$p$
we only allow
$f$
to be supported on
$y$
-smooth integersFootnote
1
for
$y=x^{\unicode[STIX]{x1D703}}$
, for some small
$\unicode[STIX]{x1D703}>0$
.
To avoid issues with the function
$f$
correlating with a given character
$\unicode[STIX]{x1D712}$
, note that this happens when

is “large” (that is,
$\gg X$
, or
$\gg X/(\log X)^{A}$
) for some
$X$
in the range
$x^{1/2}<X\leqslant x$
, in which case (1.1) might well be false. We can either assume that this is false for all
$\unicode[STIX]{x1D712}$
(which is equivalent to what is known as a “Siegel–Walfisz criterion” in the literature), or we can take account of such
$\unicode[STIX]{x1D712}$
in the “Expected Main Term”. We will begin by doing the latter, and then deduce the former as a corollary.
We start by stating the Siegel–Walfisz criterion:
The Siegel–Walfisz criterion: for any fixed
$A>0$
, we say that
$f$
satisfies the
$A$
-Siegel–Walfisz criterion if for any
$(a,q)=1$
and any
$x\geqslant 2$
we have the bound

We say that
$f$
satisfies the Siegel–Walfisz criterion if it satisfies the
$A$
-Siegel–Walfisz criterion for all
$A>0$
.
For a set of primitive characters
$\unicode[STIX]{x1D6EF}$
, let
$\unicode[STIX]{x1D6EF}_{q}$
be the set of those characters
$(\text{mod}~q)$
which are induced by the characters in
$\unicode[STIX]{x1D6EF}$
. Then denote

In [Reference Granville and Shao6] we proved the following result.
Theorem 1.1. Fix
$\unicode[STIX]{x1D6FF},B>0$
. Let
$y=x^{\unicode[STIX]{x1D700}}$
for some
$\unicode[STIX]{x1D700}>0$
sufficiently small in terms of
$\unicode[STIX]{x1D6FF}$
. Let
$f\in {\mathcal{C}}$
be a multiplicative function which is only supported on
$y$
-smooth integers. Then there exists a set,
$\unicode[STIX]{x1D6EF}$
, of primitive characters, containing
$\ll (\log x)^{6B+7+o(1)}$
elements, such that for any
$1\leqslant |a|\ll Q\leqslant x^{20/39-\unicode[STIX]{x1D6FF}}$
, we have

Moreover, if
$f$
satisfies the Siegel–Walfisz criterion then

In this article we develop Theorem 1.1 further, allowing
$Q$
as well as
$y$
to vary over a much wider range, and obtaining upper bounds in terms of (the more appropriate)
$\unicode[STIX]{x1D6F9}(x,y)$
, the number of
$y$
-smooth integers up to
$x$
.
Theorem 1.2. Fix
$\unicode[STIX]{x1D700},A>0$
. Suppose that
$f\in {\mathcal{C}}$
, and is only supported on
$y$
-smooth numbers, where

for some sufficiently small
$\unicode[STIX]{x1D6FF}>0$
. Then there exists a set,
$\unicode[STIX]{x1D6EF}$
, of primitive characters, containing
$\ll (\log x)^{6A+38}$
elements, such that if
$1\leqslant |a_{1}|,|a_{2}|\leqslant x^{\unicode[STIX]{x1D6FF}}$
then

Moreover, if
$f$
satisfies the Siegel–Walfisz criterion then

It would be interesting to extend the range (1.2) in Theorem 1.2 down to any
$y\geqslant (\log x)^{C}$
for some large constant
$C$
. We discuss the main issue that forces us to restrict the range in Theorem 1.2 to
$y>\exp ((\log x)^{1/2+o(1)})$
in Remark 4.3. In our proofs we have used the range
$y\geqslant (\log x)^{C}$
when we can, as an aid to future research on this topic, and to make clear what are the sticking points.
Fouvry and Tenenbaum [Reference Fouvry and Tenenbaum4, Théorème 2] established such a result when
$f$
is the characteristic function of the
$y$
-smooth integers (with
$y<x^{\unicode[STIX]{x1D6FF}}$
) and
$a_{2}=1$
, in the same range
$q\leqslant x^{3/5-\unicode[STIX]{x1D700}}$
, but with the bound
$\ll x/(\log x)^{A}$
. This was improved by Drappeau [Reference Drappeau2] to
$\ll \unicode[STIX]{x1D6F9}(x,y)/(\log x)^{A}$
for
$(\log x)^{C}<y\leqslant x^{\unicode[STIX]{x1D6FF}}$
.
The proof of Theorem 1.2 combines the ideas from our earlier articles [Reference Drappeau2] and [Reference Granville and Shao6]. Perhaps the most innovative feature of this article, given [Reference Drappeau2] and [Reference Granville and Shao6], comes in Theorem 5.1 in which we prove a version of the classical large sieve inequality (towards which Roth’s work [Reference Roth10] played a pivotal role) for (the notably sparse) sequences supported on the
$y$
-smooth numbers, which may be of independent interest.
2 Reduction to a larger set of exceptional moduli
We begin by modifying estimates from [Reference Drappeau2] to prove Theorem 2.1, which is a version of Theorem 1.2 with a far larger exceptional set of characters. This is key to the proof of Theorem 1.2 since we now only need to cope with relatively small moduli. We therefore define
${\mathcal{A}}(D)$
to be the set of all primitive characters of conductor
${\leqslant}D$
.
Theorem 2.1. For fixed
$\unicode[STIX]{x1D700},A>0$
, there exist
$C,\unicode[STIX]{x1D6FF}>0$
such that for any
$y$
in the range
$(\log x)^{C}<y\leqslant x^{\unicode[STIX]{x1D6FF}}$
, and any
$f\in {\mathcal{C}}$
which is only supported on
$y$
-smooth numbers, we have

for any integers
$a_{1},a_{2}$
for which
$1\leqslant |a_{1}|,|a_{2}|\leqslant x^{\unicode[STIX]{x1D6FF}}$
, with
${\mathcal{A}}={\mathcal{A}}(D)$
where
$D=(x/\unicode[STIX]{x1D6F9}(x,y))^{2}(\log x)^{2A+20}$
.
We prove this by modifying some of the estimates in [Reference Drappeau2]. For any
$D\geqslant 1$
and integer
$q\geqslant 1$
let

so that

Note that
$\,\mathfrak{u}_{D}(n;q)=0$
unless
$(n,q)=1$
and
$q>D$
, in which case

For
$(n,q)=1$
, since

by letting
$b=s/d$
we obtain the alternate expression

Theorem 2.1 is an immediate consequence of Theorem 2.2.
Theorem 2.2. For any fixed
$\unicode[STIX]{x1D700}>0$
, there exists
$C,\unicode[STIX]{x1D6FF}>0$
such that whenever

we have, uniformly for
$0<|a_{1}|,|a_{2}|\leqslant x^{\unicode[STIX]{x1D6FF}}$
and
$f\in {\mathcal{C}}$
,

To prove Theorem 2.2, we first prove the following generalization of [Reference Drappeau2, Theorem 3], where the bound
$D$
on the conductor is allowed to vary.
Lemma 2.3. Let
$M,N,L,R\geqslant 1$
and
$(\unicode[STIX]{x1D6FC}_{m})$
,
$(\unicode[STIX]{x1D6FD}_{n})$
,
$(\unicode[STIX]{x1D706}_{\ell })$
be three sequences, bounded in modulus by
$1$
, supported on integers inside
$(M,2M]$
,
$(N,2N]$
, and
$(L,2L]$
respectively. Let
$x=MNL$
. For any fixed
$\unicode[STIX]{x1D700}>0$
, there exists
$\unicode[STIX]{x1D6FF}>0$
such that whenever either the conditions (3.1), or the conditions (3.2) of [Reference Drappeau2] are met, we have

for
$1\leqslant D\leqslant x^{\unicode[STIX]{x1D6FF}}$
.
Theorem 3 of [Reference Drappeau2] is the special case when
$D$
has maximal size,
$D=x^{\unicode[STIX]{x1D6FF}}$
. The conditions (3.1) or (3.2) of [Reference Drappeau2] concern the relative sizes of
$M,N,L$
. They are rather technical, but a critical case when the conditions are met is

Proof. We follow closely the arguments of [Reference Drappeau2]. Roughly speaking, the main point is that reducing the size of
$D$
only reduces the error terms, except in a certain diagonal contribution which yields the dominant error term, and which we analyze more carefully. Proceeding as in [Reference Drappeau2, §3], we reduce to the estimation of
${\mathcal{S}}_{1}-2\,\text{Re}({\mathcal{S}}_{2})+{\mathcal{S}}_{3}$
, where
${\mathcal{S}}_{1}$
is defined in the first display of [Reference Drappeau2, pp. 838],

and

where
$u_{k}=\sum _{k=n\ell }\unicode[STIX]{x1D6FD}_{n}\unicode[STIX]{x1D706}_{\ell }$
, and
$f$
is a smooth function supported inside
$[M/2,3M]$
satisfying
$\Vert f^{(j)}{\Vert _{\infty }\ll }_{j}M^{-j}$
for any
$j\geqslant 0$
.
The quantity
${\mathcal{S}}_{1}$
being the same as in [Reference Drappeau2], we can quote the estimate

from [Reference Drappeau2, formula (3.17)], where
$K=NL$
. Here
$\hat{f}(0)=\int _{\mathbb{R}}f$
, and
$X_{1}$
is defined at [Reference Drappeau2, formula (3.12)]. For the estimation of
${\mathcal{S}}_{2}$
and
${\mathcal{S}}_{3}$
, we reproduce [Reference Drappeau2, §§3.2 and 3.3], the only difference being that the set
${\mathcal{X}}=\{\unicode[STIX]{x1D712}\text{primitive}:~\text{cond}(\unicode[STIX]{x1D712})\leqslant x^{\unicode[STIX]{x1D700}}\}$
is replaced with the subset
${\mathcal{X}}=\{\unicode[STIX]{x1D712}\text{primitive}:~\text{cond}(\unicode[STIX]{x1D712})\leqslant D\}$
. We claim that the estimates

hold for
$j\in \{2,3\}$
, with

To see this, we merely note that reducing the cardinality of
${\mathcal{X}}$
, and the bound
$D$
on the conductors, leads to better error terms in the analysis. This is clear from the bound on
$R_{3}$
in [Reference Drappeau2, formula (3.8)], which grows proportionally to
$|{\mathcal{X}}|D^{2}$
, and from the bound on
$\sum _{z\in {\mathcal{Z}}}\unicode[STIX]{x1D706}(z)|G_{2}(z,\unicode[STIX]{x1D709})|$
in [Reference Drappeau2, formula (3.10)], which grows proportionally to
$|{\mathcal{X}}|$
.
Finally we are left with evaluating
$X_{1}-2\,\text{Re}(X_{2})+X_{3}$
, which makes use of the multiplicative large sieve. Proceeding as in [Reference Drappeau2, §3.6], we find

Here we have used the boundFootnote
2
$|u_{k}|\leqslant \unicode[STIX]{x1D70F}(k)$
, and the hypothesis
$R\leqslant x^{-\unicode[STIX]{x1D700}}K\leqslant K/D$
. Following [Reference Drappeau2, formula (3.34)], this leads to the upper bound

To deduce Theorem 2.2 from Lemma 2.3, we start with the following special case of Theorem 2.2.
Proposition 2.4. Theorem 2.2 holds true for functions
$f$
supported on squarefree integers.
Proof. We extend the arguments of [Reference Drappeau2, pp. 852–853], renaming the variable
$q$
into
$r$
. Suppose first
$R\geqslant x^{4/9}$
. We restrict
$n$
and
$r$
to dyadic intervals
$x<n\leqslant 2x$
and
$R<r\leqslant 2R$
. Choosing the parameters
$(M_{0},N_{0},L_{0})$
as in [Reference Drappeau2, p. 852, last display], we obtain

where we have used the fact that
$f$
is supported on squarefree integers in the last equality. The rest of the argument consists in cutting the sums over
$(m,n,\ell )$
in dyadic segments, and analytically separating the four conditions
$x<mn\ell \leqslant 2x$
,
$P^{+}(m)<P^{-}(\ell )$
and
$P^{+}(n)<P^{-}(m)$
. The details are identical to the proof of Proposition 2 of [Reference Drappeau2], using our Lemma 2.3 instead of [Reference Drappeau2, Theorem 3]; we obtain the bound

The Bombieri–Vinogradov range
$R\leqslant x^{4/9}$
is covered by similar arguments, using [Reference Iwaniec and Kowalski9, Theorem 17.4] instead of Lemma 2.3.◻
Deduction of the full Theorem 2.2 from Proposition 2.4.
We let
${\mathcal{K}}$
be the set of powerful numbers, that is for
$k\in {\mathcal{K}}$
if prime
$p$
divides
$k$
then
$p^{2}$
also divides
$k$
. Note that
$|{\mathcal{K}}\cap [1,x]|\ll x^{1/2}$
. Out of every
$n$
counted in the left-hand side of (2.4), we extract the largest powerful divisor
$k$
. Then from the triangle inequality and the bound
$|f(k)|\leqslant 1$
, the left-hand side of (2.4) is at most

Let
$K\geqslant 1$
be a parameter. We use the trivial bound (2.2) on the contribution of
$k>K$
, getting

say, where we have separated the contribution of the two summands. Executing the sum over
$q$
first, and separating the case
$kn|a_{1}$
, we find

It is easy to see that
$T_{2}\ll x^{1+\unicode[STIX]{x1D700}}DK^{-1/2}$
as well. Next, to each
$1\leqslant k\leqslant K$
in (2.7), by hypothesis, we may use Proposition 2.4 with
$x\leftarrow x/k$
,
$a_{2}\leftarrow ka_{2}$
and
$f(n)\leftarrow \mathbf{1}_{(n,k)=1}\unicode[STIX]{x1D707}^{2}(n)f(n)$
, and obtain the existence of
$C,\unicode[STIX]{x1D6FF}_{1}>0$
such that, for
$|a_{1}|,|a_{2}k|\leqslant x^{\unicode[STIX]{x1D6FF}_{1}}$
and
$(\log x)^{C}\leqslant y\leqslant x^{\unicode[STIX]{x1D6FF}_{1}}$
,

We take
$\unicode[STIX]{x1D6FF}=\unicode[STIX]{x1D6FF}_{1}/7$
,
$K=x^{\unicode[STIX]{x1D6FF}_{1}/2}$
, and sum over
$k\leqslant K$
, using
$\sum _{k\in {\mathcal{K}}}k^{-1}<\infty$
. By hypothesis
$D\leqslant x^{\unicode[STIX]{x1D6FF}}$
, so that
$x^{4/5}\ll DK^{-1/2}x\ll D^{-1/2}x^{1-\unicode[STIX]{x1D6FF}/2}$
, and we find that (2.7) is at most
$\ll D^{-1/2}x(\log x)^{10}$
as claimed.◻
3 Altering the set of exceptional characters
To prove Theorem 1.2 we need to reduce the set of exceptional characters from
${\mathcal{A}}(D)$
to
$\unicode[STIX]{x1D6EF}$
. We shall set this up in Proposition 3.2.
It is convenient to write
$b=a_{1}/a_{2}$
(which is
$\equiv a_{1}\overline{a_{2}}\;(\text{mod}~q)$
) and to define
$(q,b)$
to mean
$(q,a_{1}a_{2})$
. Thus in Theorem 2.1 we are working with

for
$Q=x^{3/5-\unicode[STIX]{x1D700}}$
.
Lemma 3.1. Let
${\mathcal{A}}={\mathcal{A}}(D)$
for some
$D\geqslant 2$
. Suppose that
$\unicode[STIX]{x1D6EF}\subset {\mathcal{A}}$
. If
$(b,q)=1$
then

where
$g$
is the multiplicative function with
$F(s)G(s)=1$
.
Proof. If
$(b,q)=1$
then

where
${\mathcal{P}}(m)$
denotes the set of primitive characters
$(\text{mod}~m)$
, as
${\mathcal{A}}$
is the set of all primitive characters of conductor
${\leqslant}D$
. Let
${\mathcal{C}}(m)$
denote the set of all characters
$(\text{mod}~m)$
. For
$m|q$
we define

By Möbius inversion we deduce that, for
$m|q$
,

Next we wish to better understand
$\unicode[STIX]{x1D6E5}_{\unicode[STIX]{x1D6EF},q}(f,x;m,a)$
. Let
$f_{q}(p^{k})=f(p^{k})$
if
$p|q,~p\nmid m$
, and
$f_{q}(p^{k})=0$
otherwise. Define
$g_{q}$
from
$g$
in a similar way. Note
$f_{q}$
and
$g_{q}$
are simply
$f$
and
$g$
supported on the integers composed from the prime factors of
$q$
. If
$(a,m)=1$
then

and since
$F_{q}G_{q}=1$
we have

Substituting this in above then yields

and the result follows writing
$m=dn$
.◻
Proposition 3.2. Let the notations and assumptions be as in the statement of Theorem 2.1. Suppose that
$\unicode[STIX]{x1D6EF}\subset {\mathcal{A}}$
. Then

where
$X=(1+\unicode[STIX]{x1D6FD})^{2}(\log x)^{2A+6}$
with

Proof. Set
$Q=x^{3/5-\unicode[STIX]{x1D700}}$
. We deduce from Lemma 3.1, as each
$|g(\ell )|\leqslant 1$
since
$f,g\in {\mathcal{C}}$
, that

The first term on the right-hand side is
$\ll _{A}\unicode[STIX]{x1D6F9}(x,y)/(\log x)^{A}$
by Theorem 2.1. For the sum at the end we have an upper bound

where
$\unicode[STIX]{x1D70F}^{\ast }(m)$
denotes the number of squarefree divisors of
$m$
, writing
$q=dLr$
, as
$L$
is squarefree. Therefore

We will attack this last sum first by employing relatively trivial bounds for the terms with
$\ell$
that are not too small, so that we only have to consider
$\ell d$
that are smallish in further detail. Now [Reference Fouvry and Tenenbaum3, Theorem 1] gives the upper bound

provided
$x\geqslant y\geqslant \exp ((\log \log x)^{2})$
and
$q\leqslant x$
. Therefore

in this range. Substituting in, the upper bound on the
$\ell$
th term above becomes

The second sum over
$d$
is therefore

For the first term we use [Reference Harper7, Theorem 1] which yields that

as
$D\leqslant \sqrt{\unicode[STIX]{x1D6F9}(x,y)}/(\log x)^{B}$
since
$y\geqslant (\log x)^{C}$
. Therefore, expanding the sum and using (3.1), we obtain

By [Reference de la Bretèche and Tenenbaum1, Théorème 2.1] we have

where
$\unicode[STIX]{x1D6FC}>3/4$
in our range for
$y$
. Therefore in total the
$\ell$
th term in

Summing over
$\ell >X$
, we obtain, taking
$\unicode[STIX]{x1D70E}=\unicode[STIX]{x1D6FC}-1/4$
,

Taking
$X=(1+\unicode[STIX]{x1D6FD})^{2}(\log x)^{2A+6}$
, the contribution of the
$\ell >X$
is therefore
$\ll \unicode[STIX]{x1D6F9}(x,y)/(\log x)^{A}$
. Combining all of the above then yields the result.◻
4 Putting the pieces together
In order to prove Theorem 1.2 we need Theorem 2.1, Proposition 3.2, Corollary 6.1 (which will be proved in the final two sections), and the following result which is [Reference Granville and Shao6, Proposition 5.1].
Proposition 4.1. Fix
$B\geqslant 0$
and
$0<\unicode[STIX]{x1D702}<\frac{1}{2}$
. Given
$(\log x)^{4B+5}\leqslant y=x^{1/u}\leqslant x^{1/2-\unicode[STIX]{x1D702}}$
let

Suppose that
$f\in {\mathcal{C}}$
, and is only supported on
$y$
-smooth numbers. There exists a set,
$\unicode[STIX]{x1D6EF}$
, of primitive characters
$\unicode[STIX]{x1D713}\;(\text{mod}~r)$
with
$r\leqslant R$
, such that if
$q\leqslant R$
and
$(a,q)=1$
then

Moreover, one may take
$\unicode[STIX]{x1D6EF}$
to be
$\unicode[STIX]{x1D6EF}=\unicode[STIX]{x1D6EF}(2B+2\frac{1}{3})$
, where
$\unicode[STIX]{x1D6EF}(C)$
is the set of primitive characters
$\unicode[STIX]{x1D713}\;(\text{mod}~r)$
with
$r\leqslant R$
such that there exists
$x^{\unicode[STIX]{x1D702}}<X\leqslant x$
for which

Proof of Theorem 1.2.
Let
$c>0$
be the small constant from Corollary 6.1. Set
$D=(x/\unicode[STIX]{x1D6F9}(x,y))^{2}(\log x)^{2A+20}$
, and one easily verifies that the hypothesis for
$y$
implies that

from the usual estimate
$\unicode[STIX]{x1D6F9}(x,y)=xu^{-u+o(u)}$
for smooth numbers. We will prove Theorem 1.2 with
$\unicode[STIX]{x1D6EF}=\unicode[STIX]{x1D6EF}(2A+8\frac{1}{3})$
, where
$\unicode[STIX]{x1D6EF}(C)$
is the set of primitive characters
$\unicode[STIX]{x1D713}\;(\text{mod}\;r)$
with
$r\leqslant D$
, such that there exists
$x^{1/4}<X\leqslant x$
for which (4.1) holds. By Proposition 4.1 with
$B=A+3$
and
$\unicode[STIX]{x1D702}=1/4$
, we have the bound

whenever
$q\leqslant D$
and
$(a,q)=1$
. Moreover, we have the same bound with
$x$
replaced by
$x/\ell$
for any
$\ell =x^{o(1)}$
.
The goal of the next two sections will be to prove Corollary 6.1, which implies that

This implies that
$\unicode[STIX]{x1D6FD}(\unicode[STIX]{x1D6EF})\ll (\log x)^{3A+19}$
, and so
$X\ll (\log x)^{8A+44}$
in Proposition 3.2. Thus for each
$\ell \leqslant X$
and
$d\leqslant D$
, we have

by (3.2). Therefore

We therefore deduce from Proposition 3.2 that

as desired.
To deduce the second part of Theorem 1.2, about functions
$f$
satisfying the Siegel–Walfisz criterion, we use the following variant of Proposition 3.4 in [Reference Granville and Shao6].
Proposition 4.2. Fix
$\unicode[STIX]{x1D700}>0$
. Let
$(\log x)^{1+\unicode[STIX]{x1D700}}\leqslant y\leqslant x$
be large. Let
$f\in {\mathcal{C}}$
be a multiplicative function supported on
$y$
-smooth integers. Suppose that
$\unicode[STIX]{x1D6EF}$
is a set of primitive characters, containing
$\ll (\log x)^{C}$
elements, such that

for
$(a_{q},q)=1$
for all
$q\sim Q$
, where
$Q\leqslant x$
. If the
$D$
-Siegel–Walfisz criterion holds for
$f$
, where
$D\geqslant B+C$
, then

Proof. By the definition of
$\unicode[STIX]{x1D6E5}_{\unicode[STIX]{x1D6EF}}$
we have

Summing this over
$q\sim Q$
and using the hypothesis, we deduce that

It suffices to show that, for each fixed
$\unicode[STIX]{x1D713}\;(\text{mod}\;r)\in \unicode[STIX]{x1D6EF}$
with
$r>1$
, we have

The conclusion then follows since
$|\unicode[STIX]{x1D6EF}|\ll (\log x)^{C}$
and
$D\geqslant B+C$
. If
$\unicode[STIX]{x1D712}\;(\text{mod}\;q)$
is induced by
$\unicode[STIX]{x1D713}\;(\text{mod}\;r)$
, then there is a multiplicative function
$h$
supported only on powers of primes which divide
$q$
but not
$r$
, such that
$h\ast f\overline{\unicode[STIX]{x1D713}}=f\overline{\unicode[STIX]{x1D712}}$
. Note that
$h\in {\mathcal{C}}$
since
$f\in {\mathcal{C}}$
, and in particular
$h$
is
$1$
-bounded. It follows that

Since
$f$
satisfies the
$D$
-Siegel–Walfisz criterion, we have

Using the bound
$\unicode[STIX]{x1D6F9}(x/m,y)\ll m^{-\unicode[STIX]{x1D6FC}}\unicode[STIX]{x1D6F9}(x,y)$
where
$\unicode[STIX]{x1D6FC}=\unicode[STIX]{x1D6FC}(x,y)\geqslant \unicode[STIX]{x1D700}+o(1)$
, we may bound the left-hand side of (4.3) by

To analyze the inner sum over
$m$
, we break it into two pieces depending on whether
$m\leqslant x^{1/2}$
or
$m>x^{1/2}$
:

To deal with the sum over
$x^{1/2}<m\leqslant x$
, note that the number of
$m\leqslant x$
with
$p|m\Rightarrow p|q$
is maximized when
$q$
is the product of primes up to
${\sim}\log x$
, in which case the number of such
$m$
is
$x^{o(1)}$
. Thus the sum over
$x^{1/2}<m\leqslant x$
is
$O(x^{-\unicode[STIX]{x1D6FC}/2+o(1)})$
, and their overall contribution to (4.4) is acceptable. To deal with the sum over
$m\leqslant x^{1/2}$
, note that

where
$\unicode[STIX]{x1D711}_{\unicode[STIX]{x1D6FC}}(m)=\prod _{p\mid m}(p^{\unicode[STIX]{x1D6FC}}-1)$
. Thus their overall contribution to (4.4) is

since the infinite sum over
$m$
converges. This establishes (4.3) and completes the proof of the lemma.◻
To deduce the second part of Theorem 1.2, we apply Proposition 4.2 with
$a_{q}\equiv a_{1}\overline{a_{2}}\;(\text{mod}~q)$
for each dyadic interval with
$Q\leqslant x^{3/5-\unicode[STIX]{x1D700}}$
. It is applicable since we assume that the Siegel–Walfisz criterion holds for
$f$
with exponent
$D\geqslant A+(6A+38)$
. Summing up over all dyadic intervals, the second part of the theorem follows (with
$A$
replaced by
$A-1$
).◻
Remark 4.3. The lower bound required for
$y$
in the hypothesis (1.2) comes precisely from (4.2). In fact, one can still deduce Theorem 1.2 even if we just had

instead. To see this, we follow the arguments above but now use Corollary 6.2 to get

We cannot directly apply Proposition 4.2 now, but if we let
${\mathcal{E}}$
be the set of
$\unicode[STIX]{x1D713}\;(\text{mod}\;r_{\unicode[STIX]{x1D713}})\;\in \unicode[STIX]{x1D6EF}$
with
$r_{\unicode[STIX]{x1D713}}>(\log x)^{14A+78}$
, then removing
${\mathcal{E}}$
from
$\unicode[STIX]{x1D6EF}$
induces an error of at most

As
$|S_{f}(x,\unicode[STIX]{x1D712})|\ll \unicode[STIX]{x1D6F9}_{q}(x,y)\ll (\unicode[STIX]{x1D711}(q)/q)\unicode[STIX]{x1D6F9}(x,y)$
the above is

since

Thus
$|\unicode[STIX]{x1D6EF}\setminus {\mathcal{E}}|\leqslant (\log x)^{28A+156}$
, and we may apply Proposition 4.2 with
$\unicode[STIX]{x1D6EF}$
replaced by
$\unicode[STIX]{x1D6EF}\setminus {\mathcal{E}}$
to conclude the proof.
Thus if one is able to prove Proposition 4.1 with
$R=x^{c}$
in the full range

then one can deduce Theorem 1.2 with (1.2) replaced by (4.5), for some
$C$
sufficiently large in terms of
$c$
.
5 A large sieve inequality supported on smooth numbers
In this section we prove a large sieve inequality for sequences supported on smooth numbers, a result which may be of independent interest.
Theorem 5.1 (Large sieve for smooth numbers).
There exists
$C,c>0$
such that the following statement holds. Let
$(\log x)^{C}\leqslant y\leqslant x$
be large, and

For any sequence
$\{a_{n}\}$
we have

The upper bound is sharp up to a constant, as may be seen by taking each
$a_{n}=1$
, so that the
$\unicode[STIX]{x1D712}=1$
term on the left-hand side equals
$\unicode[STIX]{x1D6F9}(x,y)^{2}$
, the size of the right-hand side. This result has the advantage over the traditional large sieve inequality that the sequence
$\{a_{n}\}$
is supported on a sparse set (when
$y=x^{o(1)}$
), but the disadvantage that this inequality holds in a much smaller range for
$q$
than the usual
$q\ll x^{1/2}$
. It may well be that Theorem 5.1 holds with
$Q=\unicode[STIX]{x1D6F9}(x,y)^{1/2}$
.
5.1 Zero-density estimates
To prove Theorem 5.1, we will use the following two consequences of deep zero-density results in the literature. The first is a bound for character sums over smooth numbers assuming a suitable zero-free region for the associated
$L$
-function (see [Reference Harper7, §3]).
Proposition 5.2. There is a small positive constant
$\unicode[STIX]{x1D6FF}>0$
and a large positive constant
$\unicode[STIX]{x1D705}>0$
such that the following statement holds. Let
$(\log x)^{1.1}\leqslant y\leqslant x$
be large. Let
$\unicode[STIX]{x1D712}\;(\text{mod}~q)$
be a non-principal character with
$q\leqslant x$
and conductor
$r:=\operatorname{cond}(\unicode[STIX]{x1D712})\leqslant x^{\unicode[STIX]{x1D6FF}}$
. If
$L(s,\unicode[STIX]{x1D712})$
has no zeros in the region

where the parameters
$\unicode[STIX]{x1D700},T$
satisfy

and moreover

Then

We also need the following log-free zero-density estimate by Huxley and Jutila, which can be found in [Reference Harper7, §2].
Proposition 5.3. Let
$\unicode[STIX]{x1D700}\in [0,1/2]$
,
$T\geqslant 1$
and
$Q\geqslant 1$
. Then the function
$G_{Q}(s)=\prod _{q\leqslant Q}\prod _{\unicode[STIX]{x1D712}\;(\text{mod}\;q)}^{\ast }L(s,\unicode[STIX]{x1D712})$
has
$\ll (Q^{2}T)^{(5/2)\unicode[STIX]{x1D700}}$
zeros
$s$
, counted with multiplicity, inside the region (5.1).
5.2 Proof of Theorem 5.1
It suffices to prove its dual form.
Proposition 5.4. There exist
$C,c>0$
such that the following statement holds. Let
$(\log x)^{C}\leqslant y\leqslant x$
be large, and

For any sequence
$\{b_{\unicode[STIX]{x1D712}}\}$
we have

Proof. The left-hand side can be bounded by

Thus we need to understand character sums over smooth numbers. The contribution from the diagonal terms with
$\unicode[STIX]{x1D712}_{1}=\unicode[STIX]{x1D712}_{2}$
is clearly acceptable, and thus we focus on non-diagonal terms. For
$\unicode[STIX]{x1D702}\in (0,1/2]$
, define
$\unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
to be the set of all non-principal characters
$\unicode[STIX]{x1D712}\;(\text{mod}~q)$
with
$q\leqslant Q^{2}$
, such that

Furthermore, define
$\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
to be the set of primitive characters which induce a character in
$\unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
. The contribution to (5.4) from those
$\unicode[STIX]{x1D712}_{1},\unicode[STIX]{x1D712}_{2}$
with
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
is

We claim that for a given
$\unicode[STIX]{x1D712}_{1}\;(\text{mod}\;q_{1})$
and a given primitive
$\unicode[STIX]{x1D713}\in \unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
, there is at most one primitive character
$\unicode[STIX]{x1D712}_{2}$
such that
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}}$
is induced by
$\unicode[STIX]{x1D713}$
. To see this, suppose that there are two primitive characters
$\unicode[STIX]{x1D712}_{2}\;(\text{mod}\;q_{2})$
and
$\unicode[STIX]{x1D712}_{2}^{\prime }\;(\text{mod}\;q_{2}^{\prime })$
such that both
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}}:=\unicode[STIX]{x1D712}$
and
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}^{\prime }}:=\unicode[STIX]{x1D712}^{\prime }$
are induced by
$\unicode[STIX]{x1D713}$
. It suffices to show that
$\unicode[STIX]{x1D712}_{2}(n)=\unicode[STIX]{x1D712}_{2}^{\prime }(n)$
whenever
$(n,q_{2}q_{2}^{\prime })=1$
, since this would imply that
$\unicode[STIX]{x1D712}_{2}\unicode[STIX]{x1D712}_{2}^{\prime }$
is the principal character, and thus
$\unicode[STIX]{x1D712}_{2}=\unicode[STIX]{x1D712}_{2}^{\prime }$
as they are both primitive.
If
$(n,q_{2}q_{2}^{\prime })=1$
, then we may find an integer
$k$
such that
$(n+kq_{2}q_{2}^{\prime },q_{1}q_{2}q_{2}^{\prime })=1$
. Thus
$\unicode[STIX]{x1D712}_{1}(n+kq_{2}q_{2}^{\prime })\neq 0$
and
$\unicode[STIX]{x1D712}(n+kq_{2}q_{2}^{\prime })=\unicode[STIX]{x1D712}^{\prime }(n+kq_{2}q_{2}^{\prime })$
. It follows that

This completes the proof of the claim.
It follows that the contribution to (5.4) from those
$\unicode[STIX]{x1D712}_{1},\unicode[STIX]{x1D712}_{2}$
with
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
is

We will show that
$|\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})|\ll \unicode[STIX]{x1D702}^{-1/2}$
so that the result follows by summing over
$\unicode[STIX]{x1D702}$
dyadically. We may assume that
$\unicode[STIX]{x1D702}\geqslant Q^{-8}$
, as the bound follows for smaller
$\unicode[STIX]{x1D702}$
from the trivial bound
$|\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})|\leqslant Q^{4}$
.
We now use Proposition 5.2 to show that if
$\unicode[STIX]{x1D712}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
then
$L(s,\unicode[STIX]{x1D712})$
has a zero in the region (5.1) for suitable values of
$\unicode[STIX]{x1D700}$
and
$T$
. This would imply that
$L(s,\unicode[STIX]{x1D713})$
has zero in the region (5.1) for any
$\unicode[STIX]{x1D713}\in \unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
.
For the purpose of contradiction, let’s assume that
$\unicode[STIX]{x1D712}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
and
$L(s,\unicode[STIX]{x1D712})$
has no zero in (5.1) with
$T=Q^{500}$
. We wish to verify the hypotheses in (5.2) and (5.3). The upper bound on
$T$
in (5.2) follows from the definition of
$Q$
. Now
$r\leqslant q\leqslant Q^{2}$
and so the first alternative of (5.3) follows by selecting
$c$
so that
$502c\unicode[STIX]{x1D705}\leqslant 1$
. We define

Since
$\log \unicode[STIX]{x1D702}^{-1}\leqslant 8\log Q$
and
$\log \log x\leqslant \log Q$
, we have the upper bound

so that
$\unicode[STIX]{x1D700}=o(1)$
and
$y^{\unicode[STIX]{x1D700}}\ll Q^{108}$
. The first hypothesis in (5.2) follows immediately. Finally, by selecting
$C$
so that
$cC\geqslant 2$
we guarantee that
$Q\geqslant (\log x)^{2}$
, so that the lower bound on
$T$
in (5.2) follows easily.
Now
$\unicode[STIX]{x1D700}\geqslant 12(\log \unicode[STIX]{x1D702}^{-1}+\log \log x)/\!\log x$
so that
$x^{-0.3\unicode[STIX]{x1D700}}\leqslant (\unicode[STIX]{x1D702}/\!\log x)^{3.6}$
, and therefore

Now
$Q^{-10}\log x=o(Q^{-8})=o(\unicode[STIX]{x1D702})$
, as
$Q\geqslant (\log x)^{2}$
. Therefore Proposition 5.2 implies that

contradicting the definition of
$\unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
.
By Proposition 5.3 we now deduce (remembering that characters in
$\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
have conductors at most
$Q^{2}$
) that

which completes the proof. ◻
Remark 5.5. Assuming the Riemann hypothesis for Dirichlet
$L$
-functions, by [Reference Harper7, Proposition 1], we have the bound
$\sum _{n\leqslant x,P(n)\leqslant y}\unicode[STIX]{x1D712}(n)=O(x^{1-c})$
uniformly for
$\unicode[STIX]{x1D712}$
non-principal
$\text{mod}\;q$
,
$q\leqslant x^{c}$
,
$(\log x)^{C}\leqslant y\leqslant x^{c}$
, for some absolute constants
$C,c>0$
. This implies an upper bound

for (5.4), for all
$Q\leqslant x^{c}$
, and Theorem 5.1 would hold with
$Q=x^{c/3}$
and
$C$
large enough.
5.3 A variant of Theorem 5.1
We may extend the range to
$Q=x^{c}$
in Theorem 5.1 unconditionally if we insert some weights that reduce the effects of characters with large conductor.
Theorem 5.6. There exists
$C,c>0$
such that the following statement holds. Let
$(\log x)^{C}\leqslant y\leqslant x$
be large, and let
$Q=x^{c}$
. For any sequence
$\{a_{n}\}$
we have

Proof. The proof is similar as the proof of Theorem 5.1. We begin by passing to its dual form, so that we need to prove that

for any sequence
$\{b_{\unicode[STIX]{x1D712}}\}$
, where the summation is over all primitive characters
$\unicode[STIX]{x1D712}\;(\text{mod}\;q)$
with
$q\leqslant Q$
. Expanding the square, we can bound the left-hand side above by

For
$\unicode[STIX]{x1D702}\in (0,1/2]$
, define
$\unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
and
$\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
as in the proof of Theorem 5.1. If
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
so that it is induced by some
$\unicode[STIX]{x1D713}\;(\text{mod}\;r)\in \unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
, then

Thus the contribution to (5.5) from those
$\unicode[STIX]{x1D712}_{1},\unicode[STIX]{x1D712}_{2}$
with
$\unicode[STIX]{x1D712}_{1}\overline{\unicode[STIX]{x1D712}_{2}}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
is

Hence it suffices to show that

and then the conclusion follows after dyadically summing over
$\unicode[STIX]{x1D702}$
. For
$1\leqslant R\leqslant Q^{2}$
, let
$\unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702},R)$
and
$\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702},R)$
be the set of characters in
$\unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702})$
and
$\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702})$
with conductors
${\sim}R$
, respectively. Thus it suffices to show that

for each
$\unicode[STIX]{x1D702}\in (0,1/2]$
and
$1\leqslant R\leqslant Q^{2}$
. We may assume that
$\unicode[STIX]{x1D702}\geqslant R^{-4}$
, since otherwise the trivial bound
$|\unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702},R)|\ll R^{2}$
suffices. From now on fix such
$\unicode[STIX]{x1D702}$
and
$R$
.
We now use Proposition 5.2 to show that if
$\unicode[STIX]{x1D712}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702},R)$
then
$L(s,\unicode[STIX]{x1D712})$
has a zero in the region (5.1) for suitable values of
$\unicode[STIX]{x1D700}$
and
$T$
. This would imply that
$L(s,\unicode[STIX]{x1D713})$
has zero in the region (5.1) for any
$\unicode[STIX]{x1D713}\in \unicode[STIX]{x1D6EF}^{\ast }(\unicode[STIX]{x1D702},R)$
.
Set
$T=(\unicode[STIX]{x1D702}^{-1}\log x)^{60}$
. If
$R\leqslant (\log x)^{10}$
(say), then the first alternative of (5.3) holds because

provided that
$C\geqslant 2500\unicode[STIX]{x1D705}$
. In this case we will set
$\unicode[STIX]{x1D700}$
to be exactly the same as before:

Then (5.2) can be easily verified, and the contrapositive of Proposition 5.2 implies that
$L(s,\unicode[STIX]{x1D712})$
has a zero in the region (5.1) whenever
$\unicode[STIX]{x1D712}\in \unicode[STIX]{x1D6EF}(\unicode[STIX]{x1D702},R)$
. Hence by Proposition 5.3 we have

since
$\unicode[STIX]{x1D700}\ll 1/\!\log \log x$
in this case.
It remains to consider the case when
$(\log x)^{10}\leqslant R\leqslant Q^{2}$
. We set
$T$
as above, and we will now set

so that the second alternative in (5.3) is satisfied. One can still easily verify (5.2), and thus Propositions 5.2 and 5.3 combine to give

since
$\unicode[STIX]{x1D700}\leqslant 1/300$
(by choosing
$C$
large enough) and
$\log x\leqslant R^{1/10}$
. This completes the proof.◻
Examining the proof, one easily sees that the weight
$1/q^{1/2}$
can be replaced by
$1/q^{\unicode[STIX]{x1D70E}}$
for any constant
$\unicode[STIX]{x1D70E}>0$
, and the statement remains true provided that
$C$
is large enough in terms of
$\unicode[STIX]{x1D70E}$
. For our purposes, any exponent strictly smaller than
$1$
suffices.
6 Bounding the number of exceptional characters
Corollary 6.1. There exist
$C,c>0$
such that the following statement holds. Let
$(\log x)^{C}\leqslant y\leqslant x^{1/4}$
be large. Let
$\{a_{n}\}$
be an arbitrary
$1$
-bounded sequence. For
$B\geqslant 0$
, let
$\unicode[STIX]{x1D6EF}(B)$
be the set of primitive characters
$\unicode[STIX]{x1D712}\;(\text{mod}\;r)$
with
$r\leqslant Q$
where

such that there exists
$x^{1/4}<X\leqslant x$
for which

Then
$|\unicode[STIX]{x1D6EF}(B)|\ll (\log x)^{3B+13}$
.
Proof. Let
$T=(u\log u)^{4}(\log x)^{B}$
. We begin by partitioning the interval
$[x^{1/4},x]$
using a sequence
$x^{1/4}=X_{0}<X_{1}<\cdots <X_{J-1}<X_{J}=x$
with
$J\asymp T\log x$
, such that
$X_{j+1}-X_{j}\asymp \unicode[STIX]{x1D700}X_{j}/T$
, for some fixed small enough
$\unicode[STIX]{x1D700}>0$
, for each
$0\leqslant j<J$
.
For each
$\unicode[STIX]{x1D712}\in \unicode[STIX]{x1D6EF}(B)$
, there exists some
$0\leqslant j<J$
for which

Corollary 2 of [Reference Hildebrand8] implies a good upper bound for smooth numbers in short intervals: for any fixed
$\unicode[STIX]{x1D705}>0$
,

In our case
$u\log u\leqslant \log x$
so that
$T\leqslant (\log x)^{B+4}$
, so the hypothesis here is satisfied, and we therefore have

By choosing
$\unicode[STIX]{x1D700}$
sufficiently small, we deduce that

We deduce that there exists some
$0\leqslant j<J$
such that (6.2) holds for at least
$|\unicode[STIX]{x1D6EF}(B)|/J$
characters
$\unicode[STIX]{x1D712}\in \unicode[STIX]{x1D6EF}(B)$
. Therefore

On the other hand, Theorem 5.1 implies that

and therefore
$|\unicode[STIX]{x1D6EF}(B)|\ll T^{3}\log x=(u\log u)^{12}(\log x)^{3B+1}\ll (\log x)^{3B+13}$
, as claimed.◻
We also record the following variant which gives a weighted count of exceptional characters, but now with the wider range
$Q=x^{c}$
.
Corollary 6.2. There exist
$C,c>0$
such that the following statement holds. Let
$(\log x)^{C}\leqslant y\leqslant x^{1/4}$
be large. Let
$\{a_{n}\}$
be an arbitrary
$1$
-bounded sequence. For
$B\geqslant 0$
, let
$\unicode[STIX]{x1D6EF}(B)$
be the set of primitive characters
$\unicode[STIX]{x1D712}\;(\text{mod}\;r)$
with
$r\leqslant Q:=x^{c}$
, such that there exists
$x^{1/4}<X\leqslant x$
for which

Then

Proof. The proof is the same as above, except that one considers the weighted sum

and use Theorem 5.6 instead of Theorem 5.1 in the last step. ◻
Acknowledgements
A.G. has received funding from the European Research Council grant agreement no. 670239, and from NSERC Canada under the CRC program. X.S. was supported by a Glasstone Research Fellowship. We would like to thank Adam Harper for a valuable discussion.