From a set , assume that we want to pick elements that are not divisible by any primes in a set . Let be the set of element of divisible by
If we write the inclusion-exclusion to sift out the element divisible by some prime in , we get
Even if we know very accurately, the above expression is not useful if has a lot of primes because the sum has too many terms, the error terms with add up.
So we need approximations to that are supported on small so that we have lesser number of terms in the sifting sum.
Say we want an upper bound, one way to achieve is to make sure that you always count elements not divisible by and add flexibility to include more elements.
So we want such
Selberg choice is the the following square form to ensure positivity.
with .
Assume
where is multiplicative function telling us the probability of a random element being divisible by , is the error and is the total size of .
We now ignore the error terms and try to find that minimize the main term. This is a quadratic form with coefficients
where we denoted , with the gcd
Let us call define by
We have
So we have
With
the form is diagaonalized to
Now we rewrite in terms of .
The constraint gives
Assume the support of is
Cauchy -Schwarz implies
Therefore
where
The optimal vector satisifies
And we get the main term . So we have
where
We can see that
Thus
Note the is the main term for the sifted set if the events are independent.
To get the good upper bounds we need to show that is large.
To get bounds on , we need bounds on . I f we prove , we get
So we need to look at the how the optimal look like.
Hence we have
The shape of the sieve weight .
We want to evaluate . Without the the restriction we get
if we have something like in some average sense where
Thus behaves like a smooth truncation of . Instead of the optimal vectors, we could start with vectors of the shape
where is is a smooth function on and similar results.
The smoothness is important because the sharp truncation gives a lot of weight to numbers that are not free of small prime factors. In fact, we have
whereas the smooth weights above give
and hence the main contribution comes from numbers with few number of small prime factors.
But we also have
so the contribution of numbers with small prime factors is not too small, because by the above estimate, if we weight it by , that the contribution amplifies and dominates the sum.
The sign of the resulting sieve weight:
Looking at the formula for we saw that has the same sign as , but this doesn’t mean will also have the same sign.
But
We can also prove that
GPY Weights:
Instead of choosing to be constant, if we choose
the resulting sieve weights looks like
that is the like the weights for dimension . These are helpful to get small gaps between primes.
Dependence on :
If the primes sifted is , then for a fixed (the dimension, average classes mod that need to be sifted from ), as becomes small relative to (the support of the sieve weight), the main term converges to the . But the rate of convergence is small compared to other combinatorial sieves like beta sieve.
The Selberg sieve can be used to create lower bound sieve which for large , gives better lower bound in a wider range of , compared to other combinatorial sieves.
Also for large , the main terms wont change much if we increase from to a bigger values. That is primes large than have no effect on Selberg’s upper bound.
Application to Brun-Titichmarch inequality:
If we take a short interval and we want to count primes in the arithmetic progression , we have
with , we have
So taking , we get
It’s possible to remove the error by other methods.