One of the earliest and most influential applications of the probabilistic method in combinatorics was given by Paul Erdős in his 1947 paper Some Remarks on the Theory of Graphs. The argument is famous not because it is long or technically complicated, but because it introduced a powerful new way of proving existence. Instead of constructing a mathematical object explicitly, Erdős showed that if we choose an object randomly, then the probability that it has the desired property is positive. Therefore, such an object must exist.
The setting is Ramsey theory. The diagonal Ramsey number is the smallest integer
such that every red-blue coloring of the edges of the complete graph
contains a monochromatic copy of
. A monochromatic
is a set of
vertices such that every edge between them has the same color, either all red or all blue. Thus, saying
means that there exists at least one red-blue edge-coloring of
with no monochromatic
.
Erdős’s theorem gives a clean condition for such a coloring to exist.
Theorem: If , then
.
The proof begins by coloring every edge of independently and uniformly at random, choosing red with probability
and blue with probability
. Now fix a particular set
of
vertices. The complete graph on these vertices contains
edges. Let
be the event that this set
forms a monochromatic
. This can happen in exactly two ways: either every edge inside
is red, or every edge inside
is blue.
The probability that all edges are red is
. The probability that all of them are blue is the same:
. Since these two possibilities are disjoint, we get
.
There are possible choices for the set
. We now use the union bound. The probability that there exists at least one monochromatic
is at most the sum of the probabilities of all the bad events
. Therefore,
.
Since each has probability
, this becomes
.
Thus, if , then the probability that a random coloring contains a monochromatic
is less than
. Hence the probability that it contains no monochromatic
is positive. This means at least one good coloring exists, and therefore
.
This is the probabilistic method in its purest form. We do not explicitly describe the good coloring. We prove that it must exist because a random coloring has a positive chance of being good.
To understand the numerical strength of the result, we now estimate the largest possible satisfying the inequality. We use the standard estimate
. By Stirling’s formula,
. Ignoring lower-order factors, this gives
, so
. The condition
is therefore roughly
.
Since , we get
. Taking
-th roots gives
, and hence
. Now
, so asymptotically this gives
.
This was already a remarkable result. It showed that Ramsey numbers grow exponentially and that random colorings provide strong lower bounds.
Erdős’s original paper can also be interpreted in terms of counting. There are total red-blue colorings of
. Fix a set
of
vertices. The number of colorings in which
is monochromatic is
. The factor
comes from choosing whether the clique is all red or all blue. The remaining edges outside that clique can be colored freely. Since there are
possible sets
, the number of bad colorings is at most
. This equals
. If
, then the number of bad colorings is strictly smaller than the total number of colorings. Hence at least one coloring is not bad. That coloring avoids monochromatic
.
Alteration Method
A later refinement is the alteration method. Instead of requiring the random coloring to be perfect immediately, we allow it to have some bad structures and then delete vertices to destroy them. Randomly color as before, and let
be the number of monochromatic copies of
. For each
-vertex set
, define an indicator variable
Then we have
.
By linearity of expectation,
.
Since is an indicator random variable,
.
Therefore, .
Now delete one vertex from every monochromatic . If there are
monochromatic cliques, we delete at most
vertices. The remaining graph has no monochromatic
, and it has at least
vertices. Taking expectations gives
.
Thus there exists some coloring and deletion process leaving at least vertices with no monochromatic
. Hence
.
Optimizing this bound gives a better constant. Approximate
.
Let . Then the number of remaining vertices is approximately
.
To maximize this, differentiate: . Setting
gives
, so
. Since
, we obtain
. Using Stirling’s formula again, this gives approximately
. At the maximizing value, the equation
implies
. Therefore,
. Since
, the alteration method gives
.
This improves the original union-bound constant by a factor of .
Lovász Local Lemma
An even stronger improvement comes from the Lovász Local Lemma. The union bound works when the total probability of all bad events is less than , but this can be wasteful because it ignores the fact that many bad events are independent. In the Ramsey setting, two bad events only depend on each other if the corresponding
-vertex sets share an edge. If two vertex sets share fewer than two vertices, then their edge sets are disjoint, and the corresponding events are independent.
The symmetric Lovász Local Lemma says that if each bad event has probability at most , and each bad event depends on at most
other bad events, then it is enough to have
to guarantee that with positive probability none of the bad events occur.
In the Ramsey application, the bad event is again the event that a fixed
-vertex set
spans a monochromatic clique. We already computed
.
Now we estimate . Fix
. Another
-vertex set
can depend on
only if
.
We can overcount the number of such by first choosing two vertices from
and then choosing the remaining
vertices from the full set of
vertices. This gives
.
Therefore, the local lemma applies if . Equivalently,
.
When this holds, there exists a red-blue edge-coloring of with no monochromatic
. Hence
.
We now optimize this condition. Ignoring the harmless , the main inequality is approximately
. Use
and
. Then the inequality becomes
.
Because , this is approximately
.
Thus we need .
Using Stirling’s formula, . Ignoring subexponential factors, this behaves like
. Therefore,
. Taking
-nd roots gives
.
Now simplify the exponent: .
Since , we get
. As
,
. Hence
.
Therefore the Lovász Local Lemma gives
.
This is the best known lower bound of this general form. It improves the alteration bound by another factor of , and it improves the original union-bound argument by a factor of
.
Putting the three bounds side by side, we have the union-bound lower bound
,
the alteration-method lower bound
,
and the Lovász Local Lemma lower bound
.
These results show the power of probabilistic reasoning. The first proof shows that a random coloring is likely enough to avoid all bad events when the union bound is strong. The alteration method shows that even if the random object is imperfect, we may repair it by deleting bad parts. The Lovász Local Lemma shows that when bad events have limited dependence, we can do better than the union bound.
The remarkable point is that none of these arguments explicitly constructs the Ramsey coloring. They prove existence indirectly. This is both the beauty and the limitation of the probabilistic method. It tells us that good colorings exist, but it does not necessarily tell us how to find them efficiently.
Constructive Ramsey lower bounds remain difficult. Although random colorings show that many good colorings exist, certifying that a particular coloring avoids monochromatic requires checking many vertex subsets. There are
such subsets, which is enormous for large
. Good objects may be abundant, but recognizing one efficiently is still hard.
Upper Bounds:
The upper-bound side comes from the classical Erdős-Szekeres recurrence
.
Here is the idea. Choose a vertex . The remaining vertices split into two sets: those joined to
by red edges and those joined to
by blue edges. If the red-neighbor set has size at least
, then it contains either a red
, which together with
gives a red
, or a blue
. The blue-neighbor case is symmetric.
Using the boundary values , this recurrence gives
.
In particular, for diagonal Ramsey numbers this implies an upper bound of roughly
.
The known lower bounds are still exponentially smaller, of order roughly .
Thus, despite major progress, there remains a large exponential gap between the best known lower and upper bounds for diagonal Ramsey numbers. Erdős’s probabilistic method opened the door to this field by showing that randomness can prove the existence of highly structured objects that are extremely difficult to construct explicitly.