0. Introduction
Recently, I worked out [1, Exercise 2.4] which deals with the famous ballot problem and simple symmetric random walks.
The problem reads essentially as follows.
Suppose there is a local election with only two candidates A and B. We count the number of votes for both candidates. A and B are equally popular, i.e. the probability to vote for either one is equal, pA=21 and pB=21.
There are 2N votes in total.
(i.) What is the probability that the first tie of votes occurs first after counting 2n votes with n≤N?
(ii.) What is the probability that there is no tie after counting 2n votes?
The combinatorial part of the proof contains a very elegant reflection argument that I haven’t seen before.
In the following, I present a detailed solution including the reflection argument. The main source I used beside [1] is the excellt exposition [2] by Sven Erick Alm.
1. Notation and auxiliary results
First, we need to determine the underlying probability space. There are only two possible outcomes, namely a vote for candidate A or candidate B, which we can decode as {1,−1}. For 2n rounds of voting we therefore have Ω={−1,1}2n.
Ω is obviously finite for finite n, therefore A=P(Ω) as underlying σ-algebra. Since each vote happens equally likely for each candidate we set P=UΩ.
All in all, we arrive at the probability space
(Ω,A,P)=({1,−1}2n,P({1,−1}2n),U{1,−1})
Now consider the graph (Ai)i∈{1,…,n} where
Ai=(i,j=0∑iωi)
Formalise (i). as event Gn and (ii). as G>n. Then, the event Gn⊂(Ai)1≤i≤2n consists of the number of paths that start in (0,0) and after 2n steps arrive in (2n,0) while inbetween it holds that y>0 for {(i,y)}1≤i≤2n. In other words, the graph does not cross the x axis.
To formalise the problem we need to introduce more notation. Let p=pA=21 be the probability for a step upwards and q=pB=1−p=21 the one for one step downwards.
A path that starts in (0,a) and after n steps arrives in (n,b), can be characterized as having h steps up and v steps down. These necessarily fulfill the relationship:
After solving for v and h, we obtain h=2n+(b−a) and v=2n−(b−a). The number of possibilities to perform h steps up in n steps total, is (hn)=(n−hn)=(vn).
Due to the uniformness assumption, the probability to go from a to b in n steps is equal to Pn(a,b)=(hn)phqv.
Note that (hn) does not yet take into account whether we cross the x-axis.
2. Solving the problem
Let Nn(a,b) be the number of paths from a to b in n steps, Nn=0(a,b) the number of paths from a to b that do not cross the x-axis and N0(a,b) the number of paths from a to b that cross the x-axis.
It clearly holds that Nn(a,b)=(hn) as you can choose h steps up among n steps in total.
But it also holds that
and Nn0(a,b)=Nn(a,b) if sgn(a)=sgn(b).
Therefore, it is obvious that
but on the other hand,
since both candidates may be in the lead the whole time, i.e. the graph is either above or below the x-axis and, hence, wanders from 1 (candidate A) to 0 or from −1 (candidate B) to 0 in 2n−1 steps.
Since any path above the x-axis always represents the mirror image of a suitable graph below the x-axis, the number of paths double.
Eventually, there are exactly as many paths that wander in 2n−1 steps from 1 to 0 as there are that wander in 2n−2 steps from 1 to 1.
The reason is that 2n−1th steps is predetermined as being a step down from 1 to 0.
Now, we need to use the mirror principle, the main argument of the proof that really appealed to me: A path that crosses the x-axis between the coordinates (1,1) and (2n−2,1), can be represented as a path that starts in (1,−1) and finishes in (2n−2,1).
We do this by mirroring the graph at the x-axis until its first root appears.
Hence, the number of paths that do not cross the x-axis in 2n−2 steps is equal to the difference between all paths between (1,1) and (1,2n−2) and those that start in (1,1), finish in (2n−2,1) and cross the x-axis.
With the formula for h above, we obtain h1=22n−2+(1−1)=n−1 and h2=22n−2−(1−(−1))=n
In total, we get
and
To solve (ii)., [1] suggests to consider the complementary event G>nc. Then, it holds due to the disjointness of Gn for different n (which is clear intuitively) that
and hence,
In the last step, we used the telescope sum together with u0=1.
References
[1] H.-O. Georgii (2014): Stochastik. Eine Einführung.
[2] S. E. Alm (2006): Simple random walks.