Newton interpolation and the nn-th harmonic number

Introduction

Baker (1984), in his well-known introduction to number theory, asks in exercise (iii) of chapter 1 to prove that 1+12+13++1n1 + \frac{1}{2} + \frac{1}{3} + \ldots + \frac{1}{n} is never an integer, for no nN>1n \in \mathbb{N}_{> 1}. At first glance, this is not entirely clear, in particular as the harmonic series diverges rendering a lot of possible values to be attained.
As it turns out, the proof to this claim is rather challenging considering the material Baker covered in the first chapter. There is an interesting approach, however, that is based on numerical analysis in the form of polynomial interpolation. It utilizes chapter 4, exercises 9 and 10 of Koenigsberger’s ‘Analysis I’ and builds on Newton interpolation.

Newton interpolation

Suppose we have n+1n + 1 distinct input data x0,x1,x2,,xnx_0, x_1, x_2, \ldots, x_n, e.g. temperature values, and n+1n + 1 observed output values y0,,yny_0, \ldots, y_n. We claim a causal or non-causal relationship between the input and output data and further scrutinize the ordered pairs (x0,y0),(x1,y1),,,(xn,yn)(x_0, y_0), (x_1, y_1), \ldots, …, (x_n, y_n). Loosely speaking, the general idea in polynomial interpolation now is to consider some polynomial P(x)P(x) of some degree and change its coefficients in a way that fits the data at hand, i.e. P(x0)y0,P(x1)y1,,P(xn)ynP(x_0) \approx y_0, P(x_1) \approx y_1, \ldots, P(x_n) \approx y_n. One advantage now is that we are able to infer non-observed, but plausible input-output-pairs based on the data observed, for example, P(x0+x12)P(\frac{x_0 + x_1}{2}).

In Newton interpolation, we consider the following as the base polynomial whose coefficients are fitted to the data,

P(x)=c0+c1(xx0)+c2(xx0)(xx1)++cn(xx0)(xx1)(xxn1)=c0+k=0n1ck+1(xx0)(xxk)

The Newton polynomial incorporates the observed data in its design. Rather than to approximate y0,,yny_0, \ldots, y_n, it demands to change the coefficients c0,,cnc_0, \ldots, c_n in such a way that P(xi)=yiP(x_i) = y_i for all i{0,,n}i \in \{0, \ldots, n\}. In other words, the resulting polynomial fits the observed data perfectly. The following proposition proves that for an arbitrary collection of input-output-pairs (x0,y0),,(xn,yn)(x_0, y_0), \ldots, (x_n, y_n) there are always unique coefficients c0,,cnc_0, \ldots, c_{n} that fulfill that demand.

Proposition. There are unique coefficients c0,,cnc_0, \ldots, c_{n} such that the above polynomial P(x)P(x) has the property that P(xi)=yiP(x_i) = y_i for all i{0,,n}i \in \{0, \ldots, n\}.

Proof.

One conceptual idea to prove the statement is to rewrite it in terms of matrix algebra,

(P(x0)P(xn))=(c0c0+c1(x1x0)c0++cn(xnx0)(xnxn1))=Pc=(y0yn)

where

P=(1001(x1x0)01(x1x0)(xnx0)(xnx1)(xnxn1))

and

c=(c0cn)

Since the x0,,xnx_0, \ldots, x_n are distinct by assumption, the diagonal of PP doesn’t vanish anywhere. Since the determinant of a triangular matrix is the product of its diagonal elements and the matrix is invertible iff the determinant doesn’t vanish, it follows that P\mathbf{P} is invertible and hence there is unique c\mathbf{c} that solves the system of equations. \blacksquare

From Newton polynomials to integer-valued polynomials

Consider the polynomial P(x)P(x) from the section before and set xi=ix_i = i for i{0,,n}i \in \{0, \ldots, n\} as the input data. Then, we have that

P(x)=c0+k=0n1ck+1(xx0)(xxk)=c0+k=0n1ck+1x(x1)(xxk)=c0+k=1nckx(x1)(xk+1)=c0+k=1nckx!(xk)!k!k! =c0+k=1nckk!(xk)=c0+k=1nbk(xk) =k=0nbk(xk)

where bk=ckk!b_k = c_kk!. The representation that will be important in what follows is

P(x)=k=0nbk(xk)(1)

It is posed as an exercise in Konigsberger (1999, chapter 4, ex. 10) to show that some polynomial SS of degree nn is integer-valued if and only if it has the representation in (1) with b0,b1,,bnZb_0, b_1, \ldots, b_n \in \mathbb{Z}. Clearly, (xk)\binom{x}{k} is for xZx \in \mathbb{Z} and kNk \in \mathbb{N} always an integer.

Proof. One direction is obvious as the sum of and products of integers are again integers. Now, suppose we have an polynomial SS from which we know that is integer-valued and has the given form. Then, also P(0)P(0) is an integer. But clearly, P(0)=b0P(0) = b_0. Therefore, b0b_0 has to be an integer. The other coefficients, bkb_k for k=1,,nk = 1, \ldots, n are then recursively determined since b1=P(m)b0b_1 = P(m) - b_0 and so on, i.e. bm=P(m)k=0m1bk(zm)b_m = P(m) - \sum\limits_{k = 0}^{m - 1} b_k\binom{z}{m}. Since b0b_0 is an integer and P(m)P(m) is an integer by assumption, we have the converse statement that bkZb_k \in \mathbb{Z} for all k{1,,n}k \in \{1, \ldots, n\}.

The nn-th harmonic number

After introducing Newton interpolation, let us go back to the original problem of showing that

Hn=1+12+13++1n

is never an integer. In mathematics, HnH_n is called the nn-th harmonic number and credited to Euler who analyzed it extensively.

It is very easy to show that for xRx \in \mathbb{R} and nNn \in \mathbb{N} it holds that

1+x+x2++xn=1xn+11x

By simple integration rules we have that 1=011dx12=01xdx 13=01x2dx 

and therefore

Hn=1+12++1n=011+x+x2++xn1dx=011xn1xdx

Now set the substitution x=1ux = 1 - u. Then, we have that dxdu=1\frac{\text{d}x}{\text{d}u} = -1. Also, the upper limit transforms to u(1)=11=0u(1) = 1 - 1 = 0 and the lower limit to u(0)=10=1u(0) = 1 - 0 = 1. Therefore, we get

Hn=011xn1xdx=101(1u)nudu=011(1u)nudu

The integrand can be expanded into a binomial since (1u)n=k=0n(nk)(1)kuk(1 - u)^{n} = \sum\limits_{k = 0}^{n} \binom{n}{k}(-1)^{k}u^{k}. Therefore, we arrive at

Hn=01(k=1n(nk)(1)k1uk1)du=k=1n(1)k11k(nk)(2)

Solution to the initial problem

Consider now HnH_n in (2) as a polynomial of nn, i.e. H(n)H(n) and write bk=(1)k11kb_k = (-1)^{k - 1}\frac{1}{k} as in (1). Then we know from section 2 and 3 that this polynomial can only be integer-valued iff all the bkb_k for k{1,,n}k \in \{1, \ldots, n\} are themselves integers. But clearly, they are not, for no integer nn greater 1. Hence, HnH_n is never an integer, for no nN>1n \in \mathbb{N}_{> 1}. \blacksquare

References

[1] Alan Baker (1984): “A Concise Introduction to Number Theory”. Cambridge University Press.

[2] Konigsberger (1999): “Analysis I”. Springer Verlag.