Mathematics behind the Babylonian Square Root method

Posted 2020-04-20 09:50

Question:



Closed 6 years ago.

I read about a method to calculate the square root of a number; the algorithm is as follows:

double findSquareRoot(int n) {
    double x = n;        // overestimate of the root (for n > 1)
    double y = 1;        // underestimate of the root
    double e = 0.00001;  // desired precision
    while (x - y >= e) {
        x = (x + y) / 2; // midpoint of the two estimates
        y = n / x;       // lands on the other side of the true root
    }
    return x;
}

My questions regarding this method are:

  1. How does it calculate the square root? I don't understand the mathematics behind it. How do x = (x+y)/2 and y = n/x converge to the square root of n? Please explain the mathematics.

  2. What is the complexity of this algorithm?

Answer 1:

It is easy to see what happens if you do a few runs and print the successive values of x and y. For example, for 100:

50.5 1.9801980198019802
26.24009900990099 3.8109612300726345
15.025530119986813 6.655339226067038
10.840434673026925 9.224722348894286
10.032578510960604 9.96752728032478
10.000052895642693 9.999947104637101
10.000000000139897 9.999999999860103

See, the trick is that if x is not the square root of n, then it lies either above or below the real root, and n/x is always on the other side. So if you calculate the midpoint of x and n/x, it will be somewhat nearer to the real root.

And about the complexity: it is actually unbounded, because the real root will never be reached exactly. That's why you have the e parameter.
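The run above can be reproduced with a small Python sketch of the same algorithm (the function name is just for illustration):

```python
def trace_sqrt(n, e=1e-5):
    # same algorithm as in the question, instrumented to print x and y
    x, y = float(n), 1.0
    while x - y >= e:
        x = (x + y) / 2   # midpoint of the two bracketing values
        y = n / x         # always on the other side of the true root
        print(x, y)
    return x

trace_sqrt(100)  # prints the pairs shown above, starting with 50.5 1.98019...
```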



Answer 2:

This is a typical application of Newton's method for calculating the square root of n. You're calculating the limit of the sequence:

x_0 = n
x_{i+1} = (x_i + n / x_i) / 2

Your variable x is the current term x_i and your variable y is n / x_i.

To understand why you have to calculate this limit, you need to think of the function:

f(x) = x^2 - n

You want to find the root of this function. Its derivative is

f'(x) = 2 * x

and Newton's method gives you the formula:

x_{i+1} = x_i - f(x_i) / f'(x_i) = ... = (x_i + n / x_i) / 2
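The elided algebra can be verified numerically; this sketch (helper names are illustrative) checks that the Newton step for f(x) = x² - n reduces to the Babylonian update:

```python
def newton_step(x, n):
    # x_{i+1} = x_i - f(x_i)/f'(x_i), with f(x) = x^2 - n and f'(x) = 2x
    return x - (x * x - n) / (2 * x)

def babylonian_step(x, n):
    # the update used in the question's code
    return (x + n / x) / 2

# the two updates agree (up to floating-point rounding)
for x in (1.0, 3.0, 50.5):
    assert abs(newton_step(x, 100) - babylonian_step(x, 100)) < 1e-9
```

Algebraically, x - (x² - n)/(2x) = (2x² - x² + n)/(2x) = (x² + n)/(2x) = (x + n/x)/2, so the two are the same formula.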

For completeness, I'm copying here the rationale from @rodrigo's answer, combined with my comment to it. This is helpful if you want to forget about Newton's method and try to understand this algorithm alone.

The trick is that if x is not the square root of n, then it is an approximation which lies either above or below the real root, and y = n/x is always on the other side. So if you calculate the midpoint (x+y)/2, it will be nearer to the real root than the worse of the two approximations (x or y). When x and y are close enough, you're done.

This will also help you find the complexity of the algorithm. Say that d is the distance from the worse of the two approximations to the real root r. Then the distance between the midpoint (x+y)/2 and r is at most d/2 (drawing a number line will help you visualize this). This means that, with each iteration, the distance is at least halved. Therefore, the worst-case complexity is logarithmic in the ratio of the initial approximation's distance to the sought precision. For the given program, it is

log(|n - sqrt(n)| / epsilon)
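The bound can be checked against an actual run; this sketch (function name illustrative, same loop as in the question) counts iterations and compares them with the halving-distance bound:

```python
import math

def count_iterations(n, e=1e-5):
    # the question's loop, instrumented to count steps
    x, y, steps = float(n), 1.0, 0
    while x - y >= e:
        x = (x + y) / 2
        y = n / x
        steps += 1
    return steps

n, e = 100, 1e-5
bound = math.ceil(math.log2((n - math.sqrt(n)) / e))  # halving-distance bound
steps = count_iterations(n, e)
assert steps <= bound
```

For n = 100 and e = 1e-5 the bound is 24 iterations, while the trace in the first answer shows only 7: once the iterates are close to the root, Newton's method converges quadratically, so the halving argument is only a worst-case bound.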


Answer 3:

I think all the information can be found on Wikipedia.

The basic idea is that if x is an overestimate of the square root of a non-negative real number S, then S/x will be an underestimate, and so the average of these two numbers may reasonably be expected to provide a better approximation.

With each iteration, this algorithm roughly doubles the number of correct digits in the answer, so its complexity is linear in the logarithm of the desired accuracy.

Why does it work? As stated here, if you iterate indefinitely the values converge to some limit; call it L. L has to satisfy the equation L = (L + N/L)/2 (the fixed point of the iteration), so L = sqrt(N). If you're worried about convergence, you can calculate the squared relative error at each iteration (E_k is the error, A_k is the computed value):

E_k = (A_k / sqrt(N) - 1)²

Given that

A_k = (A_{k-1} + N/A_{k-1}) / 2   and   A_k = sqrt(N) · (sqrt(E_k) + 1),

you can derive a recurrence relation for E_k:

E_k = E_{k-1}² / [4 · (sqrt(E_{k-1}) + 1)²]

Its limit is 0, so the limit of the sequence A_1, A_2, ... is sqrt(N).
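The recurrence can be checked numerically; this sketch (N and the starting overestimate are arbitrary values chosen for illustration) compares the predicted error against the measured one over a few iterations:

```python
import math

N = 2.0
A = 3.0                          # deliberate overestimate of sqrt(2)
E = (A / math.sqrt(N) - 1) ** 2  # squared relative error E_0

for _ in range(3):
    # E_k predicted from E_{k-1} via the recurrence above
    E_pred = E ** 2 / (4 * (math.sqrt(E) + 1) ** 2)
    A = (A + N / A) / 2                  # one Babylonian iteration
    E = (A / math.sqrt(N) - 1) ** 2      # measured E_k
    assert abs(E - E_pred) <= 1e-5 * E_pred + 1e-12
```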



Answer 4:

The mathematical explanation is that, over a small range, the arithmetic mean is a reasonable approximation to the geometric mean, which is what the square root is. As the iterations get closer to the true square root, the difference between the arithmetic mean and the geometric mean vanishes, and the approximation gets very close. Here is my favorite version of Heron's algorithm: it first normalizes the input n into the range 1 ≤ n < 4, then unrolls the loop for a fixed number of iterations that is guaranteed to converge.

def root(n):
    if n < 1: return root(n*4) / 2
    if 4 <= n: return root(n/4) * 2
    x = (n+1) / 2
    x = (x + n/x) / 2
    x = (x + n/x) / 2
    x = (x + n/x) / 2
    x = (x + n/x) / 2
    x = (x + n/x) / 2
    return x
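As a quick sanity check, the normalized version can be compared with math.sqrt across several magnitudes (root is repeated here, with the five unrolled steps written as a loop, so the snippet stands alone):

```python
import math

def root(n):
    # normalize into [1, 4): sqrt(4n) = 2*sqrt(n) and sqrt(n/4) = sqrt(n)/2
    if n < 1: return root(n * 4) / 2
    if 4 <= n: return root(n / 4) * 2
    x = (n + 1) / 2          # initial guess, at most 25% off on [1, 4)
    for _ in range(5):       # the five unrolled Newton steps above
        x = (x + n / x) / 2
    return x

# spot-check against math.sqrt across several orders of magnitude
for n in (0.01, 0.5, 2, 100, 12345.678):
    assert math.isclose(root(n), math.sqrt(n), rel_tol=1e-12)
```

With the input normalized to [1, 4), the initial guess is never more than 25% off, and five quadratically converging steps are enough to exhaust double precision, which is why no tolerance parameter is needed.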

I discuss several programs to calculate the square root at my blog.