Ackermann function

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

In computability theory, the Ackermann function, named after Wilhelm Ackermann, is one of the simplest [1] and earliest-discovered examples of a total computable function that is not primitive recursive. All primitive recursive functions are total and computable, but the Ackermann function illustrates that not all total computable functions are primitive recursive. After Ackermann's publication [2] of his function (which had three nonnegative integer arguments), many authors modified it to suit various purposes, so that today "the Ackermann function" may refer to any of numerous variants of the original function. One common version, the two-argument Péter-Ackermann function is defined as follows for nonnegative integers m and n:

Its value grows rapidly, even for small inputs. For example, A(4, 2) is an integer of 19,729 decimal digits [3] (equivalent to 265536−3, or 22222−3).

History[edit]

In the late 1920s, the mathematicians Gabriel Sudan and Wilhelm Ackermann, students of David Hilbert, were studying the foundations of computation. Both Sudan and Ackermann are credited[4] with discovering total computable functions (termed simply "recursive" in some references) that are not primitive recursive. Sudan published the lesser-known Sudan function, then shortly afterwards and independently, in 1928, Ackermann published his function (the Greek letter phi). Ackermann's three-argument function, , is defined such that for , it reproduces the basic operations of addition, multiplication, and exponentiation as

and for p > 2 it extends these basic operations in a way that can be compared to the hyperoperations:

(Aside from its historic role as a total-computable-but-not-primitive-recursive function, Ackermann's original function is seen to extend the basic arithmetic operations beyond exponentiation, although not as seamlessly as do variants of Ackermann's function that are specifically designed for that purpose—such as Goodstein's hyperoperation sequence.)

In On the Infinite,[5] David Hilbert hypothesized that the Ackermann function was not primitive recursive, but it was Ackermann, Hilbert's personal secretary and former student, who actually proved the hypothesis in his paper On Hilbert's Construction of the Real Numbers.[2][6]

Rózsa Péter[7] and Raphael Robinson[8] later developed a two-variable version of the Ackermann function that became preferred by many authors.

The generalized hyperoperation sequence, e.g. , is a version of Ackermann function as well.[9]

In 1963 R.C. Buck based an intuitive two-variable variant ([10]) on the hyperoperation sequence:[11]

Compared to most other versions Buck's function has no unessential offsets:

Many other versions of Ackermann function have been investigated.[12]

Definition[edit]

As m-ary function[edit]

Ackermann's original three-argument function is defined recursively as follows for nonnegative integers and :

Of the various two-argument versions, the one developed by Péter and Robinson (called "the" Ackermann function by some authors) is defined for nonnegative integers and as follows:

The Péter-Ackermann function can also be expressed in relation to various other versions of the Ackermann function:

Hence
( and would correspond with and , which could logically be added.)

As iterated 1-ary function[edit]

Define f n as the n-th iterate of f

where id is the identity function and fg denotes function composition.

Conceiving the Ackermann function as a sequence of unary functions, one can set .

The function then becomes a sequence of unary [13] functions, defined from iteration:

As function composition (fg)(x) = f (g(x)) is associative, the last line can as well be

Computation[edit]

The recursive definition of the Ackermann function can naturally be transposed to a term rewriting system (TRS).

TRS, based on 2-ary function[edit]

The definition of the 2-ary (Péter-)Ackermann function corresponds with the reduction rules

Example

Compute

To compute one can use a stack, which initially contains the elements .

Then repeatedly the two top elements are replaced according to the rules [nb 1]

Schematically, starting from :

WHILE stackLength <> 1
{
   POP 2 elements;
   PUSH 1 to 3 elements in conformance with the rules r1, r2, r3
}

The pseudocode is published in Grossman & Zeitman (1988).

For example, on input the successive stack configurations are

This reflects the reduction sequence [nb 2]

Remarks

  • The leftmost-innermost strategy is implemented in 225 computer languages on Rosetta Code.
  • Grossman & Zeitman (1988) pointed out that in the computation of the maximum length of the stack is , as long as .

TRS, based on iterated 1-ary function[edit]

The definition of the iterated 1-ary Ackermann functions leads to different reduction rules

As function composition is associative, instead of rule r6 one can define

Like in the previous section the computation of can be implemented with a stack which maintains the current term.

Initially the stack contains the three elements .

Then repeatedly the three top elements are replaced according to the rules [nb 1]

Schematically, starting from :

WHILE stackLength <> 1
{
   POP 3 elements;
   PUSH 1 or 3 or 5 elements in conformance with the rules r4, r5, r6;
}

Example

On input the successive stack configurations are

The corresponding equalities are


When reduction rule r7 is used instead of r6, the replacements in the stack will follow

The successive stack configurations will then be

The corresponding equalities are

Remarks

  • On any given input all TRSs presented above converge in the same number of steps. They also use the same reduction rules (in this comparison the rules r1, r2, r3 are considered "the same as" the rules r4, r5, r6/7 respectively). For example, the reduction of converges in 14 steps: 6 X r1, 3 X r2, 5 X r3. The reduction of also converges in 14 steps: 6 X r4, 3 X r5, 5 X r6/7. The TRSs differ in the order in which the reduction rules are applied.
  • When is computed following the rules r4, r5, r6, the maximum length of the stack stays below 2 * . When reduction rule r7 is used instead of rule r6, the maximum length of the stack drops below 2(i+2). The length of the stack reflects the recursion depth. As the computation according to the rules r4, r5, r7 involves the smallest maximum depth of recursion seen so far, it is efficient in that respect.

Huge numbers[edit]

To demonstrate how the computation of results in many steps and in a large number: [nb 2]

Table of values[edit]

Computing the Ackermann function can be restated in terms of an infinite table. First, place the natural numbers along the top row. To determine a number in the table, take the number immediately to the left. Then use that number to look up the required number in the column given by that number and one row up. If there is no number to its left, simply look at the column headed "1" in the previous row. Here is a small upper-left portion of the table:

Values of A(mn)
n
m
0 1 2 3 4 n
0 1 2 3 4 5
1 2 3 4 5 6
2 3 5 7 9 11
3 5 13 29 61 125
4 13


65533


265536 − 3










5 65533

6
m

The numbers here which are only expressed with recursive exponentiation or Knuth arrows are very large and would take up too much space to notate in plain decimal digits.

Despite the large values occurring in this early section of the table, some even larger numbers have been defined, such as Graham's number, which cannot be written with any small number of Knuth arrows. This number is constructed with a technique similar to applying the Ackermann function to itself recursively.

This is a repeat of the above table, but with the values replaced by the relevant expression from the function definition to show the pattern clearly:

Values of A(mn)
n
m
0 1 2 3 4 n
0 0+1 1+1 2+1 3+1 4+1 n + 1
1 A(0, 1) A(0, A(1, 0))
= A(0, 2)
A(0, A(1, 1))
= A(0, 3)
A(0, A(1, 2))
= A(0, 4)
A(0, A(1, 3))
= A(0, 5)
A(0, A(1, n−1))
2 A(1, 1) A(1, A(2, 0))
= A(1, 3)
A(1, A(2, 1))
= A(1, 5)
A(1, A(2, 2))
= A(1, 7)
A(1, A(2, 3))
= A(1, 9)
A(1, A(2, n−1))
3 A(2, 1) A(2, A(3, 0))
= A(2, 5)
A(2, A(3, 1))
= A(2, 13)
A(2, A(3, 2))
= A(2, 29)
A(2, A(3, 3))
= A(2, 61)
A(2, A(3, n−1))
4 A(3, 1) A(3, A(4, 0))
= A(3, 13)
A(3, A(4, 1))
= A(3, 65533)
A(3, A(4, 2)) A(3, A(4, 3)) A(3, A(4, n−1))
5 A(4, 1) A(4, A(5, 0)) A(4, A(5, 1)) A(4, A(5, 2)) A(4, A(5, 3)) A(4, A(5, n−1))
6 A(5, 1) A(5, A(6, 0)) A(5, A(6, 1)) A(5, A(6, 2)) A(5, A(6, 3)) A(5, A(6, n−1))

Properties[edit]

General remarks[edit]

  • It may not be immediately obvious that the evaluation of always terminates. However, the recursion is bounded because in each recursive application either decreases, or remains the same and decreases. Each time that reaches zero, decreases, so eventually reaches zero as well. (Expressed more technically, in each case the pair decreases in the lexicographic order on pairs, which is a well-ordering, just like the ordering of single non-negative integers; this means one cannot go down in the ordering infinitely many times in succession.) However, when decreases there is no upper bound on how much can increase — and it will often increase greatly.
  • For small values of m like 1, 2, or 3, the Ackermann function grows relatively slowly with respect to n (at most exponentially). For , however, it grows much more quickly; even is about 2×1019728, and the decimal expansion of A(4, 3) is very large by any typical measure.
  • An interesting aspect is that the only arithmetic operation it ever uses is addition of 1. Its fast growing power is based solely on nested recursion. This also implies that its running time is at least proportional to its output, and so is also extremely huge. In actuality, for most cases the running time is far larger than the output; see below.
  • A single-argument version that increases both and at the same time dwarfs every primitive recursive function, including very fast-growing functions such as the exponential function, the factorial function, multi- and superfactorial functions, and even functions defined using Knuth's up-arrow notation (except when the indexed up-arrow is used). It can be seen that is roughly comparable to in the fast-growing hierarchy. This extreme growth can be exploited to show that which is obviously computable on a machine with infinite memory such as a Turing machine and so is a computable function, grows faster than any primitive recursive function and is therefore not primitive recursive.

Not primitive recursive[edit]

The Ackermann function grows faster than any primitive recursive function and therefore is not itself primitive recursive.

Specifically, one shows that to every primitive recursive function there exists a non-negative integer such that for all non-negative integers ,

Once this is established, it follows that itself is not primitive recursive, since otherwise putting would lead to the contradiction

The proof[15] proceeds as follows: define the class of all functions that grow slower than the Ackermann function

and show that contains all primitive recursive functions. The latter is achieved by showing that contains the constant functions, the successor function, the projection functions and that it is closed under the operations of function composition and primitive recursion.

Note that not being primitive recursive does not mean that it cannot be calculated using a for loop.[16]

Inverse[edit]

Since the function  f(n) = A(n, n) considered above grows very rapidly, its inverse function, f−1, grows very slowly. This inverse Ackermann function f−1 is usually denoted by α. In fact, α(n) is less than 5 for any practical input size n, since A(4, 4) is on the order of .

This inverse appears in the time complexity of some algorithms, such as the disjoint-set data structure and Chazelle's algorithm for minimum spanning trees. Sometimes Ackermann's original function or other variations are used in these settings, but they all grow at similarly high rates. In particular, some modified functions simplify the expression by eliminating the −3 and similar terms.

A two-parameter variation of the inverse Ackermann function can be defined as follows, where is the floor function:

This function arises in more precise analyses of the algorithms mentioned above, and gives a more refined time bound. In the disjoint-set data structure, m represents the number of operations while n represents the number of elements; in the minimum spanning tree algorithm, m represents the number of edges while n represents the number of vertices. Several slightly different definitions of α(m, n) exist; for example, log2 n is sometimes replaced by n, and the floor function is sometimes replaced by a ceiling.

Other studies might define an inverse function of one where m is set to a constant, such that the inverse applies to a particular row. [17]

The inverse of the Ackermann function is primitive recursive.[18]

Use as benchmark[edit]

The Ackermann function, due to its definition in terms of extremely deep recursion, can be used as a benchmark of a compiler's ability to optimize recursion. The first published use of Ackermann's function in this way was in 1970 by Dragoș Vaida[19] and, almost simultaneously, in 1971, by Yngve Sundblad.[20]

Sundblad's seminal paper was taken up by Brian Wichmann (co-author of the Whetstone benchmark) in a trilogy of papers written between 1975 and 1982.[21][22][23]

See also[edit]

References[edit]

  1. ^ a b here: leftmost-innermost strategy!
  2. ^ a b For better readability
    S(0) is notated as 1,
    S(S(0)) is notated as 2,
    S(S(S(0))) is notated as 3,
    etc...

Bibliography[edit]

External links[edit]

  1. ^ Monin & Hinchey 2003, p. 61.
  2. ^ a b Ackermann 1928.
  3. ^ "Decimal expansion of A(4,2)". kosara.net. August 27, 2000. Archived from the original on January 20, 2010.
  4. ^ Calude, Marcus & Tevy 1979.
  5. ^ Hilbert 1926, p. 185.
  6. ^ van Heijenoort 1967.
  7. ^ Péter 1935.
  8. ^ Robinson 1948.
  9. ^ Ritchie 1965, p. 1028.
  10. ^ with parameter order reversed
  11. ^ Buck 1963.
  12. ^ Munafo 1999a.
  13. ^ 'curried'
  14. ^ In each step the underlined redex is rewritten.
  15. ^ Woo, Chi (2009-12-17). "Ackermann function is not primitive recursive | planetmath.org". planetmath.org. Archived from the original on 2013-05-09.
  16. ^ "The Koop Loop (Ackermann's Function in a For loop) | timkoop". timkoop.com. 2021-04-27. Archived from the original on 2021-04-27. Retrieved 2021-04-27.
  17. ^ Pettie 2002.
  18. ^ Matos 2014.
  19. ^ Vaida 1970.
  20. ^ Sundblad 1971.
  21. ^ Wichmann 1976.
  22. ^ Wichmann 1977.
  23. ^ Wichmann 1982.