Functional programming: Difference between revisions

Content deleted Content added

Inline

Revision as of 16:52, 21 November 2008

In computer science, functional programming is a programming paradigm that treats computation as the evaluation of mathematical functions and avoids state and mutable data. It emphasizes the application of functions, in contrast to the imperative programming style, which emphasizes changes in state.^[1]

The lambda calculus provides the model for functional programming. Modern functional languages can be viewed as embellishments to the lambda calculus.^[1]

While not fully functional, both the original Lisp and APL were important in the development of functional programming. Later versions of Lisp such as Scheme and variants of APL did provide full functional support. Other important functional languages include Erlang, Haskell, and ML.

Functional programming languages, especially purely functional ones, have largely been emphasized in academia rather than in commercial software development. However, notable functional programming languages used in industry and commercial applications include Erlang,^[2] OCaml,^[3] Haskell,^[4] Scheme (since 1986)^[5]^[6] and domain-specific programming languages like R (statistics),^[7] Mathematica (symbolic math),^[8] J and K (financial analysis), and XSLT (XML).^[9]^[10]

Many non-functional programming languages such as C, C++ and C# can be made to exhibit functional behaviors using function pointers, the <functional> library and lambda functions respectively.

History

Lambda calculus provides a theoretical framework for describing functions and their evaluation. Although it is a mathematical abstraction rather than a programming language, it forms the basis of almost all functional programming languages today.

Combinatory logic is an equivalent theoretical foundation, developed by Moses Schönfinkel and Haskell Curry. It was originally developed to achieve a clearer approach to the foundations of mathematics.^[11] Combinatory logic is commonly perceived as more abstract than lambda calculus and preceded it in invention.

An early functional-flavored language was LISP, developed by John McCarthy while at MIT for the IBM 700/7000 series scientific computers in the late 1950s.^[12] LISP introduced many features now found in functional languages, though LISP is technically a multi-paradigm language. Scheme and Dylan were later attempts to simplify and improve LISP.

Information Processing Language (IPL) is sometimes cited as the first computer-based functional programming language. It is an assembly-style language for manipulating lists of symbols. It does have a notion of "generator", which amounts to a function accepting a function as an argument, and, since it is an assembly-level language, code can be used as data, so IPL can be regarded as having higher-order functions. However, it relies heavily on mutating list structure and similar imperative features.

Kenneth E. Iverson developed the APL programming language in the early 1960s, described in his 1962 book A Programming Language (ISBN 9780471430148). APL was the primary influence on John Backus's FP programming language. In the early 1990s, Iverson and Roger Hui created a successor to APL, the J programming language. In the mid 1990s, Arthur Whitney, who had previously worked with Iverson, created the K programming language, which is used commercially in financial industries.

John Backus presented the FP programming language in his 1977 Turing Award lecture Can Programming Be Liberated From the von Neumann Style? A Functional Style and its Algebra of Programs. He defines functional programs as being built up in a hierarchical way by means of "combining forms" that allow an "algebra of programs"; in modern language, this means that functional programs follow the principle of compositionality. Backus's paper popularized research into functional programming, though it emphasized function-level programming rather than the lambda-calculus style which has come to be associated with functional programming.

In the 1970s the ML programming language was created by Robin Milner at the University of Edinburgh, and David Turner developed initially the language SASL at the University of St. Andrews and later the language Miranda at the University of Kent. ML eventually developed into several dialects, the most common of which are now Objective Caml and Standard ML. The Haskell programming language was released in the late 1980s in an attempt to gather together many ideas in functional programming research.

In 2008, Microsoft Research is adding F#, a functional language, to their .NET platform.^[13]

Concepts

A number of concepts and paradigms are specific to functional programming, and generally foreign to imperative programming (including object oriented programming). However, programming languages are often hybrids of several programming paradigms so programmers using "mostly imperative" languages may have utilized some of these concepts.^[14]

Higher-order functions

Functions are higher-order when they can take other functions as arguments, and return them as results. (The derivative and antiderivative in calculus are examples of this.)

Higher-order functions are closely related to first-class functions, in that higher-order functions and first-class functions both allow functions as arguments and results of other functions. The distinction between the two is subtle: "higher-order" describes a mathematical concept of functions that operate on other functions, while "first-class" is a computer science term that describes programming language entities that have no restriction on their use (thus first-class functions can appear anywhere in the program that other first-class entities like numbers can, including as arguments to other functions and as their return values).

Higher-order functions enable currying, a technique in which a function is applied to its arguments one at a time, with each application returning a new function that accepts the next argument.

Pure functions

Purely functional functions (or expressions) have no memory or I/O side effects, unless the computation of the result in itself is counted as a side-effect. This means that pure functions have several useful properties, many of which can be used to optimize the code:

If the result of a pure expression is not used, it can be removed without affecting other expressions.
If a pure function is called with parameters that cause no side-effects, the result is constant with respect to that parameter list (sometimes called referential transparency), i.e. if the pure function is again called with the same parameters, the same result will be returned (this can enable caching optimisations).
If there is no data dependency between two pure expressions, then their order can be reversed, or they can be performed in parallel and they cannot interfere with one another (in other terms, the evaluation of any pure expression is thread-safe).
If the entire language does not allow side-effects, then any evaluation strategy can be used; this gives the compiler freedom to reorder or combine the evaluation of expressions in a program (for example, using lazy evaluation).

While most compilers for imperative programming languages detect pure functions, and perform common-subexpression elimination for pure function calls, they cannot always do this for pre-compiled libraries, which generally do not expose this information, thus preventing optimisations that involve those external functions. Some compilers, such as gcc, add extra keywords for a programmer to explicitly mark external functions as pure, to enable such optimisations. Fortran 95 allows functions to be designated "pure".

Recursion

Iteration (looping) in functional languages is usually accomplished via recursion. Recursive functions invoke themselves, allowing an operation to be performed over and over. Recursion may require maintaining a stack, but tail recursion can be recognized and optimized by a compiler into the same code used to implement iteration in imperative languages. The Scheme programming language standard requires implementations to recognize and optimize tail recursion.

Common patterns of recursion can be factored out using higher order functions, catamorphisms and anamorphisms (or "folds" and "unfolds") being the most obvious examples. Such higher order functions play a role analogous to built-in control structures such as loops in imperative languages.

Strict versus non-strict evaluation

Functional languages can be categorized by whether they use strict or non-strict evaluation, concepts that refer to how function arguments are processed when an expression is being evaluated.

In brief, strict evaluation always fully evaluates function arguments before invoking the function. Non-strict evaluation is free to do otherwise.

To illustrate, consider the following two functions f and g:

f(x) := x^2 + x + 1
g(x, y) := x + y

Under strict evaluation, we would have to evaluate function arguments first, for example:

  f(g(1, 4))
= f(1 + 4)
= f(5)
= 5^2 + 5 + 1
= 31

By contrast, non-strict evaluation need not fully evaluate the arguments; in particular it may send the arguments unevaluated to the function, perhaps evaluating them later. For example, one non-strict strategy (call-by-name) might work as follows:

  f(g(1, 4))
= g(1, 4)^2 + g(1, 4) + 1
= (1 + 4)^2 + (1 + 4) + 1
= 5^2 + 5 + 1
= 31

A key property of strict evaluation is that when an argument expression fails to terminate, the whole expression fails to terminate. With non-strict evaluation, this need not be the case, since argument expressions need not be evaluated at all.

Advantages of strict-evaluation

Parameters are usually passed around as (simple) atomic units, rather than as (rich) expressions. (For example, the integer 5 can be passed on a register, whereas the expression 1+4 will require several memory locations). This has a direct implementation on standard hardware.

The order of evaluation is quite clear to the programmer: every argument must be evaluated before the function body is invoked.

Advantages of non-strict-evaluation

Lambda calculus provides a stronger theoretic foundation for languages that employ non-strict evaluation.^[1]

A non-strict evaluator may recognize that a sub-expression does not need to be evaluated. For example, given the definitions

multiply(0, x) = 0;
multiply(n, x) = x + multiply(n-1, x);
f(0) = 1;
f(n) = n * f(n-1);

and expression

multiply(0, f(1000000))

a strict evaluator would (strictly speaking) need to take (on the order of) 1,000,000 steps to find the value of f(1000000). A non-strict evaluator may use the definition of multiply first, reducing the whole expression to 0 before even trying to compute f(1000000).

Non-strict evaluation can use the above to allow "infinite" data structures. For example, using Haskell, given the definitions

evens n = n : [evens (n+2)] -- an "infinite list" of even numbers starting with n
 
-- The function "take n" returns the first n elements of its argument
take 0 (list)   = []                  -- when n is 0, return an empty list
take n (x:list) = x : (take (n-1) list) -- otherwise, return the first element and n-1 of the next elements

the expression

take 4 (evens 0)

quickly returns [0,2,4,6]. Under strict evaluation, evens would need to be fully evaluated before take could be called, but since evens is recursive it would never (strictly speaking) terminate. With non-strict evaluation, the take 4 function only forces the evaluation of four elements of evens 0 and the other elements are never inspected or evaluated.

Lazy evaluation

The need for a more efficient form of non-strict evaluation led to the development of lazy evaluation, a type of non-strict evaluation, where the initial evaluation of an argument is shared throughout the evaluation sequence. Consequently an argument (such as g(1, 4) in the above example) is never evaluated more than once. Under lazy evaluation, expressions are sent to subordinate functions as references to expression trees whose value has not yet been computed. When any such expression tree must be expanded, the expression tree "remembers" its result, and thus avoids recomputing the same expression a second time. In the initial example, this would pose as follows:

= f(g(1, 4))
= g(1, 4)^2 + g(1, 4) + 1

It is then necessary to evaluate g(1, 4). This can be computed once, yielding:

  g(1, 4)
  = 1 + 4
  = 5

Then, since both references to g(1, 4) are references to the same (pure) expression, both "know" that their value is 5. This means that their value is computed only once, even though they are passed symbolically to the function f.

= 5^2 + 5 + 1
= 25 + 5 + 1
= 31

Lazy evaluation tends to be used by default in pure functional languages such as Miranda, Clean and Haskell.

Functional programming in non-functional languages

It is possible to employ a functional style of programming in languages that are not traditionally considered functional languages.^[15] Some non-functional languages have borrowed features such as higher-order functions, and list comprehensions from functional programming languages. This makes it easier to adopt a functional style when using these languages. Functional constructs such as higher-order functions and lazy lists can be obtained in C++ via libraries.^[16] In C, function pointers can be used to get some of the effects of higher-order functions. For example the common function map can be implemented using function pointers. In C# version 3.0 and higher, lambda functions can be employed to write programs in a functional style.^{[citation needed]} Widespread declarative domain specific languages like SQL and Lex/Yacc, although not always Turing-complete, use some elements of functional programming, especially in eschewing mutable values.^[17]

Comparison of functional and imperative programming

Functional programming is very different from imperative programming. The most significant differences stem from the fact that functional programming avoids side effects, which are used in imperative programming to implement state and I/O. Pure functional programming disallows side effects completely. Disallowing side effects provides for referential transparency, which makes it easier to verify, optimize, and parallelize programs, and easier to write automated tools to perform those tasks.

Higher order functions are rarely used in older imperative programming. Where a traditional imperative program might use a loop to traverse a list, a functional style would often use a higher-order function, map, that takes as arguments a function and a list, applies the function to each element of the list, and returns a list of the results.

Simulating state

There are tasks—for example, maintaining a bank account balance—that often seem most naturally implemented with state. Pure functional programming performs these tasks, and I/O tasks such as accepting user input and printing to the screen, in a different way.

The pure functional programming language Haskell implements them using monads, derived from category theory. Monads are powerful and offer an intuitive way to model state (and other side effects such as I/O) in an imperative manner without losing purity. While existing monads are easy to use, many find it difficult to understand how to define new monads (which is sometimes needed for certain types of libraries).^[18]

Alternative methods such as Hoare logic and uniqueness have been developed to track side effects in programs. Some modern research languages use effect systems to make explicit the presence of side effects.

Efficiency issues

Functional programming languages have been perceived as less efficient in their use of CPU and memory than imperative languages such as C and Pascal.^{[citation needed]} However, for programs that perform intensive numerical computations, functional languages such as OCaml and Clean are similar in speed to C. For programs that handle large matrices and multidimensional databases, array functional languages (such as J and K) were designed with speed optimization in mind.

Purely functional languages have a reputation for being slower than imperative languages.^{[citation needed]} However, immutability of data can, in many cases, lead to execution efficiency in allowing the compiler to make assumptions that are unsafe in an imperative language.^{[citation needed]}

For purely functional languages, the worst-case slowdown is exponential.^[19] However, such slowdowns occur very rarely in practice.^{[citation needed]}

Coding styles

Imperative programs tend to emphasize the series of steps taken by a program in carrying out an action, while functional programs tend to emphasize the composition and arrangement of functions, often without specifying explicit steps. A simple example of two solutions to the same programming goal (using the same multi-paradigm language Python) illustrates this.

# imperative style
target = [] # create empty list
for item in source_list: # iterate over each thing in source
    trans1 = G(item) # transform the item with the G() function
    trans2 = F(trans1) # second transform with the F() function
    target.append(trans2) # add transformed item to target

A functional version has a different feel to it:

# functional style
# FP-oriented languages often have standard compose()
compose2 = lambda F, G: lambda x: F(G(x))
target = map(compose2(F, G), source_list)

In contrast to the imperative style that describes the steps involved in building target, the functional style describes the mathematical relationship between source_list and target.

In practice in Python that code is often written with a list comprehension, a form of syntactic sugar for the for loop with implicit append:

target = [F(G(item)) for item in source_list]

References

^ ^a ^b ^c Hudak, Paul (1989). "Conception, evolution, and application of functional programming languages" (PDF). ACM Computing Surveys. 21 (3): 359–411. doi:10.1145/72551.72554. {{cite journal}}: Unknown parameter |month= ignored (help)
^ "Who uses Erlang for product development?". Frequently asked questions about Erlang. Retrieved 2007-08-05.
^ Minsky, Yaron; Weeks, Stephen (July 2008). "Caml Trading - experiences with functional programming on Wall Street". Journal of Functional Programming. 18 (4). Cambridge University Press: 553–564. doi:10.1017/S095679680800676X. Retrieved 2008-08-27.
^ ""Haskell - Haskell Wiki". Retrieved 2008-08-27.
^ Clinger, Will (1987). "MultiTasking and MacScheme". MacTech. Vol. 3, no. 12. Retrieved 2008-08-28.
^ Hartheimer, Anne (1987). "Programming a Text Editor in MacScheme+Toolsmith". MacTech. Vol. 3, no. 1. Retrieved 2008-08-28.
^ The useR! 2006 conference schedule includes papers on the commercial use of R
^ Department of Applied Math, University of Colorado. "Functional vs. Procedural Programming Language". Retrieved 2006-08-28.
^ Dimitre Novatchev. "The Functional Programming Language XSLT - A proof through examples". TopXML. {{cite web}}: Unknown parameter |accessmonthday= ignored (help); Unknown parameter |accessyear= ignored (|access-date= suggested) (help)
^ David Mertz. "XML Programming Paradigms (part four): Functional Programming approached to XML processing". IBM developerWorks. {{cite web}}: Unknown parameter |accessmonthday= ignored (help); Unknown parameter |accessyear= ignored (|access-date= suggested) (help)
^ Curry, Haskell Brooks (1958). Combinatory Logic. Volume I. Amsterdam: North-Holland Publishing Company. {{cite book}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
^ McCarthy, John (1978). "History of Lisp". In ACM SIGPLAN History of Programming Languages Conference: 173–196. {{cite journal}}: Unknown parameter |month= ignored (help) " The implementation of LISP began in Fall 1958."
^ "The F# September 2008 CTP is now available!"., Don Syme's WebLog on the F# Language and Related Topics
^ Dick Pountain. "Functional Programming Comes of Age". BYTE.com (August 1994). {{cite web}}: Unknown parameter |accessmonthday= ignored (help); Unknown parameter |accessyear= ignored (|access-date= suggested) (help)
^ Hartel, Pieter (2004). "The Functional C experience" (PDF). The Journal of Functional Programming. 14 (2): 129–135. doi:10.1017/S0956796803004817. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |month= ignored (help); David Mertz. "Functional programming in Python, Part 3". IBM developerWorks. Retrieved 2006-09-17.(Part 1, Part 2)
^ McNamara, B. "FC++: Functional Programming in C++". Retrieved 2006-05-28.
^ Donald D. Chamberlin and Raymond F. Boyce (1974). "SEQUEL: A structured English query language". Proceedings of the 1974 ACM SIGFIDET: 249–264.. In this paper, one of the first formal presentations of the concepts of SQL (and before the name was later abbreviated), Chamberlin and Boyce emphasize that SQL was developed "Without resorting to the concepts of bound variables and quantifiers".
^ Newbern, J. "All About Monads: A comprehensive guide to the theory and practice of monadic programming in Haskell". Retrieved 2008-02-14., "The sheer number of different monad tutorials on the internet is a good indication of the difficulty many people have understanding the concept. This is due to the abstract nature of monads and to the fact that they are used in several different capacities, which can confuse the picture of exactly what a monad is and what it is good for."
^ R.A. DeMillo, S.C. Eisenstat, R.J. Lipton (1980). "Space-time trade-offs in structured programming", JACM 27: 123-127. doi:10.1145/322169.322180

External links

Why Functional Programming Matters by John Hughes
Functional Programming for the Rest of Us An introduction
"Functional Programming" — Chapter 4 of Advanced Programming Language Design by Raphael Finkel, an introductory explanation of functional programming
Functional programming in Python (by David Mertz): part 1, part 2, part 3
The Implementation of Functional Programming Languages Simon Peyton Jones, published by Prentice Hall, 1987.

[hudak1989-1] Hudak, Paul (1989). "Conception, evolution, and application of functional programming languages" (PDF). ACM Computing Surveys. 21 (3): 359–411. doi:10.1145/72551.72554. {{cite journal}}: Unknown parameter |month= ignored (help)

[2] "Who uses Erlang for product development?". Frequently asked questions about Erlang. Retrieved 2007-08-05.

[3] Minsky, Yaron; Weeks, Stephen (July 2008). "Caml Trading - experiences with functional programming on Wall Street". Journal of Functional Programming. 18 (4). Cambridge University Press: 553–564. doi:10.1017/S095679680800676X. Retrieved 2008-08-27.

[4] ""Haskell - Haskell Wiki". Retrieved 2008-08-27.

[5] Clinger, Will (1987). "MultiTasking and MacScheme". MacTech. Vol. 3, no. 12. Retrieved 2008-08-28.

[6] Hartheimer, Anne (1987). "Programming a Text Editor in MacScheme+Toolsmith". MacTech. Vol. 3, no. 1. Retrieved 2008-08-28.

[7] The useR! 2006 conference schedule includes papers on the commercial use of R

[8] Department of Applied Math, University of Colorado. "Functional vs. Procedural Programming Language". Retrieved 2006-08-28.

[9] Dimitre Novatchev. "The Functional Programming Language XSLT - A proof through examples". TopXML. {{cite web}}: Unknown parameter |accessmonthday= ignored (help); Unknown parameter |accessyear= ignored (|access-date= suggested) (help)

[10] David Mertz. "XML Programming Paradigms (part four): Functional Programming approached to XML processing". IBM developerWorks. {{cite web}}: Unknown parameter |accessmonthday= ignored (help); Unknown parameter |accessyear= ignored (|access-date= suggested) (help)

[11] Curry, Haskell Brooks (1958). Combinatory Logic. Volume I. Amsterdam: North-Holland Publishing Company. {{cite book}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)

[12] McCarthy, John (1978). "History of Lisp". In ACM SIGPLAN History of Programming Languages Conference: 173–196. {{cite journal}}: Unknown parameter |month= ignored (help) " The implementation of LISP began in Fall 1958."

[13] "The F# September 2008 CTP is now available!"., Don Syme's WebLog on the F# Language and Related Topics

[14] Dick Pountain. "Functional Programming Comes of Age". BYTE.com (August 1994). {{cite web}}: Unknown parameter |accessmonthday= ignored (help); Unknown parameter |accessyear= ignored (|access-date= suggested) (help)

[15] Hartel, Pieter (2004). "The Functional C experience" (PDF). The Journal of Functional Programming. 14 (2): 129–135. doi:10.1017/S0956796803004817. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help); Unknown parameter |month= ignored (help); David Mertz. "Functional programming in Python, Part 3". IBM developerWorks. Retrieved 2006-09-17.(Part 1, Part 2)

[16] McNamara, B. "FC++: Functional Programming in C++". Retrieved 2006-05-28.

[17] Donald D. Chamberlin and Raymond F. Boyce (1974). "SEQUEL: A structured English query language". Proceedings of the 1974 ACM SIGFIDET: 249–264.. In this paper, one of the first formal presentations of the concepts of SQL (and before the name was later abbreviated), Chamberlin and Boyce emphasize that SQL was developed "Without resorting to the concepts of bound variables and quantifiers".

[18] Newbern, J. "All About Monads: A comprehensive guide to the theory and practice of monadic programming in Haskell". Retrieved 2008-02-14., "The sheer number of different monad tutorials on the internet is a good indication of the difficulty many people have understanding the concept. This is due to the abstract nature of monads and to the fact that they are used in several different capacities, which can confuse the picture of exactly what a monad is and what it is good for."

[19] R.A. DeMillo, S.C. Eisenstat, R.J. Lipton (1980). "Space-time trade-offs in structured programming", JACM 27: 123-127. doi:10.1145/322169.322180

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

@@ Line 1: / Line 1: @@
 {{Cleanup-jargon|date=October 2008}}
-In [[computer science]], '''functional programming''' is a [[programming paradigm]] that treats [[computation]] as the evaluation of [[function (mathematics)|mathematical function]]s and avoids [[program state|state]] and [[immutable object|mutable]] data. It emphasizes the application of functions, in contrast with the [[imperative programming]] style which emphasizes changes in state.<ref name="hudak1989">{{cite journal|last=Hudak|first=Paul|authorlink=Paul Hudak|title=Conception, evolution, and application of functional programming languages|journal=[[Association for Computing Machinery|ACM]] Computing Surveys|volume=21|issue=3|pages=359–411|month=September|year=1989|url=http://www.cs.berkeley.edu/~jcondit/pl-prelim/hudak89functional.pdf|doi=10.1145/72551.72554}}</ref>
+In [[computer science]], '''functional programming''' is a [[programming paradigm]] that treats [[computation]] as the evaluation of [[function (mathematics)|mathematical function]]s and avoids [[program state|state]] and [[immutable object|mutable]] data. It emphasizes the application of functions, in contrast to the [[imperative programming]] style, which emphasizes changes in state.<ref name="hudak1989">{{cite journal|last=Hudak|first=Paul|authorlink=Paul Hudak|title=Conception, evolution, and application of functional programming languages|journal=[[Association for Computing Machinery|ACM]] Computing Surveys|volume=21|issue=3|pages=359–411|month=September|year=1989|url=http://www.cs.berkeley.edu/~jcondit/pl-prelim/hudak89functional.pdf|doi=10.1145/72551.72554}}</ref>
 The [[lambda calculus]] provides the model for functional programming. Modern functional languages can be viewed as embellishments to the lambda calculus.<ref name="hudak1989"/>

v t e Programming languages
Comparison Timeline History
Ada ALGOL Simula APL Assembly BASIC Visual Basic classic .NET C C++ C# COBOL Erlang Forth Fortran Go Haskell Java JavaScript Julia Kotlin Lisp Lua MATLAB ML Pascal Object Pascal Perl PHP Prolog Python R Ruby Rust SQL Scratch Shell Smalltalk Swift more...
Lists: Alphabetical Categorical Generational Non-English-based Category