Introduction to Functional Programming

(Content adapted from Critchlow & Eck)

Functions are fundamental in computer programming, although not everything in programming that goes by the name of "function" is a function according to the mathematical definition.

In computer programming, a function is a routine that is given some data as input and that will calculate and return an answer based on that data. For example, in the Java programming language, a function that calculates the cube of an integer could be written

int cube(int n) {
    return n * n * n;
}

In Java, int is a data type. From the mathematical point of view, a data type is a set. The data type int is the set of all integers that can be represented as 32-bit binary numbers. Mathematically, then, $\textit{int}\subseteq\Z$ . (You should get used to the fact that sets and functions can have names that consist of more than one character, since it's done all the time in computer programming.) The first line of the above function definition, "int cube(int n)", says that we are defining a function named cube whose range is int and whose domain is int. In the usual notation for functions, we would express this as $\textit{cube}\colon \textit{int}\to\textit{int}$ , or possibly as $\textit{cube}\in{\textit{int}}^{\textit{\small{int}}}$ , where ${\textit{int}}^{\textit{\small{int}}}$ is the set of all functions that map the set int to the set int.

The first line of the function, int cube(int n), is called the signature of the function (in some languages, such as C++, it is called the prototype). The signature specifies the name, the domain, and the range of the function and so carries exactly the same information as the notation " $f\colon A\to B$ ". The " $n$ " in "int cube(int n)" is a name for an arbitrary element of the data type int. In computer jargon, $n$ is called a parameter of the function. The rest of the definition of cube tells the computer to calculate the value of $\textit{cube}(n)$ for any $n\in\textit{int}$ by multiplying $n\times n\times n$ . The statement "return n * n * n" says that $n\times n\times n$ is the value that is computed, or "returned," by the function. (The $*$ stands for multiplication.)

Java has many data types in addition to int. There is a boolean data type named boolean. The values of type boolean are true and false. Mathematically, boolean is a name for the set $\{\textit{true},\,\textit{false}\}$ . The type double consists of real numbers, which can include a decimal point. Of course, on a computer, it's not possible to represent the entire infinite set of real numbers, so double represents some subset of the mathematical set of real numbers. There is also a data type whose values are strings of characters, such as "Hello world" or "xyz152QQZ". The name for this data type in Java is String. All these types, and many others, can be used in functions. For example, in Java, $m\,\%\,n$ is the remainder when the integer $m$ is divided by the integer $n$ . We can define a function to test whether an integer is even as follows:

boolean even(int k) {
    if ( k % 2 == 1 )
      return false;
    else
      return true;
}

You don't need to worry about all the details here, but you should understand that the signature, boolean even(int k), says that even is a function from the set int to the set boolean. That is, $\textit{even}\colon\textit{int}\to\textit{boolean}$ . Given an integer $N$ , $\textit{even}(N)$ has the value true if $N$ is an even integer, and it has the value false if $N$ is an odd integer.

A function can have more than one parameter. For example, we might define a function with signature int index(String str, String sub). If $s$ and $t$ are strings, then $\textit{index}(s,t)$ would be the int that is the value of the function at the ordered pair $(s,t)$ . We see that the domain of index is the cross product $\textit{String}\times\textit{String}$ , and we can write $\textit{index}\colon \textit{String}\times\textit{String}\to\textit{int}$ or, equivalently, $\textit{index}\in\textit{int}^{\textit{String}\times\textit{String}}$ .

Partial and Impure Functions

Not every Java function is actually a function in the mathematical sense. In mathematics, a function must associate a single value in its range to each value in its domain. There are two things that can go wrong: The value of the function might not be defined for every element of the domain, and the function might associate several different values to the same element of the domain. Both of these things can happen with Java functions.

In computer programming, it is very common for a "function" to be undefined for some values of its parameter. In mathematics, a partial function from a set $A$ to a set $B$ is defined to be a function from a subset of $A$ to $B$ . A partial function from $A$ to $B$ can be undefined for some elements of $A$ , but when it is defined for some $a\in A$ , it associates just one element of $B$ to $a$ . Many functions in computer programs are actually partial functions. (When dealing with partial functions, an ordinary function, which is defined for every element of its domain, is sometimes referred to as a total function. Note that—with the mind-boggling logic that is typical of mathematicians—a total function is a type of partial function, because a set is a subset of itself.)

It's also very common for a "function" in a computer program to produce a variety of values for the same value of its parameter. This most frequently occurs because the function is impure—either it has a side-effect (such as changing the value of a global variable) beyond just computing a result, or it relies on a value that has been changed as a side-effect of some other part of the program. Another way to say that a function is impure is that it depends on a hidden state—some extra information affecting the result that is not provided by the function's direct inputs. A common example is the method in the java.util.Random class with signature int nextInt(int N), which returns a random integer between 0 and $N-1$ . The value of nextInt(5) could be 0, 1, 2, 3, or 4. This is not the behavior of a mathematical function! Behind the scenes, this function is storing a variable called the "seed", which is used to generate a random value, plus a new value for the seed, each time the function is called. Each time the seed changes, the next value to be returned is likely to change.

Even though many functions in computer programs are not really mathematical functions, I will continue to refer to them as functions in this section. Mathematicians will just have to stretch their definitions a bit to accommodate the realities of computer programming.

Pure Functional Programming

Unlike Java, a typical functional programming language such as Scala, Haskell, or ReasonML will actively discourage the use of side-effects in functions.¹ The benefit of restricting the programmer to pure functions, that always return the same value for a given argument, is that it becomes possible to reason about the behavior algebraically, freely substituting the returned values in place of function calls without having to worry about whether some "hidden state" might have changed since the last time the function was called.

For example, suppose that we have a function that computes the value of some polynomial, such as $f(x)=x^2+2x+1$ . If we know that another function $g\colon\textit{String}\to\textit{int}$ is pure, then we can be sure that $f(g(s))$ is the same as $g(s)^2+2\cdot g(s)+1$ , as well as $(g(s)+1)^2$ . All of these are algebraically equivalent, as long as $g$ is pure. This allows the programmer to reason more easily about the correctness of their program, and it also enables the computer to choose any equivalent expression for evaluation, ideally choosing the most efficient version.

One of the most common ways that a functional language will encourage pure functions is to do away with, or at least severely restrict, the ability to update the value assigned to a variable. In ReasonML, there is no assignment statement. When a value is bound to a variable with a let statement, that variable will then remain bound to that value for as long as the variable exists. A variable will cease to exist when the block (such as a function body) containing it is finished:

A variable may be temporarily shadowed by another variable with the same name. This may look like an assignment of a changed value to a variable, but each use of the let statement will create a new named location in memory; if the shadowing variable goes away, the original one will become visible again with its correct value:

Again, this behavior permits algebraic reasoning about the program. The above code is equivalent to

where we have uniformly renamed the inner variable x as y to make it clear that they are distinct variables. It is also equivalent to

where we have replaced each use of our identifiers with its value. The output is slightly different in this case, because we no longer have the top-level binding of 42 to x that would have been available if we wrote additional lines at the bottom of the program, but it is equivalent if we just look at the printed results.

First-Class Functions

In most programming languages, functions are not first-class objects. That is, a function cannot be treated as a data value in the same way as a String or an int. However, recent versions of Java have taken a step in this direction. It is possible for a function to be a parameter to another function, as long as it is wrapped in a "function object". For example, consider the function signature

int sumten( Function<Integer, Integer> f )

This is a signature for a function named sumten whose parameter is a function object. The parameter is specified by the type "Function<Integer, Integer>". If S and T are types, then the type Function<S, T> represents functions from S to T. Therefore, the parameter of sumten is essentially a function from int to int.² The parameter name, $f$ , stands for an arbitrary such function. Mathematically, $f\in \textit{int}^{\textit{int}}$ , and so $\textit{sumten}\colon \textit{int}^{\textit{int}}\to\textit{int}$ .

My idea is that $\textit{sumten}(f)$ would compute $f(1)+f(2)+\cdots+f(10)$ . A more useful function would be able to compute $f(a)+f(a+1)+\cdots+f(b)$ for any integers $a$ and $b$ . This just means that $a$ and $b$ should be parameters to the function. The signature for the improved function would look like

int sum( Function<Integer, Integer> f, int a, int b )

The parameters to sum form an ordered triple in which the first coordinate is a function and the second and third coordinates are integers. So, we could write

\textit{sum}\colon \textit{int}^{\textit{int}} \times\textit{int}\times\textit{int}\to\textit{int}

It's interesting that computer programmers deal routinely with such complex objects.

There are several ways of providing a function object as an argument in Java. If we want to pass the method m of an object x, where the signature of m is int m( int i ), then we can call our function as sum(x::m, a, b). However, a more general technique is to use an anonymous function, also known as a lambda.³

Anonymous functions

In Java, an expression such as i -> { return i * i * i; } creates a function object that takes an int (this will be determined from the context) and returns another int. This particular function cubes its input. Thus, if we call our sum function as follows:

sum(i -> { return i * i * i; }, 3, 5)

the result will be $3^3 + 4^3 + 5^3$ , which is 216.

Many languages now support a similar syntax for creating anonymous function values, and offer some facility for working with functions as (mostly) first-class objects. For example, the same function is expressed in ReasonML as i => { i * i * i }. Since one of the hallmarks of the functional languages is their ability to work with function values, you can imagine that they tend to provide the most thorough integration of functions with other kinds of values.

Here are complete demos of the sum example, first in Java and then in ReasonML:

import java.util.function.Function;

public class SumDemo {
  private static int sum(Function<Integer, Integer> f, int a, int b) {
    int total = 0;
    for (int i = a; i <= b; i++) {
      total += f.apply(i);
    }
    return total;
  }

  public static void main(String[] args) {
    System.out.println(sum(i -> { return i * i * i; }, 3, 5));
  }
}

Note that we define the function sum in ReasonML by binding a function value to the name sum, just as we can bind other types of values to identifiers. The keyword "rec" after the "let" indicates that the right-hand side expression is allowed to make use of the name that is currently being defined, so that we can make our function recursive. Without the "rec", any use of sum in the right-hand expression would refer to an older binding to that name.

As a simpler function definition in ReasonML, not needing the "rec" keyword, here is our cube function again:

This is just binding the anonymous function we have been using above to the name cube. Note that the function n => { n * n * n } is exactly the same as the function i => { i * i * i }, because the name of the parameter does not matter outside the function.

An interesting fact about ReasonML is that the operators are also functions, bound to names made out of operator symbols instead of letters and digits. To refer to an operator as a function value, just put the operator in parentheses: (+), (*), (==), …. Therefore, an expression such as a + b * c can also be written as (+)(a, (*)(b, c)) (note that this takes into account the usual higher precedence of multiplication over addition). For example, if we wanted to define an exponentiation operator on int, and call it ***, we could define it as follows:⁴

It is even possible in functional languages for a function to return another function as its value. For example,

Here, x *** n is our exponentiation operator from above, which computes $x^n$ , so for any integers $a$ and $n$ , the value of $\textit{monomial}(a,n)$ is a function that computes $ax^n$ . Thus,

would define $f$ to be the function that satisfies $f(x)=2x^3$ . This is now ready to be handed to our sum function:

would compute $2*3^3+2*4^3+2*5^3+2*6^3$ . In fact, monomial can be used to create an unlimited number of new functions from scratch. It is even possible to write monomial(2, 3)(5) to indicate the result of applying the function monomial(2, 3) to the value 5. The value represented by monomial(2, 3)(5) is $2*5^3$ , or 250. This is real functional programming and might give you some idea of its power.

Exercises

For each of the following Java function signatures, translate the signature into a standard mathematical function specification, such as $\textit{func}\colon\textit{double}\to\textit{int}$ .
- int strlen(String s)
  Answer
  $\textit{strlen}\colon\textit{String}\to\textit{int}$
- double pythag(double x, double y)
  Answer
  $\textit{pythag}\colon\textit{double}\times\textit{double}\to\textit{double}$
- int round(double x)
  Answer
  $\textit{round}\colon\textit{double}\to\textit{int}$
- String sub(String s, int n, int m)
  Answer
  $\textit{sub}\colon\textit{String}\times\textit{int}\times\textit{int}\to\textit{String}$
- String unlikely(Function<String, Integer> f)
  Answer
  $\textit{unlikely}\colon\textit{int}^\textit{String}\to\textit{String}$
- int h(Function<Integer, Integer> f, Function<Integer, Integer> g)
  Answer
  $\textit{h}\colon\textit{int}^\textit{int}\times\textit{int}^\textit{int}\to\textit{int}$
Write a Java function signature for a function that belongs to each of the following sets.
- $\textit{String}^{\textit{String}}$
  Answer
  String foo(String s)
- $\textit{boolean}^{\textit{double}\times\textit{double}}$
  Answer
  boolean bar(double x, double y)
- $\textit{double}^{ \textit{int}^{\textit{int}} }$
  Answer
  double baz(Function<Integer, Integer> f)
It is possible to define new types in Java. For example, the definition

public class Point {
  public double x;
  public double y;
}

defines a new type named Point. A value of type Point contains two values of type double. What mathematical operation corresponds to the construction of this data type? Why?

Answer

Each element of Point contains a pair of numbers, so the type corresponds to the cartesian product $\textit{double}\times\textit{double}$ .

Let cube, sum and monomial be the ReasonML functions described in this section. What is the value of each of the following?
- sum(cube, 2, 4)
  Answer
  $2^3 + 3^3 + 4^3 = 8 + 27 + 64 = 99$
- sum(monomial(5, 2), 1, 3)
  Answer
  $5\cdot1^2 + 5\cdot2^2 + 5\cdot3^2 = 5 + 20 + 45 = 70$
- monomial(cube(2), 7)
  Answer
  The function $f$ such that $f(x) = 8x^7$
- sum( $n\Rightarrow\{ 2*n \}$ , 1, 5)
  Answer
  $2\cdot1 + 2\cdot2 + 2\cdot3 + 2\cdot4 + 2\cdot5 = 2 + 4 + 6 + 8 + 10 = 30$
- cube(sum(monomial(2, 3), 1, 2))
  Answer
  $(2\cdot1^3 + 2\cdot2^3)^3 = (2 + 16)^3 = 18^3 = 5832$
Write a ReasonML function named compose that computes the composition of two functions. That is, $\textit{compose}(f, g)$ is $f\circ g$ , where $f$ and $g$ are functions of one parameter. Recall that $f\circ g$ is the function defined by $(f\circ g)(x)=f(g(x))$ .
Answer
let compose = (f, g) => { x => { f(g(x)) } }
Consider the following ReasonML function:

What is the value of exercise(4, 5)?
Answer
9
What is the value of exercise(12, 13)?
Answer
25
Use algebraic substitution to evaluate exercise(a, b) in terms of the variables a and b.
Answer
First, c = a - b, and d = a - c, so d = b. Now, e = c + d, so e = a. Finally, the result is s(d) - s(e), which will be b * b - a * a.

The most common exception is for functions that send output to the console, such as the print_int function in ReasonML. Being able to track execution and easily display results is very useful, and printing a line on the console is a fairly benign side-effect—it won't cause the function to return different values for the same arguments. However, printing a result can still interfere with algebraic reasoning about a program, because interchanging such a function call with its value can affect whether and how many times the output is printed.↩
For our purposes we may ignore the distinction between int and Integer in Java.↩
The mathematician Alonzo Church introduced in the 1930's the use of the Greek letter lambda ( $\lambda$ ) to indicate an otherwise unnamed function defined by a formula. That is, instead of writing "the function $f$ where $f(x) = \textit{some formula}$ ", he wrote " $\lambda x(\textit{some formula})$ ". When the first functional programming language, LISP (invented in the late 1950's), needed a way to create function values, John McCarthy adopted Church's use of lambda, and the name has stuck.↩
The code here is based on the solution to an exercise in the Recursion section.]↩

Partial and Impure Functions​

Pure Functional Programming​

First-Class Functions​

Anonymous functions​

Exercises​

Partial and Impure Functions

Pure Functional Programming

First-Class Functions

Anonymous functions

Exercises