I'm trying to understand what the dot operator is doing in this Haskell code:
sumEuler = sum . (map euler) . mkList
The entire source code is below.
My understanding
The dot operator is taking the two functions sum
and the result of map euler
and the result of mkList
as the input.
But, sum
isn't a function it is the argument of the function, right? So what is going on here?
Also, what is (map euler)
doing?
Code
mkList :: Int -> [Int]
mkList n = [1..n-1]
euler :: Int -> Int
euler n = length (filter (relprime n) (mkList n))
sumEuler :: Int -> Int
sumEuler = sum . (map euler) . mkList
The . operator is used for function composition. Just like math, if you have to functions f(x) and g(x) f . g becomes f(g(x)).
map is a built-in function which applies a function to a list. By putting the function in parentheses the function is treated as an argument. A term for this is currying. You should look that up.
What is does is that it takes a function with say two arguments, it applies the argument euler. (map euler) right? and the result is a new function, which takes only one argument.
sum . (map euler) . mkList is basically a fancy way of putting all that together. I must say, my Haskell is a bit rusty but maybe you can put that last function together yourself?
The dot operator applies the function on the left (
sum
) to the output of the function on the right. In your case, you're chaining several functions together - you're passing the result ofmkList
to(map euler)
, and then passing the result of that tosum
. This site has a good introduction to several of the concepts.Put simply,
.
is function composition, just like in math:In your case, you are creating a new function,
sumEuler
that could also be defined like this:The style in your example is called "point-free" style -- the arguments to the function are omitted. This makes for clearer code in many cases. (It can be hard to grok the first time you see it, but you will get used to it after a while. It is a common Haskell idiom.)
If you are still confused, it may help to relate
.
to something like a UNIX pipe. Iff
's output becomesg
's input, whose output becomesh
's input, you'd write that on the command-line likef < x | g | h
. In Haskell,.
works like the UNIX|
, but "backwards" --h . g . f $ x
. I find this notation to be quite helpful when, say, processing a list. Instead of some unwieldy construction likemap (\x -> x * 2 + 10) [1..10]
, you could just write(+10) . (*2) <$> [1..10]
. (And, if you want to only apply that function to a single value; it's(+10) . (*2) $ 10
. Consistent!)The Haskell wiki has a good article with some more detail: http://www.haskell.org/haskellwiki/Pointfree
Short answer
Equivalent code without dots, that is just
or without the lambda
because the dot (.) indicates function composition.
Longer answer
First, let's simplify the partial application of
euler
tomap
:Now we just have the dots. What is indicated by these dots?
From the source:
Thus
(.)
is the compose operator.Compose
In math, we might write the composition of functions, f(x) and g(x), that is, f(g(x)), as
which can be read "f composed with g".
So in Haskell, f ∘ g, or f composed with g, can be written:
Composition is associative, which means that f(g(h(x))), written with the composition operator, can leave out the parentheses without any ambiguity.
That is, since (f ∘ g) ∘ h is equivalent to f ∘ (g ∘ h), we can simply write f ∘ g ∘ h.
Circling back
Circling back to our earlier simplification, this:
just means that
sumEuler
is an unapplied composition of those functions:sum
is a function in the Haskell Prelude, not an argument tosumEuler
. It has the typeThe function composition operator
.
has typeSo we have
Note that
Int
is indeed an instance of theNum
typeclass.The . operator composes functions. For example,
Where a and b are functions is a new function that runs b on its arguments, then a on those results. Your code
is exactly the same as:
but hopefully easier to read. The reason there are parens around map euler is because it makes it clearer that there are 3 functions being composed: sum, map euler and mkList - map euler is a single function.