Who first said the following?
A monad is just a monoid in the category of endofunctors, what's the problem?
And on a less important note, is this true and if so could you give an explanation (hopefully one that can be understood by someone who doesn't have much Haskell experience)?
Note: No, this isn't true. At some point there was a comment on this answer from Dan Piponi himself saying that the cause and effect here was exactly the opposite, that he wrote his article in response to James Iry's quip. But it seems to have been removed, perhaps by some compulsive tidier.
Below is my original answer.
It's quite possible that Iry had read From Monoids to Monads, a post in which Dan Piponi (sigfpe) derives monads from monoids in Haskell, with much discussion of category theory and explicit mention of "the category of endofunctors on Hask" . In any case, anyone who wonders what it means for a monad to be a monoid in the category of endofunctors might benefit from reading this derivation.
First, the extensions and libraries that we're going to use:
Of these,
RankNTypes
is the only one that's absolutely essential to the below. I once wrote an explanation ofRankNTypes
that some people seem to have found useful, so I'll refer to that.Quoting Tom Crockett's excellent answer, we have:
How do we translate this to Haskell code? Well, let's start with the notion of a natural transformation:
A type of the form
f :-> g
is analogous to a function type, but instead of thinking of it as a function between two types (of kind*
), think of it as a morphism between two functors (each of kind* -> *
). Examples:Basically, in Haskell, natural transformations are functions from some type
f x
to another typeg x
such that thex
type variable is "inaccessible" to the caller. So for example,sort :: Ord a => [a] -> [a]
cannot be made into a natural transformation, because it's "picky" about which types we may instantiate fora
. One intuitive way I often use to think of this is the following:Now, with that out of the way, let's tackle the clauses of the definition.
The first clause is "an endofunctor, T : X -> X." Well, every
Functor
in Haskell is an endofunctor in what people call "the Hask category," whose objects are Haskell types (of kind*
) and whose morphisms are Haskell functions. This sounds like a complicated statement, but it's actually a very trivial one. All it means is that that aFunctor f :: * -> *
gives you the means of constructing a typef a :: *
for anya :: *
and a functionfmap f :: f a -> f b
out of anyf :: a -> b
, and that these obey the functor laws.Second clause: the
Identity
functor in Haskell (which comes with the Platform, so you can just import it) is defined this way:So the natural transformation η : I -> T from Tom Crockett's definition can be written this way for any
Monad
instancet
:Third clause: The composition of two functors in Haskell can be defined this way (which also comes with the Platform):
So the natural transformation μ : T × T -> T from Tom Crockett's definition can be written like this:
The statement that this is a monoid in the category of endofunctors then means that
Compose
(partially applied to just its first two parameters) is associative, and thatIdentity
is its identity element. I.e., that the following isomorphisms hold:Compose f (Compose g h) ~= Compose (Compose f g) h
Compose f Identity ~= f
Compose Identity g ~= g
These are very easy to prove because
Compose
andIdentity
are both defined asnewtype
, and the Haskell Reports define the semantics ofnewtype
as an isomorphism between the type being defined and the type of the argument to thenewtype
's data constructor. So for example, let's proveCompose f Identity ~= f
:I came to this post by way of better understanding the inference of the infamous quote from Mac Lane's Category Theory For the Working Mathematician.
In describing what something is, it's often equally useful to describe what it's not.
The fact that Mac Lane uses the description to describe a Monad, one might imply that it describes something unique to monads. Bear with me. To develop a broader understanding of the statement, I believe it needs to be made clear that he is not describing something that is unique to monads; the statement equally describes Applicative and Arrows among others. For the same reason we can have two monoids on Int (Sum and Product), we can have several monoids on X in the category of endofunctors. But there is even more to the similarities.
Both Monad and Applicative meet the criteria:
(e.g., in day to day
Tree a -> List b
, but in CategoryTree -> List
)Tree -> List
, onlyList -> List
.The statement uses "Category of..." This defines the scope of the statement. As an example, the Functor Category describes the scope of
f * -> g *
, i.e.,Any functor -> Any functor
, e.g.,Tree * -> List *
orTree * -> Tree *
.What a Categorical statement does not specify describes where anything and everything is permitted.
In this case, inside the functors,
* -> *
akaa -> b
is not specified which meansAnything -> Anything including Anything else
. As my imagination jumps to Int -> String, it also includesInteger -> Maybe Int
, or evenMaybe Double -> Either String Int
wherea :: Maybe Double; b :: Either String Int
.So the statement comes together as follows:
:: f a -> g b
(i.e., any parameterized type to any parameterized type):: f a -> f b
(i.e., any one parameterized type to the same parameterized type) ... said differently,So, where is the power of this construct? To appreciate the full dynamics, I needed to see that the typical drawings of a monoid (single object with what looks like an identity arrow,
:: single object -> single object
), fails to illustrate that I'm permitted to use an arrow parameterized with any number of monoid values, from the one type object permitted in Monoid. The endo, ~ identity arrow definition of equivalence ignores the functor's type value and both the type and value of the most inner, "payload" layer. Thus, equivalence returnstrue
in any situation where the functorial types match (e.g.,Nothing -> Just * -> Nothing
is equivalent toJust * -> Just * -> Just *
because they are bothMaybe -> Maybe -> Maybe
).Sidebar: ~ outside is conceptual, but is the left most symbol in
f a
. It also describes what "Haskell" reads-in first (big picture); so Type is "outside" in relation to a Type Value. The relationship between layers (a chain of references) in programming is not easy to relate in Category. The Category of Set is used to describe Types (Int, Strings, Maybe Int etc.) which includes the Category of Functor (parameterized Types). The reference chain: Functor Type, Functor values (elements of that Functor's set, e.g., Nothing, Just), and in turn, everything else each functor value points to. In Category the relationship is described differently, e.g.,return :: a -> m a
is considered a natural transformation from one Functor to another Functor, different from anything mentioned thus far.Back to the main thread, all in all, for any defined tensor product and a neutral value, the statement ends up describing an amazingly powerful computational construct born from its paradoxical structure:
:: List
); staticfold
that says nothing about the payload)In Haskell, clarifying the applicability of the statement is important. The power and versatility of this construct, has absolutely nothing to do with a monad per se. In other words, the construct does not rely on what makes a monad unique.
When trying to figure out whether to build code with a shared context to support computations that depend on each other, versus computations that can be run in parallel, this infamous statement, with as much as it describes, is not a contrast between the choice of Applicative, Arrows and Monads, but rather is a description of how much they are the same. For the decision at hand, the statement is moot.
This is often misunderstood. The statement goes on to describe
join :: m (m a) -> m a
as the tensor product for the monoidal endofunctor. However, it does not articulate how, in the context of this statement,(<*>)
could also have also been chosen. It truly is a an example of six/half dozen. The logic for combining values are exactly alike; same input generates the same output from each (unlike the Sum and Product monoids for Int because they generate different results when combining Ints).So, to recap: A monoid in the category of endofunctors describes:
(<*>)
and(>>=)
both provide simultaneous access to the twom
values in order to compute the the single return value. The logic used to compute the return value is exactly the same. If it were not for the different shapes of the functions they parameterize (f :: a -> b
versusk :: a -> m b
) and the position of the parameter with the same return type of the computation (i.e.,a -> b -> b
versusb -> a -> b
for each respectively), I suspect we could have parameterized the monoidal logic, the tensor product, for reuse in both definitions. As an exercise to make the point, try and implement~t
, and you end up with(<*>)
and(>>=)
depending on how you decide to define itforall a b
.If my last point is at minimum conceptually true, it then explains the precise, and only computational difference between Applicative and Monad: the functions they parameterize. In other words, the difference is external to the implementation of these type classes.
In conclusion, in my own experience, Mac Lane's infamous quote provided a great "goto" meme, a guidepost for me to reference while navigating my way through Category to better understand the idioms used in Haskell. It succeeds at capturing the scope of a powerful computing capacity made wonderfully accessible in Haskell.
However, there is irony in how I first misunderstood the statement's applicability outside of the monad, and what I hope conveyed here. Everything that it describes turns out to be what is similar between Applicative and Monads (and Arrows among others). What it doesn't say is precisely the small but useful distinction between them.
- E
That particular phrasing is by James Iry, from his highly entertaining Brief, Incomplete and Mostly Wrong History of Programming Languages, in which he fictionally attributes it to Philip Wadler.
The original quote is from Saunders Mac Lane in Categories for the Working Mathematician, one of the foundational texts of Category Theory. Here it is in context, which is probably the best place to learn exactly what it means.
But, I'll take a stab. The original sentence is this:
X here is a category. Endofunctors are functors from a category to itself (which is usually all
Functor
s as far as functional programmers are concerned, since they're mostly dealing with just one category; the category of types - but I digress). But you could imagine another category which is the category of "endofunctors on X". This is a category in which the objects are endofunctors and the morphisms are natural transformations.And of those endofunctors, some of them might be monads. Which ones are monads? Exactly the ones which are monoidal in a particular sense. Instead of spelling out the exact mapping from monads to monoids (since Mac Lane does that far better than I could hope to), I'll just put their respective definitions side by side and let you compare:
A monoid is...
...satisfying these laws:
A monad is...
* -> *
with aFunctor
instance)join
in Haskell)return
in Haskell)...satisfying these laws:
With a bit of squinting you might be able to see that both of these definitions are instances of the same abstract concept.
Intuitively, I think that what the fancy math vocabulary is saying is that:
Monoid
A monoid is a set of objects, and a method of combining them. Well known monoids are:
There are more complex examples also.
Further, every monoid has an identity, which is that "no-op" element that has no effect when you combine it with something else:
Finally, a monoid must be associative. (you can reduce a long string of combinations anyway you want, as long as you don't change the left-to-right-order of objects) Addition is OK ((5+3)+1 == 5+(3+1)), but subtraction isn't ((5-3)-1 != 5-(3-1)).
Monad
Now, let's consider a special kind of set and a special way of combining objects.
Objects
Suppose your set contains objects of a special kind: functions. And these functions have an interesting signature: They don't carry numbers to numbers or strings to strings. Instead, each function carries a number to a list of numbers in a two-step process.
Examples:
Combining Objects
Also, our way of combining functions is special. A simple way to combine function is composition: Let's take our examples above, and compose each function with itself:
Without getting too much into type theory, the point is that you can combine two integers to get an integer, but you can't always compose two functions and get a function of the same type. (Functions with type a -> a will compose, but a-> [a] won't.)
So, let's define a different way of combining functions. When we combine two of these functions, we don't want to "double-wrap" the results.
Here is what we do. When we want to combine two functions F and G, we follow this process (called binding):
Back to our examples, let's combine (bind) a function with itself using this new way of "binding" functions:
This more sophisticated way of combining functions is associative (following from how function composition is associative when you aren't doing the fancy wrapping stuff).
Tying it all together,
Notes
There are lots of ways to "wrap" results. You can make a list, or a set, or discard all but the first result while noting if there are no results, attach a sidecar of state, print a log message, etc, etc.
I've played a bit loose with the definitions in hopes of getting the essential idea across intuitively.
I've simplified things a bit by insisting that our monad operates on functions of type a -> [a]. In fact, monads work on functions of type a -> m b, but the generalization is kind of a technical detail that isn't the main insight.