Consider the Object-Oriented Languages:
Most people coming from an object-oriented programming background, are familiar with the common and intuitive interfaces in various languages that capture the essence of Java's Collection
& List
interfaces. Collection
refers to a collection of objects which doesn't necessarily have an natural ordering/indexing. A List
is a collection which has a natural ordering/indexing. These interfaces abstract many library data-structures in Java, as do their equivalent interfaces in other languages, and an intimate understanding of these interfaces are required to work effectively with most library data-structures.
Transition to Haskell:
Haskell has a type-class system which acts on types analogously to interfaces on objects. Haskell seems to have a well designed type-class hierarchy with regard to Functors, Applicative, Monads, etc. when the type regard functionality. They obviously want correct and well-abstracted type-classes. Yet when you look at many Haskell's containers (List
,Map
,Sequence
,Set
,Vector
) they almost all have very similar (or identical) functions, yet aren't abstracted through type-classes.
Some Examples:
null
for testing "emptyness"length
/size
for element countelem
/member
for set inclusionempty
and/orsingleton
for default constructionunion
for set union(\\)
/diff
for set difference(!)
/(!!)
for unsafe indexing (partial function)(!?)
/lookup
for safe indexing (total function)
If I want to use any of the functions above, but I have imported two or more containers I have to start hiding functions from the imported modules, or explicitly import only the necessary functions from the modules, or qualifying the imported modules. But since all the functions provide the same logical functionality, it just seems like a hassle. If the functions were defined from type-classes, and not separately in each module, the compiler's type inference mechanics could resolve this. It would also make switching underlying containers simple as long as they shared the type-classes (ie: lets just use a Sequence
instead of List
for better random access efficiency).
Why doesn't Haskell have a Collection
and/or Indexable
type-class(es) to unify & generalize some of these functions?
Such typeclasses exist in standard Haskell, but they don't have the same name as their equivalent OO counterparts. The
Collection
typeclass, for example, is calledFoldable
in Haskell. You can use it to test if a structure is empty (foldr (const False) True x
) or to count the number of elements (foldMap (const 1) x
), or to test for set membership (foldr (\e' present -> (e==e') || present) False x
for somee
).For operations like element lookup, you have the
Array
typeclass which might work for sequential data. For more flexibility, you can write your ownIndexable
class, like this for example (beware of lenses) :The null element and set union belong to the
Monoid
typeclass (wheremappend == union
). In this light, set difference could also be implemented in its own typeclassDifferentiable
(which I'm sure already exists in dozens of Haskell libraries) and we would have total compatibility with imperative languages.Haskell, by virtue of being designed by mathematicians and the like, doesn't employ the same vocabulary as most other languages, but rest assured, it doesn't mean that it's not a practical language in addition to being an awesome one :-)
Partly, the reason is that monads and arrows are new, innovative features of Haskell, while collections are relatively more mundane. Haskell has a long history as a research language; interesting research questions (designing monad instances & defining generic operations for monads) get more development effort than "industrial-strength" polishing (defining container APIs).
Partly, the reason is that those types come from three different packages (base, containers, and vector), with three separate histories and designers. That makes it harder for their designers to coordinate on providing instances of any single type class.
Partly, the reason is that defining a single type class to cover all five of the containers you mentioned is really hard. List, Sequence, and Vector are relatively similar, but Map and Set have completely different constraints. For List, Sequence, and Vector, you want a simple constructor class, but for Set that won't work, since Set requires an Ord instance on the element type. Even worse, Map can support most of your methods, but its singleton function needs two parameters where the rest need only one.
As other answers have pointed out, Haskell tends to use different vocabulary. However, I don't think they've explained the reason for the difference very well.
In a language like Java, functions are not "first class citizens"; it's true that anonymous functions are available in the latest versions, but this style of interface (Collection, Indexable, Interable, etc.) were designed before that.
This makes it tedious to pass our code around, so we prefer other people's data to be passed to our code. For example:
Iterable
lets us writefor (Foo x : anIterable) { ... }
ArrayAccess
lets us writeanArrayAccess[anIndex]
This style can also be seen in OO languages which implement generators, since that's another way for us to write
for yieldedElement in aGenerator: ...
.Haskell takes a different approach with its typeclasses: we prefer our code to be passed to other people's data. Some (simplified) examples:
Functor
s accept our code and apply it to any elements they 'contain'Monad
s accept our code and apply it in some kind of 'sequence'Foldable
s accept our code and use it to 'reduce' their contentsJava only needs
Iterable
since we have to call our code in ourfor
loop, so we can make sure it's called correctly. Haskell requires more specific typeclasses since someone else's code will be calling ours, so we need to specify how it should be called; is it amap
, afold
, anunfold
, etc.?Thankfully, the type system helps us choose the right method ;)
Haskell has some type classes for working with collections in the base package: Functor, Foldable and Traversable can be useful for working with collections, and the Monoid, Applicative and/or Alternative typeclasses can be useful for constructing collections.
Together, these classes cover most of the operations mentioned in the question, but maybe less efficient than a more container-specific function (though many of these are class methods, whose default definitions can be overriden if necessary).
Foldable supports
null
since base 4.8 (any (const True)
is an alternative for earlier versions).Foldable supports
length
since base 4.8 (getSum . foldMap (const 1)
is an alternative for earlier versions).Foldable supports
elem
,notElem
andmember
.For empty, there is
mempty
from Monoid andempty
from Alternative. For singleton, there ispure
from Applicative.There is
mappend
from Monoid and<|>
from Alternative. They don't necessarily implement set union, but they implement some form of union that works well together with empty and usually also with singleton and find.This one is not supported, unfortunately.
You could use
fromJust
together with a function for safe indexing.There is
find
from Foldable.You have typeclasses for different collection aspects:
composition: Monoid (module Data.Monoid)
sequential control: Applicative, Monad (modules Control.Applicative, Control.Monad)
sequential composition: Alternative, MonadPlus (modules Control.Applicative, Control.Monad)
non-sequential mapping and reduction: Functor (mod. Data.Functor), Foldable (mod. Data.Foldable)
sequential mapping and reduction: Traversable (module Data.Traversable)
serialisation: Binary (mod. Data.Binary)
comparison: Eq, Ord (mod. Data.Eq, Data.Ord)
textualisation: Show, Read
deep evaluation (to Normal Form): NFData (mod. Control.DeepSeq)
generic datatype traversability: Data (mod. Data.Data)
Except that monomorphic collections (ByteString, IntSet, Text) cannot implement Functor and Foldable (they require type arity == 1 (Kind: * -> *))
Also neither (Set a) implements Functor.
The package mono-traversable redefines some classes without the monomorphic types exclusion.
Update. There is an attempt to put most functions in typeclasses with the packages mono-traversable and classy-prelude.
library ref, platform
The
lens
package provides some of this.Testing for emptiness, creating empty containers These are both provided by the
AsEmpty
typeclass fromControl.Lens.Empty
.Accessing elements by key/index. The
At
andIxed
typeclasses fromControl.Lens.At
.Checking for membership in set-like containers. The
Contains
typeclass fromControl.Lens.At
.Appending and deleting elements to sequence-like containers. The
Cons
andSnoc
typeclasses fromControl.Lens.Cons
.Also, the
pure
method of theApplicative
typeclass can often be used to create "singleton" containers. For things that are not functors/applicatives in Haskell, likeSet
, perhapspoint
fromData.Pointed
could be used.