One of the most powerful patterns available in Scala is the enrich-my-library* pattern, which uses implicit conversions to appear to add methods to existing classes without requiring dynamic method resolution. For example, if we wished that all strings had the method spaces
that counted how many whitespace characters they had, we could:
class SpaceCounter(s: String) {
def spaces = s.count(_.isWhitespace)
}
implicit def string_counts_spaces(s: String) = new SpaceCounter(s)
scala> "How many spaces do I have?".spaces
res1: Int = 5
Unfortunately, this pattern runs into trouble when dealing with generic collections. For example, a number of questions have been asked about grouping items sequentially with collections. There is nothing built in that works in one shot, so this seems an ideal candidate for the enrich-my-library pattern using a generic collection C
and a generic element type A
:
class SequentiallyGroupingCollection[A, C[A] <: Seq[A]](ca: C[A]) {
def groupIdentical: C[C[A]] = {
if (ca.isEmpty) C.empty[C[A]]
else {
val first = ca.head
val (same,rest) = ca.span(_ == first)
same +: (new SequentiallyGroupingCollection(rest)).groupIdentical
}
}
}
except, of course, it doesn't work. The REPL tells us:
<console>:12: error: not found: value C
if (ca.isEmpty) C.empty[C[A]]
^
<console>:16: error: type mismatch;
found : Seq[Seq[A]]
required: C[C[A]]
same +: (new SequentiallyGroupingCollection(rest)).groupIdentical
^
There are two problems: how do we get a C[C[A]]
from an empty C[A]
list (or from thin air)? And how do we get a C[C[A]]
back from the same +:
line instead of a Seq[Seq[A]]
?
* Formerly known as pimp-my-library.
As of this commit the magic incantation is slightly changed from what it was when Miles gave his excellent answer.
The following works, but is it canonical? I hope one of the canons will correct it. (Or rather, cannons, one of the big guns.) If the view bound is an upper bound, you lose application to Array and String. It doesn't seem to matter if the bound is GenTraversableLike or TraversableLike; but IsTraversableLike gives you a GenTraversableLike.
There's more than one way to skin a cat with nine lives. This version says that once my source is converted to a GenTraversableLike, as long as I can build the result from GenTraversable, just do that. I'm not interested in my old Repr.
This first attempt includes an ugly conversion of Repr to GenTraversableLike.
The key to understanding this problem is to realize that there are two different ways to build and work with collections in the collections library. One is the public collections interface with all its nice methods. The other, which is used extensively in creating the collections library, but which are almost never used outside of it, is the builders.
Our problem in enriching is exactly the same one that the collections library itself faces when trying to return collections of the same type. That is, we want to build collections, but when working generically, we don't have a way to refer to "the same type that the collection already is". So we need builders.
Now the question is: where do we get our builders from? The obvious place is from the collection itself. This doesn't work. We already decided, in moving to a generic collection, that we were going to forget the type of the collection. So even though the collection could return a builder that would generate more collections of the type we want, it wouldn't know what the type was.
Instead, we get our builders from
CanBuildFrom
implicits that are floating around. These exist specifically for the purpose of matching input and output types and giving you an appropriately typed builder.So, we have two conceptual leaps to make:
CanBuildFrom
s, not from our collection directly.Let's look at an example.
Let's take this apart. First, in order to build the collection-of-collections, we know we'll need to build two types of collections:
C[A]
for each group, andC[C[A]]
that gathers all the groups together. Thus, we need two builders, one that takesA
s and buildsC[A]
s, and one that takesC[A]
s and buildsC[C[A]]
s. Looking at the type signature ofCanBuildFrom
, we seewhich means that CanBuildFrom wants to know the type of collection we're starting with--in our case, it's
C[A]
, and then the elements of the generated collection and the type of that collection. So we fill those in as implicit parameterscbfcc
andcbfc
.Having realized this, that's most of the work. We can use our
CanBuildFrom
s to give us builders (all you need to do is apply them). And one builder can build up a collection with+=
, convert it to the collection it is supposed to ultimately be withresult
, and empty itself and be ready to start again withclear
. The builders start off empty, which solves our first compile error, and since we're using builders instead of recursion, the second error also goes away.One last little detail--other than the algorithm that actually does the work--is in the implicit conversion. Note that we use
new GroupingCollection[A,C]
not[A,C[A]]
. This is because the class declaration was forC
with one parameter, which it fills it itself with theA
passed to it. So we just hand it the typeC
, and let it createC[A]
out of it. Minor detail, but you'll get compile-time errors if you try another way.Here, I've made the method a little bit more generic than the "equal elements" collection--rather, the method cuts the original collection apart whenever its test of sequential elements fails.
Let's see our method in action:
It works!
The only problem is that we don't in general have these methods available for arrays, since that would require two implicit conversions in a row. There are several ways to get around this, including writing a separate implicit conversion for arrays, casting to
WrappedArray
, and so on.Edit: My favored approach for dealing with arrays and strings and such is to make the code even more generic and then use appropriate implicit conversions to make them more specific again in such a way that arrays work also. In this particular case:
Here we've added an implicit that gives us an
Iterable[A]
fromC
--for most collections this will just be the identity (e.g.List[A]
already is anIterable[A]
), but for arrays it will be a real implicit conversion. And, consequently, we've dropped the requirement thatC[A] <: Iterable[A]
--we've basically just made the requirement for<%
explicit, so we can use it explicitly at will instead of having the compiler fill it in for us. Also, we have relaxed the restriction that our collection-of-collections isC[C[A]]
--instead, it's anyD[C]
, which we will fill in later to be what we want. Because we're going to fill this in later, we've pushed it up to the class level instead of the method level. Otherwise, it's basically the same.Now the question is how to use this. For regular collections, we can:
where now we plug in
C[A]
forC
andC[C[A]]
forD[C]
. Note that we do need the explicit generic types on the call tonew GroupingCollection
so it can keep straight which types correspond to what. Thanks to theimplicit c2i: C[A] => Iterable[A]
, this automatically handles arrays.But wait, what if we want to use strings? Now we're in trouble, because you can't have a "string of strings". This is where the extra abstraction helps: we can call
D
something that's suitable to hold strings. Let's pickVector
, and do the following:We need a new
CanBuildFrom
to handle the building of a vector of strings (but this is really easy, since we just need to callVector.newBuilder[String]
), and then we need to fill in all the types so that theGroupingCollection
is typed sensibly. Note that we already have floating around a[String,Char,String]
CanBuildFrom, so strings can be made from collections of chars.Let's try it out:
As of this commit it's a lot easier to "enrich" Scala collections than it was when Rex gave his excellent answer. For simple cases it might look like this,
which adds a "same result type" respecting
filterMap
operation to allGenTraversableLike
s,And for the example from the question, the solution now looks like,
Sample REPL session,
Again, note that the same result type principle has been observed in exactly the same way that it would have been had
groupIdentical
been directly defined onGenTraversableLike
.