It's pretty easy to build a nice huge scatterplot matrix with histograms down the diagonal for multivariate data as follows:
scatterplotMatrix(somedata[1:points.count,],groups=somedata[1:points.count,class],
by.groups=TRUE,diagonal="histogram")
According to the documentation though, it doesn't seem possible to divide up the histogram by the group labels as is done in this question. How would you do that using scatterplotMatrix or a similar function?
For later reference, the GGally way to do it is as follows:
This intrepid asker figured out that you have to enable axisLabels which is somewhat silly, given the aesthetic emphasis of ggplot and friends.
Now I want to know how to parallelize this, because it's a monster with high numbers of variables.
Is this what you had in mind?
Using the iris dataset:
I keep hearing that the same thing is possible using
ggpairs(...)
in package GGally. I would love to see an actual example of it. The documentation is inscrutable. Also,ggpairs(...)
is extremely slow (in my hands), especially with large datasets.