Iterate over cartesian product of vectors

2019-02-10 02:30发布

问题:

I have the following nested loop:

for (x in xs) {
    for (y in ys) {
        # Do something with x and y
    }
}

Which I’d like to flatten so I thought of building a Cartesian product of the two vectors xs and ys and iterating over the result. In Python, this would be trivial:

for xy in product(xs, ys):
    # x, y = xy[0], xy[1]

But in R, the simplest equivalent I’ve found looks daunting:

xys <- expand.grid(xs, ys)
for (i in 1 : nrow(xys)) {
    xy <- as.vector(xys[i, ])
    # x <- xy[1], y <- xy[2]
}

Surely there must be a better way, no? (To clarify, I don’t want to iterate over an index … I think there must be a way to directly iterate over the tuples in the product.)

回答1:

You can use the apply function to apply a function to each row of your data frame. Just replace "your function" with your actual function.

# example data
xs <- rnorm(10)
ys <- rnorm(10)    

apply(expand.grid(xs, ys), 1, FUN = function(x) {"your function"})

This is a very basic example. Here, the sum of both values in a row is calculated:

apply(expand.grid(xs, ys), 1, FUN = function(x) {x[1] + x[2]})

Here is a variant that uses named arguments (xs, ys) instead of indices (x[1], x[2]):

myfun <- function(xs, ys) xs + ys
arguments <- expand.grid(xs = rnorm(10), ys = rnorm(10))
apply(arguments, 1, function(x)do.call(myfun, as.list(x)))


回答2:

R has a different paradigm than Python, so don't expect it to have generators or tuples -- we have vectors and indices for that.

This way, to map a function on a Cartesian product simply call

outer(xs,ys,function(x,y) ...)

and undim the result if you wish.

EDIT: In case xs or ys are something more complex than base vectors, one option is to use indices, i.e.

outer(seq(a=xs),seq(a=ys),function(xi,yi) ... xs[[xi]]/ys[xi,]/etc. ...)

or map a function on a bit hand-crafted product using mapply

mapply(function(x,y) ...,xs,rep(ys,each=length(xs)))