Replace definition of built-in function in R?

2019-06-21 18:19发布

The 'sparcl' package uses the 'kmeans' function in the standard 'stat' package. I want to make it use my own implementation of kmeans++ instead.

One way to do this would be to edit the code in the sparcl package itself. I'd prefer to avoid this, both because it would be messy and because I'm not sure how I would then install that edited code in R.

Unfortunately, the superassignment operator "<<-" doesn't work:

> kmeans <<- function(x) print("hi!")
Error: cannot change value of locked binding for 'kmeans'

neither does "assign":

assign("kmeans",function(x) {print("HI THERE!"); return(FALSE)},pos="package:sparcl")
Error in assign("is.null", function(x) { : 
  cannot add bindings to a locked environment

So is editing the package code the only way?

Thanks!

2条回答
姐就是有狂的资本
2楼-- · 2019-06-21 18:57

If you do want to edit a function's body (but not its arguments) during an interactive session, you can use trace(), like this:

trace("kmeans", edit=TRUE)

Then, in the editor that pops up, edit the body, so that it looks like this (for example):

function (x, centers, iter.max = 10, nstart = 1, algorithm = c("Hartigan-Wong", 
"Lloyd", "Forgy", "MacQueen")) 
{
    plot(rnorm(99), col = "red")
}

Save the edited function definition and then exit the editor.

Back at the R command line, you can view the edited function and try it out:

body(kmeans)  # To view the tracing code
kmeans()      # To use the edited function

Finally, to revert to the unedited function, just do untrace("kmeans"). (I generally prefer using trace() to assignInNamespace() and friends because untrace() makes it so easy to undo changes.)

查看更多
爷的心禁止访问
3楼-- · 2019-06-21 19:02

On further thought (and after re-reading your question), here's a simple solution that should work for you.

All you need to do is to assign your edited version of kmeans() to the symbol kmeans in the global environment. In other words, at the command line do this:

kmeans <- function(...) plot(rnorm(99), col="red") # but using your own edits

## Then run an example from ?KMeansSparseCluster to see that it works.
library(sparcl)
x <- matrix(rnorm(50*300),ncol=300)
x[1:25,1:50] <- x[1:25,1:50]+1
x <- scale(x, TRUE, TRUE)
KMeansSparseCluster.permute(x,K=2,wbounds=seq(3,9,len=15),nperms=5)

This works because KMeansSparseCluster() (and calls to any other functions in package:sparcl) look for kmeans first in namespace:sparcl, then in imports:sparcl, then in namespace:base, and then in .GlobalEnv, where it'll find your redefined kmeans before it gets to the one in package:stats. To have a look yourself, try this:

parent.env(asNamespace("sparcl"))
parent.env(parent.env(asNamespace("sparcl")))
parent.env(parent.env(parent.env(asNamespace("sparcl"))))
## etc., also wrapping any of the environments above in calls to ls() 
## to see what's in 'em

Nicely, functions from the stats package that use kmeans() won't be disrupted by your version, because they will find kmeans in their own namespace, before the symbol-search ever gets to the global environment.

查看更多
登录 后发表回答