Approximate lookup in R

2020-02-28 06:08发布

I have the following lookup table:

lkp <- data.frame(
         x=c(0,0.2,0.65,0.658,1.3,1.76,2.7), 
         y=c(1,1,1,0.942,0.942, 0.92, 0.89)
       )

I would like to get the value of Y of a given X value.

If the X value exists in the table then the exact Y of the table should be returned. If the X value does not exist, then the Y value should be returned as linear interpolation of the 2 nearest neighbors (only the 2 nearest neighbors). I would not like to fit a model to the overall data.

for the above table

for X=0.2 Y=1 (exact lookup) 
for X=2 Y=0.91 (linear interpolation between the last 2 rows of the data frame)

Is there any ready function to do this?

标签: r lookup
2条回答
2楼-- · 2020-02-28 06:27

I do not think there is a ready function for this, but you could build one quite easily. E.g.:

A function for getting "neighbourgs" (the name is a bit misleading, but the function works):

get.neighbourgs <- function(vector, x) {
diff <- vector-x
if (any(diff==0)) {
    return(which(diff==0))
    } else {
        lower <- tail(which(diff<0), 1)
        return((lower-1):lower)
    }
}

It will return the exact "ID" if a value is found in vector, otherways it will return two "IDs" (two smaller values before that). It requires that your data is ordered! If not, you have to tweak it a bit. Examples of usage:

> get.neighbourgs(lkp$x,1.3)
[1] 5
> get.neighbourgs(lkp$x,2)
[1] 5 6

Using this, a simple function can be built to get the mean of requiredy values, like:

get.y <- function(df, x) {
    mean(df$y[get.neighbourgs(df$x, x)])
}

Examples:

> get.y(lkp, 1.2)
[1] 0.971
> get.y(lkp, 2)
[1] 0.931

Hope that helps.

查看更多
够拽才男人
3楼-- · 2020-02-28 06:42

Yes, it's called approx.

> with(lkp, approx(x, y, xout=c(0.2, 2)))
$x
[1] 0.2 2.0

$y
[1] 1.0000000 0.9123404

See ?approx for more information.

查看更多
登录 后发表回答