Recode a variable using data.table package

Posted 2020-05-01 08:32

What is the syntax for recoding a variable in R using data.table? I saw some answers but didn't find them appropriate.

For example, I have a variable called gender, and I want to recode 0 to unknown, 1 to male, and 2 to female. Here is what I tried:

Name <- c("John", "Tina", "Dave", "Casper")
Gender <- c(1, 2, 2, 0)

trips <- cbind.data.frame(Name, Gender)
trips[, gender = ifelse(gender == 0, "Unkown", gender == 1, "Male", gender == 2, "Female" )]

but I get an error.

Tags: r data.table
3 answers
看我几分像从前
#2 · 2020-05-01 08:41

Put the rules in a table and do an update join (this assumes trips is already a data.table, e.g. after setDT(trips)):

gmap = data.table(old = 0:2, new = c("Unknown", "Male", "Female"))
trips[, Gender := gmap[copy(.SD), on=.(old = Gender), x.new]]

     Name  Gender
1:   John    Male
2:   Tina  Female
3:   Dave  Female
4: Casper Unknown
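
For comparison, here is a minimal sketch of the same lookup-table idea written in the more common `x[i, on = ..., col := i.val]` update-join form. It is not the answerer's exact code: the column name GenderLabel is a hypothetical choice, and the sketch writes the labels to a new column so that a numeric column is not overwritten with character values partway through.

```r
library(data.table)

# Sample data from the question, built directly as a data.table
trips <- data.table(Name   = c("John", "Tina", "Dave", "Casper"),
                    Gender = c(1L, 2L, 2L, 0L))

# Lookup table holding the recoding rules
gmap <- data.table(old = 0:2, new = c("Unknown", "Male", "Female"))

# Update join: for each row of gmap, find the rows of trips where
# Gender == old and write the label into a new column by reference.
# GenderLabel is a hypothetical column name chosen for this sketch.
trips[gmap, on = .(Gender = old), GenderLabel := i.new]

print(trips)
```

Keeping the rules in a table means adding a new code later is a one-row change to gmap rather than an edit to the recoding expression.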
▲ chillily
#3 · 2020-05-01 08:44

Once you have a data.table, the most efficient approach is a vectorized translation. The match function returns, for each value of Gender, its position in the vector of codes, and that "selection vector" is then used to pick the corresponding item from the vector of labels:

library(data.table)
setDT(trips)  # convert the data.frame to a data.table by reference

trips[ , Gender := c("Unknown", "male", "Female")[match(Gender, c(0,1,2))] ]
#-------------------
> trips
     Name  Gender
1:   John    male
2:   Tina  Female
3:   Dave  Female
4: Casper Unknown

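For this specific case, where the codes are the consecutive integers 0, 1, 2, a simpler alternative is to index the labels directly with Gender + 1 (ht to @Chinsoon). Run it instead of the match version, on the original numeric column:

trips[, Gender := c("Unknown", "Male", "Female")[Gender + 1L] ]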
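
As a small sketch of how the match approach behaves with a code that is not in the lookup, consider the example below. The extra row ("Alex", code 9) and the column name GenderLabel are made up for illustration and are not part of the original data.

```r
library(data.table)

codes  <- c(0, 1, 2)
labels <- c("Unknown", "Male", "Female")

# Question's data plus one hypothetical row with an unlisted code (9)
trips <- data.table(Name   = c("John", "Tina", "Dave", "Casper", "Alex"),
                    Gender = c(1, 2, 2, 0, 9))

# match() gives the position of each Gender value in codes (NA if absent);
# indexing labels by that position performs the translation in one step.
trips[, GenderLabel := labels[match(Gender, codes)]]

print(trips)
```

Codes that are missing from the lookup come back as NA rather than raising an error, so it can be worth checking `anyNA(trips$GenderLabel)` after recoding.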
迷人小祖宗
#4 · 2020-05-01 08:51

You can do it this way:

library(data.table)
trips <- data.table(Name=c('John','Tina','Dave','Casper'),gender=c(1,2,1,0))
trips[,gender:= ifelse(gender == 0 , "Unknown", 
                        ifelse(gender == 1 ,  "Male", 
                                              "Female" ))]

Two problems in your code:

  • You need to use :=, the operator that assigns to a column by reference in data.table.
  • ifelse takes exactly three arguments (test, yes, no), so it can only choose between two outcomes. For a third case you need to nest another ifelse in the no branch: if gender is not 0, test whether it is 1 to separate the Male and Female cases.
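
As an alternative to nesting ifelse, data.table also provides fcase, which takes condition/value pairs and reads much like the call the question attempted. The sketch below assumes a data.table version that includes fcase (1.13.0 or later, to my knowledge) and reuses the sample data from this answer.

```r
library(data.table)

trips <- data.table(Name   = c("John", "Tina", "Dave", "Casper"),
                    gender = c(1, 2, 1, 0))

# fcase() evaluates the conditions in order and returns the value paired
# with the first one that is TRUE; rows matching no condition get NA.
trips[, gender := fcase(gender == 0, "Unknown",
                        gender == 1, "Male",
                        gender == 2, "Female")]

print(trips)
```

Unlike the nested ifelse above, a code other than 0, 1 or 2 ends up as NA here instead of falling through to "Female".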