r mutate_each function is deprecated

2019-07-02 12:54发布

问题:

I use the function mutate_each from the tidyverse package and I get a message this function is deprecated. I would like to use the other functions that are not deprecated to change field types.

Below is a reproducible example of how I currently employ mutate_each.

library(tidyverse)

set.seed(123)

df <- data.frame(FirstName = sample(LETTERS[1:2],size=3, replace=TRUE),
             LastName = sample(LETTERS[3:6],size=3, replace=TRUE),
             StartDate =  c("1/31/2000","2/1/2000","3/1/2000"),
             EndDate =   c("1/31/2010","2/10/2011","3/1/2016"),
             stringsAsFactors = FALSE)
str(df)

df %>% mutate_each(funs(as.factor(as.character(.))), 
               c(FirstName:LastName)) %>% 
   mutate_each(funs(as.Date(., format = "%m/%d/%Y",
                       origin = "1899-12-30")), 
          c(StartDate:EndDate))

`mutate_each()` is deprecated.
Use `mutate_all()`, `mutate_at()` or `mutate_if()` instead.
To map `funs` over a selection of variables, use `mutate_at()`...etc

I have played with mutate_all(), mutate_at() and mutate_if(), but no luck.

回答1:

Using the comments from @Chi Pak, the function mutate_at can be used to replace function mutate_each

library(tidyverse)

set.seed(123)

df <- data.frame(FirstName = sample(LETTERS[1:2],size=3, replace=TRUE),
         LastName = sample(LETTERS[3:6],size=3, replace=TRUE),
         StartDate =  c("1/31/2000","2/1/2000","3/1/2000"),
         EndDate =   c("1/31/2010","2/10/2011","3/1/2016"),
         stringsAsFactors = FALSE)

t1 <- df %>% mutate_each(funs(as.factor(as.character(.))), 
                     c(FirstName:LastName )) %>% 
  mutate_each(funs(as.Date(., format = "%m/%d/%Y",
                       origin = "1899-12-30")), 
          c(StartDate:EndDate))

t2 <- df %>% mutate_at(vars(FirstName:LastName),
                   funs(as.factor(as.character(.)))) %>% 
  mutate_at(vars(StartDate:EndDate),
        funs(as.Date(as.character(.),
                     format = "%m/%d/%Y", origin = "1899-12-30")))

identical(t1,t2)
[1] TRUE