from wide format to long format with results in mu

2020-04-11 11:59发布

id name1 adress1 name2 adress2 name3 adress3 1 1 John street a Burt street d chris street 1 2 2 Jack street b Ben street e connor street 2 3 3 Joey <NA> Bob street f <NA> <NA>

id origin names adresses 1 1 1 John street a 2 2 1 Jack street b 3 3 1 Joey <NA> 4 1 2 Burt street d 5 2 2 Ben street e 6 3 2 Bob street f 7 1 3 chris street 1 8 2 3 connor street 2

#code to generete the dataframes: df <- data.frame(id = c(1,2,3), name1 = c("John", "Jack", "Joey"), adress1 = c("street a", "street b", NA), name2 = c("Burt", "Ben", "Bob"), adress2 = c("street d", "street e", "street f"), name3 = c("chris", "connor", NA), adress3 = c("street 1", "street 2", NA), stringsAsFactors = FALSE) expecteddf <- data.frame(id = c(1,2,3,1,2,3,1,2), origin = c(rep(1, 3), rep(2, 3), rep(3, 2)), names = c("John", "Jack", "Joey", "Burt", "Ben", "Bob", "chris", "connor"), adresses = c("street a", "street b", NA, "street d", "street e", "street f", "street 1", "street 2"), stringsAsFactors = FALSE )

We could use melt from the devel version of data.table which can take multiple patterns for the measure columns. Instructions to install the devel version of 'data.table' is here

We convert the 'data.frame' to 'data.table' (setDT(df)), melt, and specify the regex in the patterns of measure argument. Remove the rows that are NA for the 'names' and 'address' column.

library(data.table)#v1.9.5+
dM <- melt(setDT(df), measure=patterns(c('^name', '^adress')),
          value.name=c('names', 'address') )
dM[!(is.na(names) & is.na(address))]
# id variable  names  address
#1:  1        1   John street a
#2:  2        1   Jack street b
#3:  3        1   Joey       NA
#4:  1        2   Burt street d
#5:  2        2    Ben street e
#6:  3        2    Bob street f
#7:  1        3  chris street 1
#8:  2        3 connor street 2

Or we can use reshape from base R.

 dM2 <- reshape(df, idvar='id', varying=list(grep('name', names(df)), 
             grep('adress', names(df))), direction='long')

The NA rows can be removed as in the data.table solution by using standard 'data.frame' indexing after we create the logical index with is.na.

from wide format to long format with results in mu

问题:

回答1:

收藏的人(0)

from wide format to long format with results in mu

问题:

回答1:

收藏的人(0)

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮