I have a data set with 5 columns:
store_id year event item units
123 2015 sale_2 abc 2
234 2015 sale_3 def 1
345 2015 sale_2 xyz 5
I'm trying to rotate out the items
by store_id, year, and event
to get the sum
. For instance
store_id year event abc def xyz
123 2015 sale_2 7 0 0
234 2015 sale_2 2 1 0
I'm having trouble figuring out the best method. Normally I'd use dummyVars in caret to do this but I need sums instead of flag. I've looked at tapply but it can't handle more than 2 grouping variables.
Any other suggestions?
You can use dplyr to group and summarize and tidyr to spread the data into the desired item columns.
For large datasets consider