Cumulative sum of a column based on the values of

2020-04-11 18:41发布

I have a data frame with 2 columns like this:

> data.frame(x=1:10, y=c(0,0,0,1,1,0,0,1,0,1))
    x y
1   1 0
2   2 0
3   3 0
4   4 1
5   5 1
6   6 0
7   7 0
8   8 1
9   9 0
10 10 1

and I want to get the cumulative sum of column x (cumsum(df$x)), but the sum should be reset after a 1 appears in column y. This is the result I am looking for:

How can I achieve this in R?

标签： r

2条回答

家丑人穷心不美

2楼-- · 2020-04-11 19:12

A data.table method using shift

 library(data.table) #devel version `data.table_1.9.5` 
 setDT(d)[, cumsum(x), by = cumsum(shift(y, fill=0))]$V1
 #[1]  1  3  6 10  5  6 13 21  9 19

0人赞添加讨论(0) 举报

趁早两清

3楼-- · 2020-04-11 19:25

You can achieve that by using ave:

ave(d$x,c(0,cumsum(d$y[-nrow(d)])),FUN=cumsum)

#  [1]  1  3  6 10  5  6 13 21  9 19

0人赞添加讨论(0) 举报

Cumulative sum of a column based on the values of

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间