I have a data frame, donorInfo, with donor information:
id giftdate giftamt
002 2001-01-05 25.00
033 2001-05-08 50.00
054 2001-09-22 125.00
125 2001-11-05 40.00
042 2001-12-04 75.00
... ... ...
I'd like to create a column that shows the cumulative number of unique donor IDs up to that date. I think it's something like:
donorInfo$numUnique <- apply/lapply (donorInfo, 1, FUN=nrow(unique(donorInfo$id)))
Unfortunately this isn't working, and I'm wondering how to fix it. Thanks for any suggestions.
You can do this with duplicated() and cumsum() (taking advantage of the fact that logical vectors can be coerced to numeric vectors, with TRUE counted as 1 and FALSE as 0). Try something like this:
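The answer's original code isn't shown here, so what follows is only a minimal sketch of the duplicated()/cumsum() idea, not necessarily what was posted. The sample rows are modelled on the question, with one repeat gift from donor "002" added as an assumption so that the running count visibly differs from the row number. Sorting by giftdate first is also an assumption, made so that "cumulative" means "up to that date".

```r
# Example data modelled on the question. The fourth row (a second gift
# from donor "002") is a made-up addition for illustration only.
donorInfo <- data.frame(
  id       = c("002", "033", "054", "002", "125", "042"),
  giftdate = as.Date(c("2001-01-05", "2001-05-08", "2001-09-22",
                       "2001-10-01", "2001-11-05", "2001-12-04")),
  giftamt  = c(25, 50, 125, 30, 40, 75),
  stringsAsFactors = FALSE
)

# Sort by date so the running count follows chronological order
donorInfo <- donorInfo[order(donorInfo$giftdate), ]

# duplicated() is TRUE for ids that have already appeared, so its
# negation flags first appearances; cumsum() then adds them up,
# treating TRUE as 1 and FALSE as 0
donorInfo$numUnique <- cumsum(!duplicated(donorInfo$id))

print(donorInfo)
```

Note that if several gifts share the same date, each row on that date still gets its own running count, so the rows for that day may show different values. If you need one value per date, you'd have to take the per-date maximum afterwards.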
Not the most efficient solution no doubt, but it should work.