I have a pandas dataframe that looks like this
ID country month revenue profit ebit
234 USA 201409 10 5 3
344 USA 201409 9 7 2
532 UK 201410 20 10 5
129 Canada 201411 15 10 5
I want to group by ID, country, month and count the IDs per month and country and sum the revenue, profit, ebit. The output for the above data would be:
country month revenue profit ebit count
USA 201409 19 12 5 2
UK 201409 20 10 5 1
Canada 201411 15 10 5 1
I have tried different variations of groupby, sum and count functions of pandas but I am unable to figure out how to apply groupby sum and count all together to give the result as shown. Please share any ideas that you might have. Thanks!
It can be done using
pivot_table
this way:You can do the groupby, and then map the counts of each country to a new column.
Edit
To get the counts per country and month, you can do another groupby, and then join the two DataFrames together.
I added another record for the UK with a different date - notice how there are now two UK entries in the merged DataFrame, with the appropriate counts.