Pandas: Split a dataset to group by two columns and then sort the aggregated results within the groups
9. Grouping by Two Columns and Sorting Aggregated Results
Write a Pandas program to split a dataset to group by two columns and then sort the aggregated results within the groups.
In the following dataset group on 'customer_id', 'salesman_id' and then sort sum of purch_amt within the groups
Test Data:
ord_no purch_amt ord_date customer_id salesman_id 0 70001 150.50 2012-10-05 3005 5002 1 70009 270.65 2012-09-10 3001 5005 2 70002 65.26 2012-10-05 3002 5001 3 70004 110.50 2012-08-17 3009 5003 4 70007 948.50 2012-09-10 3005 5002 5 70005 2400.60 2012-07-27 3007 5001 6 70008 5760.00 2012-09-10 3002 5001 7 70010 1983.43 2012-10-10 3004 5006 8 70003 2480.40 2012-10-10 3009 5003 9 70012 250.45 2012-06-27 3008 5002 10 70011 75.29 2012-08-17 3003 5007 11 70013 3045.60 2012-04-25 3002 5001
Sample Solution:
Python Code :
Sample Output:
Original Orders DataFrame: ord_no purch_amt ord_date customer_id salesman_id 0 70001 150.50 2012-10-05 3001 5002 1 70009 270.65 2012-09-10 3001 5005 2 70002 65.26 2012-10-05 3005 5001 3 70004 110.50 2012-08-17 3001 5003 4 70007 948.50 2012-09-10 3005 5002 5 70005 2400.60 2012-07-27 3001 5001 6 70008 5760.00 2012-09-10 3005 5001 7 70010 1983.43 2012-10-10 3001 5006 8 70003 2480.40 2012-10-10 3005 5003 9 70012 250.45 2012-06-27 3001 5002 10 70011 75.29 2012-08-17 3005 5007 11 70013 3045.60 2012-04-25 3005 5001 Group on 'customer_id', 'salesman_id' and then sort sum of purch_amt within the groups: customer_id salesman_id 3001 5001 2400.60 5006 1983.43 5002 400.95 5005 270.65 5003 110.50 3005 5001 8870.86 5003 2480.40 5002 948.50 5007 75.29 Name: purch_amt, dtype: float64
For more Practice: Solve these Related Problems:
- Write a Pandas program to group a sales dataset by customer_id and salesman_id, aggregate the purchase amounts, and then sort the groups by the aggregated sum.
- Write a Pandas program to group by two columns, compute the total purchase amount, and then sort the results in descending order.
- Write a Pandas program to group the dataframe by customer_id and salesman_id and then apply a custom sort to the aggregated values.
- Write a Pandas program to group the dataset by two keys, aggregate using sum, and then sort the output by the summed purchase amount.
Go to:
Previous: Write a Pandas program to split a dataset to group by two columns and count by each row.
Next: Write a Pandas program to split the following dataframe into groups based on customer id and create a list of order date for each group.
Python Code Editor:
Have another way to solve this solution? Contribute your code (and comments) through Disqus.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.