w3resource

Pandas Practice Set-1: Compute a cross-tabulation of two Series in diamonds DataFrame


35. Compute Cross-Tabulation of Two Series

Write a Pandas program to compute a cross-tabulation of two Series in the diamonds DataFrame.

Sample Solution:

Python Code:

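import pandas as pd

# Load the diamonds dataset from the seaborn-data repository
diamonds = pd.read_csv('https://raw.githubusercontent.com/mwaskom/seaborn-data/master/diamonds.csv')
print("Original Dataframe:")
print(diamonds.head())
print("\nCross-tabulation of two Series of diamonds DataFrame:")
# Rows: unique 'cut' values; columns: unique 'price' values; cells: row counts
print(pd.crosstab(diamonds['cut'], diamonds['price']))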

Sample Output:

Original Dataframe:
   carat      cut color clarity  depth  table  price     x     y     z
0   0.23    Ideal     E     SI2   61.5   55.0    326  3.95  3.98  2.43
1   0.21  Premium     E     SI1   59.8   61.0    326  3.89  3.84  2.31
2   0.23     Good     E     VS1   56.9   65.0    327  4.05  4.07  2.31
3   0.29  Premium     I     VS2   62.4   58.0    334  4.20  4.23  2.63
4   0.31     Good     J     SI2   63.3   58.0    335  4.34  4.35  2.75

Cross-tabulation of two Series of diamonds DataFrame:
price      326    327    334    335    ...    18804  18806  18818  18823
cut                                    ...                              
Fair           0      0      0      0  ...        0      0      0      0
Good           0      1      0      1  ...        0      0      0      0
Ideal          1      0      0      0  ...        1      1      0      0
Premium        1      0      1      0  ...        0      0      0      1
Very Good      0      0      0      0  ...        0      0      1      0

[5 rows x 11602 columns]
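
Because 'price' has thousands of distinct values, the table above is very wide and hard to read. As a minimal sketch of what pd.crosstab counts, the example below uses a tiny hand-made frame instead of the real dataset. The name `sample` and its values are assumptions made up for illustration, not rows taken from diamonds.csv.

```python
import pandas as pd

# Tiny hand-made sample (hypothetical values, not the real diamonds data)
sample = pd.DataFrame({
    "cut":   ["Ideal", "Premium", "Good", "Premium", "Good", "Ideal"],
    "color": ["E", "E", "E", "I", "J", "E"],
})

# Rows are the unique values of 'cut', columns the unique values of 'color';
# each cell counts how many rows have that (cut, color) pair.
table = pd.crosstab(sample["cut"], sample["color"])
print(table)
```

In this sample the cell for cut 'Ideal' and color 'E' is 2, since two rows have that pair, and the cells of the whole table add up to the 6 rows of the frame.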

For more Practice: Solve these Related Problems:

  • Write a Pandas program to compute a cross-tabulation between 'cut' and 'color' in the diamonds DataFrame.
  • Write a Pandas program to generate a crosstab of 'cut' vs. 'clarity' and display the frequency table.
  • Write a Pandas program to compute a cross-tabulation between two categorical columns and then normalize the results.
  • Write a Pandas program to create a pivot table (cross-tab) of 'cut' and 'color' with custom aggregation functions.
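
One possible starting point for the normalization and custom-aggregation problems above is sketched here. It again uses a small made-up frame (the name `sample` and all its values are assumptions for illustration), so it runs without downloading the dataset; the same calls should apply to the diamonds columns.

```python
import pandas as pd

# Hypothetical sample with the same column names as the diamonds DataFrame
sample = pd.DataFrame({
    "cut":   ["Ideal", "Premium", "Good", "Premium", "Good", "Ideal"],
    "color": ["E", "E", "E", "I", "J", "E"],
    "price": [326, 326, 327, 334, 335, 400],
})

# normalize="index" turns counts into row-wise proportions,
# so every row of the result sums to 1.
row_share = pd.crosstab(sample["cut"], sample["color"], normalize="index")
print(row_share)

# values + aggfunc aggregate a third column instead of counting rows:
# here, the mean price for each (cut, color) pair. Pairs that never
# occur in the data come out as NaN.
mean_price = pd.crosstab(
    sample["cut"], sample["color"],
    values=sample["price"], aggfunc="mean",
)
print(mean_price)
```

Passing normalize="columns" or normalize="all" instead gives column-wise proportions or shares of the grand total.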
