Cross-Tabulation in Pandas: Analyzing DataFrame categories
Python Pandas Numpy: Exercise-37 with Solution
Perform a cross-tabulation between two columns in a DataFrame.
Sample Solution:
Python Code:
import pandas as pd
# Create a sample DataFrame
data = {'Category': ['A', 'B', 'A', 'B', 'C', 'A', 'C', 'C', 'B', 'A'],
'Value': [10, 15, 20, 25, 30, 35, 40, 45, 50, 55]}
df = pd.DataFrame(data)
# Perform a cross-tabulation between 'Category' and 'Value'
cross_tab = pd.crosstab(df['Category'], df['Value'])
# Display the cross-tabulation
print(cross_tab)
Output:
Value 10 15 20 25 30 35 40 45 50 55 Category A 1 0 1 0 0 1 0 0 0 1 B 0 1 0 1 0 0 0 0 1 0 C 0 0 0 0 1 0 1 1 0 0
Explanation:
Here's a breakdown of the above code:
- We create a sample DataFrame (df) with two columns: 'Category' and 'Value'.
- The pd.crosstab(df['Category'], df['Value']) line performs a cross-tabulation between these two columns.
- The resulting "cross_tab" DataFrame shows the frequency of each combination of 'Category' and 'Value'.
Flowchart:
Python Code Editor:
Previous: Rolling Calculation in Pandas DataFrame.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.
It will be nice if you may share this link in any developer community or anywhere else, from where other developers may find this content. Thanks.
https://w3resource.com/python-exercises/pandas_numpy/pandas_numpy-exercise-37.php
- Weekly Trends and Language Statistics
- Weekly Trends and Language Statistics