Merging DataFrames with duplicate Keys in Pandas
Pandas: Custom Function Exercise-9 with Solution
Write a Pandas program to merge DataFrames with duplicate Keys.
Following program shows how to merge DataFrames that contain duplicate keys, resulting in a Cartesian product of matching rows.
Sample Solution :
Code :
import pandas as pd
# Create two sample DataFrames with duplicate keys
df1 = pd.DataFrame({
'ID': [1, 1, 2],
'Name': ['Annabel', 'Annabel', 'Selena']
})
df2 = pd.DataFrame({
'ID': [1, 2],
'Age': [25, 30]
})
# Merge the DataFrames with duplicate keys
merged_df = pd.merge(df1, df2, on='ID')
# Output the result
print(merged_df)
Output:
ID Name Age 0 1 Annabel 25 1 1 Annabel 25 2 2 Selena 30
Explanation:
- Created two DataFrames df1 and df2 with duplicate keys in df1.
- Used pd.merge() to merge on the 'ID' column.
- The result is a Cartesian product where matching keys from both DataFrames are combined.
Python-Pandas Code Editor:
Have another way to solve this solution? Contribute your code (and comments) through Disqus.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.
- Weekly Trends and Language Statistics
- Weekly Trends and Language Statistics