Replacing missing data with column Mean in Pandas
Pandas: Data Cleaning and Preprocessing Exercise-12 with Solution
Write a Pandas program to replacing missing data with mean value.
This exercise shows how to replace missing values in a numerical column with the mean of that column.
Sample Solution :
Code :
import pandas as pd
# Create a sample DataFrame with missing values
df = pd.DataFrame({
'Name': ['Selena', 'Annabel', 'Caeso', 'David'],
'Age': [25, None, 22, None]
})
# Fill missing 'Age' values with the column's mean
df['Age'].fillna(df['Age'].mean(), inplace=True)
# Output the result
print(df)
Output:
Name Age 0 Selena 25.0 1 Annabel 23.5 2 Caeso 22.0 3 David 23.5
Explanation:
- Created a DataFrame with missing values in the 'Age' column.
- Replaced missing values in 'Age' with the mean of the column using fillna().
- Returned the DataFrame with missing data replaced by the mean value.
Python-Pandas Code Editor:
Have another way to solve this solution? Contribute your code (and comments) through Disqus.
What is the difficulty level of this exercise?
Test your Programming skills with w3resource's quiz.
It will be nice if you may share this link in any developer community or anywhere else, from where other developers may find this content. Thanks.
https://w3resource.com/python-exercises/pandas/pandas-replace-missing-data-with-column-mean.php
- Weekly Trends and Language Statistics
- Weekly Trends and Language Statistics