Validating Email format in a column using Regex in Pandas
Pandas: Data Validation Exercise-13 with Solution
Write a Pandas program that validates the format of email addresses.
The following exercise validates the format of email addresses in a column using a regular expression.
Sample Solution:
Code:
import pandas as pd
# Create a sample DataFrame with email addresses
df = pd.DataFrame({
    'Email': ['user@example.com', 'invalid-email', 'admin@example.org']  # example.com placeholder addresses
})
# Validate email format using a regular expression
valid_emails = df['Email'].str.contains(r'^[\w\.-]+@[\w\.-]+\.\w+$')
# Output the result
print(valid_emails)
Output:
0     True
1    False
2     True
Name: Email, dtype: bool
Explanation:
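- Created a DataFrame with an Email column of sample addresses.
- Used str.contains() with a regex anchored by ^ and $, so the whole string must match: one or more word characters, dots, or hyphens, then @, then a domain made of the same characters, then a dot and a final run of word characters.
- Printed a Boolean Series indicating whether each address matches the pattern.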
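A possible variation, shown here only as a sketch: str.fullmatch() (available in pandas 1.1 and later) requires the entire string to match, so the ^ and $ anchors can be dropped, and na=False marks missing values as invalid. The sample addresses, the IsValid column name, and the invalid_rows variable are assumptions made for illustration and are not part of the original exercise.

```python
import pandas as pd

# Hypothetical sample data: placeholder addresses plus one missing value
df = pd.DataFrame({
    'Email': ['alice@example.com', 'invalid-email', 'bob@example.org', None]
})

# Same character classes as the pattern above, without the anchors:
# fullmatch() already requires the whole string to match
pattern = r'[\w\.-]+@[\w\.-]+\.\w+'

# na=False reports missing values as False instead of propagating them
df['IsValid'] = df['Email'].str.fullmatch(pattern, na=False)

# Select the rows that failed validation for review
invalid_rows = df[~df['IsValid']]
print(invalid_rows)
```

This pattern is a loose format check rather than a full address validator; for example, it accepts consecutive dots in the local part.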