w3resource

Pandas: Extract only words from a given column of a given DataFrame


37. Extract Only Words from Column

Write a Pandas program to extract only words from a given column of a given DataFrame.

Sample Solution:

Python Code :

import pandas as pd
import re as re
df = pd.DataFrame({
    'company_code': ['Abcd','EFGF', 'zefsalf', 'sdfslew', 'zekfsdf'],
    'date_of_sale': ['12/05/2002','16/02/1999','05/09/1998','12/02/2022','15/09/1997'],
    'address': ['9910 Surrey Ave.','92 N. Bishop Ave.','9910 Golden Star Ave.', '102 Dunbar St.', '17 West Livingston Court']
})
print("Original DataFrame:")
print(df)

def search_words(text):
    result = re.findall(r'\b[^\d\W]+\b', text)
    return " ".join(result)

df['only_words']=df['address'].apply(lambda x : search_words(x))
print("\nOnly words:")
print(df)

Sample Output:

Original DataFrame:
  company_code date_of_sale                   address
0         Abcd   12/05/2002          9910 Surrey Ave.
1         EFGF   16/02/1999         92 N. Bishop Ave.
2      zefsalf   05/09/1998     9910 Golden Star Ave.
3      sdfslew   12/02/2022            102 Dunbar St.
4      zekfsdf   15/09/1997  17 West Livingston Court

Only words:
  company_code          ...                       only_words
0         Abcd          ...                       Surrey Ave
1         EFGF          ...                     N Bishop Ave
2      zefsalf          ...                  Golden Star Ave
3      sdfslew          ...                        Dunbar St
4      zekfsdf          ...            West Livingston Court

[5 rows x 4 columns]

For more Practice: Solve these Related Problems:

  • Write a Pandas program to extract only alphabetic words from a DataFrame column using regex and then output them as a list.
  • Write a Pandas program to filter a text column for words only and then create a new column with the cleaned text.
  • Write a Pandas program to remove any numeric or special characters from a column and then display only the alphabetic words.
  • Write a Pandas program to apply a regex pattern to extract words from a column and then join them back into a cleaned sentence.

Go to:


Previous: Write a Pandas program to extract date (format: mm-dd-yyyy) from a given column of a given DataFrame.
Next: Write a Pandas program to extract the sentences where a specific word is present in a given column of a given DataFrame.

Python Code Editor:

Have another way to solve this solution? Contribute your code (and comments) through Disqus.

What is the difficulty level of this exercise?

Test your Programming skills with w3resource's quiz.



Follow us on Facebook and Twitter for latest update.