w3resource

Pandas Practice Set-1: Get randomly sample rows from diamonds DataFrame


62. Randomly Sample Rows from Diamonds DataFrame

Write a Pandas program to get randomly sample rows from diamonds DataFrame.

Sample Solution:

Python Code:

import pandas as pd
diamonds = pd.read_csv('https://raw.githubusercontent.com/mwaskom/seaborn-data/master/diamonds.csv')
print("Original Dataframe:")
print(diamonds.head())
print("\nSample 5 rows from the DataFrame without replacement:")
print(diamonds.sample(n=3))

Sample Output:

Original Dataframe:
   carat      cut color clarity  depth  table  price     x     y     z
0   0.23    Ideal     E     SI2   61.5   55.0    326  3.95  3.98  2.43
1   0.21  Premium     E     SI1   59.8   61.0    326  3.89  3.84  2.31
2   0.23     Good     E     VS1   56.9   65.0    327  4.05  4.07  2.31
3   0.29  Premium     I     VS2   62.4   58.0    334  4.20  4.23  2.63
4   0.31     Good     J     SI2   63.3   58.0    335  4.34  4.35  2.75

Sample 5 rows from the DataFrame without replacement:
       carat        cut color clarity  ...   price     x     y     z
44856   0.50  Very Good     D     VS2  ...    1627  5.05  5.08  3.21
30088   0.32      Ideal     H     VS1  ...     720  4.42  4.41  2.73
48144   0.70  Very Good     J     VS2  ...    1940  5.62  5.59  3.54

[3 rows x 10 columns]

For more Practice: Solve these Related Problems:

  • Write a Pandas program to randomly select 10% of the rows from the diamonds DataFrame without replacement.
  • Write a Pandas program to randomly sample 100 rows from the diamonds DataFrame and display the result.
  • Write a Pandas program to extract a random sample of rows from the diamonds DataFrame and then compute summary statistics on the sample.
  • Write a Pandas program to randomly sample rows from the diamonds DataFrame using the sample() function with a fixed random state.

Go to:


Previous: Write a Pandas program to calculate the memory usage for each Series (in bytes) of diamonds DataFrame.
Next: Write a Pandas program to get sample 75% of the diamonds DataFrame's rows without replacement and store the remaining 25% of the rows in another DataFrame.

Python Code Editor:

Have another way to solve this solution? Contribute your code (and comments) through Disqus.

What is the difficulty level of this exercise?



Follow us on Facebook and Twitter for latest update.