w3resource

R Programming: Get the Statistical Summary of a Data Frame

R Programming: Data Frame Exercise-4 with Solution

Write an R program to get the statistical summary and nature of the data of a given data frame.

Sample Solution:

R Programming Code:

# Create a data frame with columns: name, score, attempts, and qualify
exam_data = data.frame(
  # Vector of names
  name = c('Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin', 'Jonas'),
  # Vector of scores
  score = c(12.5, 9, 16.5, 12, 9, 20, 14.5, 13.5, 8, 19),
  # Vector of attempts
  attempts = c(1, 3, 2, 3, 2, 3, 1, 1, 2, 1),
  # Vector of qualify status
  qualify = c('yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes')
)

# Print a message indicating the output of the original data frame
print("Original dataframe:")

# Print the content of the data frame
print(exam_data)

# Print a message indicating that the statistical summary of the data frame will follow
print("Statistical summary and nature of the data of the said dataframe:")

# Print the statistical summary of the data frame
print(summary(exam_data))

Output:

[1] "Original dataframe:"
        name score attempts qualify
1  Anastasia  12.5        1     yes
2       Dima   9.0        3      no
3  Katherine  16.5        2     yes
4      James  12.0        3      no
5      Emily   9.0        2      no
6    Michael  20.0        3     yes
7    Matthew  14.5        1     yes
8      Laura  13.5        1      no
9      Kevin   8.0        2      no
10     Jonas  19.0        1     yes
[1] "Statistical summary and nature of the data of the said dataframe:"
     name               score          attempts      qualify         
 Length:10          Min.   : 8.00   Min.   :1.00   Length:10         
 Class :character   1st Qu.: 9.75   1st Qu.:1.00   Class :character  
 Mode  :character   Median :13.00   Median :2.00   Mode  :character  
                    Mean   :13.40   Mean   :1.90                     
                    3rd Qu.:16.00   3rd Qu.:2.75                     
                    Max.   :20.00   Max.   :3.00                                                      
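
The figures in the score column can be cross-checked with base R functions. The snippet below is a minimal sketch, not part of the original solution: it assumes the default quantile method (type 7), which is also what summary() uses for numeric vectors, and the name score_stats is introduced here purely for illustration.

```r
# Scores copied from the exercise's data frame
score <- c(12.5, 9, 16.5, 12, 9, 20, 14.5, 13.5, 8, 19)

# Recompute each statistic that summary() reports for a numeric column.
# score_stats is an illustrative name, not part of the original solution.
score_stats <- c(
  min    = min(score),
  q1     = unname(quantile(score, 0.25)),  # first quartile (default type 7)
  median = median(score),
  mean   = mean(score),
  q3     = unname(quantile(score, 0.75)),  # third quartile (default type 7)
  max    = max(score)
)

print(score_stats)
```

The six values should line up with the Min., 1st Qu., Median, Mean, 3rd Qu., and Max. rows printed for score above.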

Explanation:

  • Create Data Frame:
    • exam_data = data.frame(...): Creates a data frame named exam_data with four columns: name, score, attempts, and qualify.
  • Define Columns:
    • name = c(...): A vector of names of students.
    • score = c(...): A vector of scores corresponding to the students.
    • attempts = c(...): A vector of the number of attempts each student made.
    • qualify = c(...): A vector indicating whether each student qualified or not.
  • Print Original Data Frame:
    • print("Original dataframe:"): Prints a message indicating that the following output is the original data frame.
    • print(exam_data): Displays the content of the exam_data data frame.
  • Print Statistical Summary:
    • print("Statistical summary and nature of the data of the said dataframe:"): Prints a message indicating that the following output will show the statistical summary of the data frame.
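    • print(summary(exam_data)): Displays a per-column summary of the data frame. Numeric columns (score, attempts) show the minimum, first quartile, median, mean, third quartile, and maximum; character columns (name, qualify) show only their length, class, and mode.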
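
What summary() reports depends on the class of each column, so checking the classes first can help when reading the output. The sketch below is one possible way to do that and is not part of the original solution. It passes stringsAsFactors = FALSE explicitly (the default since R 4.0.0) so the result does not depend on the R version, and it converts qualify to a factor as an assumption that the column is meant to be categorical. The names column_classes and qualify_counts are illustrative.

```r
# Rebuild the exercise's data frame so this sketch runs on its own.
# stringsAsFactors = FALSE keeps text columns as character on any R version.
exam_data <- data.frame(
  name = c('Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin', 'Jonas'),
  score = c(12.5, 9, 16.5, 12, 9, 20, 14.5, 13.5, 8, 19),
  attempts = c(1, 3, 2, 3, 2, 3, 1, 1, 2, 1),
  qualify = c('yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes'),
  stringsAsFactors = FALSE
)

# Inspect the class of every column (the "nature" of the data)
column_classes <- sapply(exam_data, class)
print(column_classes)

# Assumption: qualify is categorical, so store it as a factor
exam_data$qualify <- factor(exam_data$qualify)

# For a factor, summary() counts the rows in each level
qualify_counts <- summary(exam_data$qualify)
print(qualify_counts)

# The data frame summary now shows those counts in the qualify column
print(summary(exam_data))
```

With qualify stored as a factor, its column in the summary lists how many students fall under each level instead of only Length, Class, and Mode.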
