w3resource

R Programming: Get the Statistical Summary of a data Frame


Write a R program to get the statistical summary and nature of the data of a given data frame.

Sample Solution :

R Programming Code :

# Create a data frame with columns: name, score, attempts, and qualify
exam_data = data.frame(
  # Vector of names
  name = c('Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin', 'Jonas'),
  # Vector of scores
  score = c(12.5, 9, 16.5, 12, 9, 20, 14.5, 13.5, 8, 19),
  # Vector of attempts
  attempts = c(1, 3, 2, 3, 2, 3, 1, 1, 2, 1),
  # Vector of qualify status
  qualify = c('yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes')
)

# Print a message indicating the output of the original data frame
print("Original dataframe:")

# Print the content of the data frame
print(exam_data)

# Print a message indicating that the statistical summary of the data frame will follow
print("Statistical summary and nature of the data of the said dataframe:")

# Print the statistical summary of the data frame
print(summary(exam_data))

Output:

[1] "Original dataframe:"
        name score attempts qualify
1  Anastasia  12.5        1     yes
2       Dima   9.0        3      no
3  Katherine  16.5        2     yes
4      James  12.0        3      no
5      Emily   9.0        2      no
6    Michael  20.0        3     yes
7    Matthew  14.5        1     yes
8      Laura  13.5        1      no
9      Kevin   8.0        2      no
10     Jonas  19.0        1     yes
[1] "Statistical summary and nature of the data of the said dataframe:"
     name               score          attempts      qualify         
 Length:10          Min.   : 8.00   Min.   :1.00   Length:10         
 Class :character   1st Qu.: 9.75   1st Qu.:1.00   Class :character  
 Mode  :character   Median :13.00   Median :2.00   Mode  :character  
                    Mean   :13.40   Mean   :1.90                     
                    3rd Qu.:16.00   3rd Qu.:2.75                     
                    Max.   :20.00   Max.   :3.00                                                      

Explanation:

  • Create Data Frame:
    • exam_data = data.frame(...): Creates a data frame named exam_data with four columns: name, score, attempts, and qualify.
  • Define Columns:
    • name = c(...): A vector of names of students.
    • score = c(...): A vector of scores corresponding to the students.
    • attempts = c(...): A vector of the number of attempts each student made.
    • qualify = c(...): A vector indicating whether each student qualified or not.
  • Print Original Data Frame:
    • print("Original dataframe:"): Prints a message indicating that the following output is the original data frame.
    • print(exam_data): Displays the content of the exam_data data frame.
  • Print Statistical Summary:
    • print("Statistical summary and nature of the data of the said dataframe:"): Prints a message indicating that the following output will show the statistical summary of the data frame.
    • print(summary(exam_data)): Displays a statistical summary of the data frame, including summary statistics for each column.

    R Programming Code Editor:



    Have another way to solve this solution? Contribute your code (and comments) through Disqus.

    Previous: Write a R program to get the structure of a given data frame.
    Next: Write a R program to extract specific column from a data frame using column name.

    Test your Programming skills with w3resource's quiz.

What is the difficulty level of this exercise?



Follow us on Facebook and Twitter for latest update.