## What is the five number summary?

The five number summary is a set of five descriptive statistics that summarizes the center, spread, and range of a dataset. It is commonly used in statistical analysis to understand the characteristics of a dataset and to flag potential outliers or anomalies.

## What does the five number summary include?

The five number summary includes five key statistics:

- **Minimum value**: The smallest value in the dataset.
- **First quartile (Q1)**: The value that separates the lowest 25% of the data from the highest 75%.
- **Median**: The value that separates the lowest 50% of the data from the highest 50%.
- **Third quartile (Q3)**: The value that separates the lowest 75% of the data from the highest 25%.
- **Maximum value**: The largest value in the dataset.

## How is the five number summary calculated?

To calculate the five number summary for a dataset, follow these steps:

- Order the data from smallest to largest.
- Take the minimum: the first (smallest) value in the ordered data.
- Find the first quartile (Q1), also known as the lower quartile: the value that separates the lowest 25% of the data from the highest 75%. One common convention takes it as the median of the lower half of the ordered data; other conventions exist, so different software can report slightly different quartiles.
- Find the median, also known as the second quartile: the value that separates the lowest 50% of the data from the highest 50%. With an even number of values, it is the average of the two middle values.
- Find the third quartile (Q3), also known as the upper quartile: the value that separates the lowest 75% of the data from the highest 25%. Under the same convention, it is the median of the upper half of the ordered data.
- Take the maximum: the last (largest) value in the ordered data.
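
The steps above can be sketched in Python. This is a minimal illustration, not a definitive implementation: the function name `five_number_summary` is hypothetical, and it assumes the "median of each half" quartile convention (with the middle value left out of both halves when the count is odd). Libraries that interpolate quartiles may return slightly different Q1 and Q3 values.

```python
from statistics import median


def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a non-empty dataset.

    Assumption: quartiles are the medians of the lower and upper halves,
    excluding the middle value when the number of values is odd.
    """
    data = sorted(values)  # Step 1: order the data
    n = len(data)
    if n == 0:
        raise ValueError("need at least one value")

    half = n // 2
    lower = data[:half]
    # Skip the middle element for odd n so it belongs to neither half.
    upper = data[half + 1:] if n % 2 else data[half:]

    # A single-value dataset has empty halves; fall back to that value.
    q1 = median(lower) if lower else data[0]
    q3 = median(upper) if upper else data[-1]
    return (data[0], q1, median(data), q3, data[-1])


print(five_number_summary([7, 1, 3, 9, 5, 11, 13]))
```

For the seven values above, the ordered data is 1, 3, 5, 7, 9, 11, 13, so the lower half is 1, 3, 5 and the upper half is 9, 11, 13.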

## What can the five number summary tell us about a dataset?

The five number summary can provide valuable insights into the characteristics of a dataset. Some key points to consider include:

- **Spread**: The range (maximum minus minimum) and the interquartile range, or IQR (Q3 minus Q1), both measure the dispersion of the data. Larger values indicate that the data is more dispersed, while smaller values suggest that it is more concentrated. The IQR covers only the middle 50% of the data, so it is less affected by extreme values than the range.
- **Skewness**: The position of the median between the first and third quartiles gives an indication of skewness. If the median is closer to Q1, the upper half of the middle data is more stretched out, and the data may be skewed to the right (positively skewed). If the median is closer to Q3, the data may be skewed to the left (negatively skewed).
- **Outliers**: Values that are much larger or smaller than the rest of the dataset may be considered outliers. A common rule of thumb flags any value below Q1 − 1.5 × IQR or above Q3 + 1.5 × IQR. If the minimum or maximum falls outside these limits, the dataset contains at least one potential outlier.
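
As a rough sketch of how these readings can be derived from the five values, the Python function below computes the range, the IQR, and the 1.5 × IQR limits, and gives a skewness hint. The function name `describe_summary` and the dictionary keys are hypothetical. The 1.5 multiplier is a widely used rule of thumb rather than a strict definition, and the skewness hint follows the usual box plot reading: a median nearer Q1 leaves a longer upper section, which points to right skew.

```python
def describe_summary(minimum, q1, med, q3, maximum):
    """Derive spread, skewness, and outlier hints from a five number summary."""
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr  # values below this are potential outliers
    upper_fence = q3 + 1.5 * iqr  # values above this are potential outliers

    # Compare the two sections of the box on either side of the median.
    upper_part = q3 - med
    lower_part = med - q1
    if upper_part > lower_part:
        skew_hint = "right"  # median closer to Q1
    elif upper_part < lower_part:
        skew_hint = "left"  # median closer to Q3
    else:
        skew_hint = "symmetric"

    return {
        "range": maximum - minimum,
        "iqr": iqr,
        "lower_fence": lower_fence,
        "upper_fence": upper_fence,
        "skew_hint": skew_hint,
        "has_low_outlier": minimum < lower_fence,
        "has_high_outlier": maximum > upper_fence,
    }


# Illustrative summary values, not taken from a real dataset.
for key, value in describe_summary(1, 3, 5, 11, 40).items():
    print(f"{key}: {value}")
```

Only the minimum and maximum are checked here, so the result says whether at least one potential outlier exists on each side, not how many there are. Counting them requires the full dataset.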

## How can the five number summary be visualized?

The five number summary is most commonly visualized using a box plot, also known as a box and whisker plot.

To construct a box plot, you can follow these steps:

- Draw a horizontal axis scaled to cover the data, and mark the five values of the summary along it.
- Draw a box extending from the first quartile (Q1) to the third quartile (Q3). The length of the box is the interquartile range (IQR), which is the difference between Q3 and Q1.
- Draw a vertical line inside the box at the median.
- Draw “whiskers” extending from each end of the box to the minimum and maximum values.

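This creates a visual representation of the spread and skewness of the data. When the whiskers run all the way to the minimum and maximum, no values fall outside them. A common variant instead ends each whisker at the most extreme value within 1.5 × IQR of the box and plots anything beyond that as individual points, which makes potential outliers visible.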
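
To make the construction concrete, here is a small text-only sketch in Python that lays out the five values on a single line of characters. The function name `ascii_box_plot`, the `width` parameter, and the choice of symbols are all hypothetical; a plotting library would normally be used for real charts. This version draws the whiskers all the way to the minimum and maximum.

```python
def ascii_box_plot(minimum, q1, med, q3, maximum, width=41):
    """Render a one-line box plot: |--[==M==]--| (whiskers reach min and max)."""
    if maximum == minimum:
        raise ValueError("minimum and maximum must differ to set a scale")

    def col(x):
        # Map a data value to a character column on the axis.
        return round((x - minimum) * (width - 1) / (maximum - minimum))

    row = [" "] * width
    for i in range(col(minimum), col(maximum) + 1):
        row[i] = "-"  # whiskers span the full range
    for i in range(col(q1), col(q3) + 1):
        row[i] = "="  # box spans Q1 to Q3, so its length is the IQR
    row[col(minimum)] = "|"  # whisker ends
    row[col(maximum)] = "|"
    row[col(q1)] = "["  # box edges (drawn over whisker ends if they coincide)
    row[col(q3)] = "]"
    row[col(med)] = "M"  # median marker inside the box
    return "".join(row)


print(ascii_box_plot(1, 3, 7, 11, 13, width=13))
```

With these values and a width of 13, each column corresponds to one unit on the axis, so the box starts two columns from the left edge and the median marker sits in the middle.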

