Exploring Statistical Displays: Dot Plots, Histograms, and Box Plots

Situation $1$	A tech company recently launched a new smartphone model. They conduct a survey where customers rate their satisfaction with the phone on a scale of $1$ to $10.$
Situation $2$	A survey is conducted to gather data on the distribution of ages among participants in a community event.
Situation $3$	A teacher wants to analyze the distribution of test scores in her class by finding the median and quartiles of the scores.
Situation $4$	A study tracks the temperature variations over the course of a week in a particular city.

Step $1$	Determine the center (median) by finding the middle data point.
Step $2$	Find the maximum and minimum values on the graph. Use these values to calculate the spread (range) of the data.
Step $3$	Analyze the overall shape of the graph. Note any other interesting features it may have.

Step $1$	Title the plot based on the problem. Draw a number line to begin the dot plot, being sure to use values that are appropriate for the data set.
Step $2$	Determine the frequency of each value.
Step $3$	Above each number on the number line, place as many dots as the frequency of that value in the data set.
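
The three steps above can be sketched in Python. This is only an illustration and not part of the original lesson: the function name `text_dot_plot` and the sample ratings are hypothetical, and the plot is drawn as plain text with one `*` per data point.

```python
from collections import Counter

def text_dot_plot(values, title):
    """Return a plain-text dot plot: one '*' per occurrence of each value.

    Column alignment assumes single-digit whole numbers.
    """
    counts = Counter(values)              # Step 2: frequency of each value
    low, high = min(values), max(values)  # Step 1: number line covers the data
    tallest = max(counts.values())
    lines = [title]
    # Step 3: build rows from the top down so dots stack above the number line.
    for level in range(tallest, 0, -1):
        row = ["*" if counts[v] >= level else " " for v in range(low, high + 1)]
        lines.append("  ".join(row).rstrip())
    lines.append("  ".join(str(v) for v in range(low, high + 1)))
    return "\n".join(lines)

# Hypothetical sample data, made up for illustration only.
ratings = [3, 4, 4, 5, 5, 5, 7]
plot = text_dot_plot(ratings, "Ratings (hypothetical)")
print(plot)
```

Every value between the minimum and maximum gets a column, so a value with no dots (here, $6$) shows up as an empty space on the number line.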

Grade	Frequency
$1$	$1$
$2$	$0$
$3$	$1$
$4$	$2$
$5$	$0$
$6$	$3$
$7$	$3$
$8$	$2$
$9$	$1$
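
The analysis steps (center, spread, shape) can be applied to this frequency table with a short Python sketch. The variable names are illustrative choices, and the table is assumed to list every data point in the set.

```python
# Frequencies copied from the Grade/Frequency table above.
frequency = {1: 1, 2: 0, 3: 1, 4: 2, 5: 0, 6: 3, 7: 3, 8: 2, 9: 1}

# Expand the table back into an ordered list of individual data points.
points = [grade for grade in sorted(frequency) for _ in range(frequency[grade])]

# Center: with an odd count the median is the single middle point,
# otherwise it is the mean of the two middle points.
n = len(points)
if n % 2 == 1:
    median = points[n // 2]
else:
    median = (points[n // 2 - 1] + points[n // 2]) / 2

# Spread: use only the grades that actually occur.
observed = [g for g, f in frequency.items() if f > 0]
data_range = max(observed) - min(observed)

# Shape: peaks are the most frequent grades, gaps have no data points.
top = max(frequency.values())
peaks = [g for g, f in frequency.items() if f == top]
gaps = [g for g, f in frequency.items() if f == 0]

print(n, median, data_range, peaks, gaps)
```

For this table the sketch reports $13$ data points, a median of $6,$ a range of $8,$ peaks at $6$ and $7,$ and gaps at $2$ and $5.$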

Interval	Data Points	Frequency
$1 - 10$	$4,$ $8$	$2$
$11 - 20$	$11,$ $11,$ $13,$ $15,$ $17,$ $19$	$6$
$21 - 30$	$21,$ $25,$ $26$	$3$
$31 - 40$	$37$	$1$
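
Sorting data points into equal intervals is simple arithmetic, as the following Python sketch suggests. The helper name `interval_of` and the constants `START` and `WIDTH` are assumptions made for this illustration, and the approach assumes whole-number data.

```python
# Data points listed in the table above.
data = [4, 8, 11, 11, 13, 15, 17, 19, 21, 25, 26, 37]

WIDTH = 10  # each interval spans ten whole numbers: 1-10, 11-20, ...
START = 1   # lower bound of the first interval

def interval_of(value):
    """Return the (low, high) interval that contains a whole-number value."""
    index = (value - START) // WIDTH
    low = START + index * WIDTH
    return (low, low + WIDTH - 1)

# Group the data points by interval to rebuild the frequency table.
table = {}
for value in data:
    table.setdefault(interval_of(value), []).append(value)

for (low, high), members in sorted(table.items()):
    print(f"{low} - {high}: {members} -> frequency {len(members)}")
```

Subtracting `START` before dividing keeps boundary values such as $10$ and $20$ in the lower interval, matching the table.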

Step $1$	Identify the independent and dependent variables.
Step $2$	List the frequency in each bar.
Step $3$	Interpret the data and describe the bar graph's shape. Use the interpretation to answer any questions about the data.

Day	Frequency
Monday	$64$
Tuesday	$70$
Wednesday	$62$
Thursday	$137$
Friday	$295$
Saturday	$342$
Sunday	$260$
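
The interpretation steps for a bar graph can be sketched in Python using the table above. The variable names and the scale of one `#` per $20$ units are choices made for this illustration only.

```python
# Frequencies copied from the Day/Frequency table above.
# Independent variable: day of the week. Dependent variable: frequency.
frequency_by_day = {
    "Monday": 64, "Tuesday": 70, "Wednesday": 62, "Thursday": 137,
    "Friday": 295, "Saturday": 342, "Sunday": 260,
}

# Interpret the data: tallest bar, shortest bar, and overall total.
busiest = max(frequency_by_day, key=frequency_by_day.get)
quietest = min(frequency_by_day, key=frequency_by_day.get)
total = sum(frequency_by_day.values())

# Plain-text bar graph, roughly one '#' per 20 units.
for day, count in frequency_by_day.items():
    print(f"{day:<10}{'#' * round(count / 20)} {count}")

print(f"Highest: {busiest}, lowest: {quietest}, total: {total}")
```

The bars are low from Monday to Wednesday, rise on Thursday, and peak on Saturday, so the graph is far from uniform across the week.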

Step $1$	Choose the number of intervals.
Step $2$	Determine the size of the intervals.
Step $3$	Make a frequency table.
Step $4$	Draw the histogram.

Interval	Data Points	Frequency
$1 - 15$	$7,$ $9,$ $11,$ $13,$ $15$	$5$
$16 - 30$	$17,$ $18,$ $19,$ $20,$ $21,$ $22,$ $23,$ $24,$ $24,$ $24,$ $25,$ $26,$ $27,$ $28,$ $30$	$15$
$31 - 45$	$32,$ $35,$ $37,$ $39,$ $42,$ $45$	$6$
$46 - 60$	$49,$ $55,$ $60$	$3$
$61 - 75$	$70$	$1$
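
The four histogram steps can be traced in a short Python sketch using the data points from the table above. Starting the first interval at $1$ and the constant names are assumptions for this illustration. The sketch also computes the smallest whole-number interval size that would cover the data, for comparison with the size of $15$ used in the table.

```python
import math

# Data points listed in the frequency table above.
data = [7, 9, 11, 13, 15, 17, 18, 19, 20, 21, 22, 23, 24, 24, 24,
        25, 26, 27, 28, 30, 32, 35, 37, 39, 42, 45, 49, 55, 60, 70]

NUM_INTERVALS = 5  # Step 1: number of intervals
START = 1          # assumed lower bound of the first interval

# Step 2: smallest whole-number size that still reaches the largest value.
min_width = math.ceil((max(data) - START + 1) / NUM_INTERVALS)
WIDTH = 15         # the table uses a slightly larger, rounder size

# Step 3: frequency table.
frequencies = [0] * NUM_INTERVALS
for value in data:
    frequencies[(value - START) // WIDTH] += 1

# Step 4: plain-text histogram, one '#' per data point.
for i, count in enumerate(frequencies):
    low = START + i * WIDTH
    print(f"{low:>2} - {low + WIDTH - 1:<2} | {'#' * count} ({count})")
```

Any size of at least `min_width` covers all the data with five intervals. A rounder size such as $15$ simply gives interval endpoints that are easier to read.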

a. To write the five-number summary for the box plot, determine its minimum, maximum, median, and first and third quartiles.

Minimum $= \ ?$, Maximum $= \ ?$, Median $= \ ?$, First Quartile $= \ ?$, Third Quartile $= \ ?$

Recall what each part of a box plot represents.

Now consider the given box plot again.

By comparing the general box plot to this one, the minimum, maximum, median, first and third quartiles can be easily identified.

Minimum $= 24$, Maximum $= 98$, Median $= 63.5$, First Quartile $= 39$, Third Quartile $= 83$

The values alone are not very helpful, so to contextualize the values, consider what each of them means in the given context.

Concept	Value	Meaning
Minimum	$24$	The lowest level of pollution in the analyzed areas earned $24$ out of $100$ points.
Maximum	$98$	The highest level of pollution in the analyzed areas earned $98$ out of $100$ points.
Median	$63.5$	The median pollution score is $63.5$ out of $100.$ Half of the analyzed areas scored at or below this value, and half scored at or above it.
First Quartile	$39$	A quarter of the analyzed areas have a pollution score of $39$ or lower.
Third Quartile	$83$	A quarter of the analyzed areas have a pollution score of $83$ or higher.
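
Two measures of spread follow directly from the five-number summary. The Python sketch below uses the values identified above. The dictionary keys are names chosen for this illustration.

```python
# Five-number summary identified from the box plot.
summary = {"min": 24, "q1": 39, "median": 63.5, "q3": 83, "max": 98}

# Range: spread of all the scores.
data_range = summary["max"] - summary["min"]

# Interquartile range: spread of the middle half of the scores.
iqr = summary["q3"] - summary["q1"]

print(f"Range: {data_range}")
print(f"Interquartile range: {iqr}")
```

The range is $74$ points, while the middle half of the scores spans $44$ points.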

b. A box plot can be constructed by following four steps.

Step $1$	Order the data set from least to greatest. Identify the minimum and maximum values.
Step $2$	Determine the median.
Step $3$	Determine the first and third quartiles.
Step $4$	Draw the box plot.

Complete each step one at a time.

Step $1$

Start by ordering the given data values from least to greatest.

7, 9, 12, 14, 18, 22, 25, 26, 32, 36, 38, 42, 45, 48, 52, 55, 60, 63, 69, 71

Now the minimum and maximum are easily identifiable in this ordered data set. Here, the minimum is $7$ and the maximum is $71.$

Step $2$

To find the median of the data set, count how many values there are in the data set.

Position	$1$	$2$	$3$	$4$	$5$	$6$	$7$	$8$	$9$	$10$	$11$	$12$	$13$	$14$	$15$	$16$	$17$	$18$	$19$	$20$
Value	$7$	$9$	$12$	$14$	$18$	$22$	$25$	$26$	$32$	$36$	$38$	$42$	$45$	$48$	$52$	$55$	$60$	$63$	$69$	$71$

Now look for the value that lies in the middle of the data set. Since there are $20$ values, the median is the mean of the numbers in the $10$th and $11$th positions. In the ordered list, the $10$th value is $36$ and the $11$th value is $38.$

$\text{Median} = \dfrac{36 + 38}{2} = 37$

The median is $37.$

Step $3$

The next step is to find the first and third quartiles of the data set. The median divides the set into two smaller sets, each with $10$ values.

Set 1 (below the median): $7, 9, 12, 14, 18, 22, 25, 26, 32, 36$
Median: $37$
Set 2 (above the median): $38, 42, 45, 48, 52, 55, 60, 63, 69, 71$

The first quartile is the median of the first set. Since this set has $10$ values, this means finding the mean of the $5$th and $6$th values, which are $18$ and $22.$

$Q_{1} = \dfrac{18 + 22}{2} = 20$

The third quartile is the median of the second set. For this half of the data set, the median is the mean of the $5$th and $6$th values, which are $52$ and $55.$

$Q_{3} = \dfrac{52 + 55}{2} = 53.5$

Step $4$

To draw a box plot, organize the five-number summary of the data set.

Minimum $= 7$, Maximum $= 71$, Median $= 37$, $Q_{1} = 20$, $Q_{3} = 53.5$
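
The five-number summary can be double-checked with a short Python sketch. The function name `five_number_summary` is hypothetical. It finds each quartile as the median of one half of the ordered data, as in the steps above. Leaving the overall median out of both halves when the number of values is odd is an assumption, since the worked example only covers an even count.

```python
from statistics import median

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) using the median-of-halves method."""
    ordered = sorted(values)                 # Step 1: order the data
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]                   # values below the median position
    upper = ordered[n - half:]               # values above the median position
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])

# Data set from the worked example.
data = [7, 9, 12, 14, 18, 22, 25, 26, 32, 36,
        38, 42, 45, 48, 52, 55, 60, 63, 69, 71]

minimum, q1, med, q3, maximum = five_number_summary(data)
print(minimum, q1, med, q3, maximum)
```

Other quartile conventions exist, and software packages may return slightly different quartiles for the same data.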

Mark the minimum and maximum values above a number line with two vertical segments, indicating the range of the box plot.

Next, mark the median with a vertical line segment inside the range above the number line. Remember that the line for the median falls inside the box.

The first and third quartiles are marked as the left and right sides of the box plot. The box plot can be completed by drawing a box between the quartiles and two horizontal segments between the left and right sides of the box and the minimum and maximum values.

The box plot is complete.

Concept	Definition
Cluster	Data values that are grouped closely together
Gap	Numbers that have no data values
Peak	The most frequently occurring values, or the mode
Symmetry	How the left side of the distribution looks compared to the right side
Outlier	A data value that does not seem to fit with the rest of the set
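
The definition of an outlier above is informal. One common convention, which is an assumption here and not stated in this lesson, flags any value that lies more than $1.5$ times the interquartile range below the first quartile or above the third quartile. The Python sketch below applies that rule to the $30$ data points from the histogram example. The function name `quartiles` is hypothetical.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    ordered = sorted(values)
    half = len(ordered) // 2
    return median(ordered[:half]), median(ordered[len(ordered) - half:])

# Data points from the histogram frequency table earlier in the lesson.
data = [7, 9, 11, 13, 15, 17, 18, 19, 20, 21, 22, 23, 24, 24, 24,
        25, 26, 27, 28, 30, 32, 35, 37, 39, 42, 45, 49, 55, 60, 70]

q1, q3 = quartiles(data)
iqr = q3 - q1

# Assumed convention: fences sit 1.5 * IQR beyond each quartile.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [v for v in data if v < lower_fence or v > upper_fence]

print(q1, q3, iqr, outliers)
```

Under this convention only the value $70$ is flagged. It is also the lone data point in the last interval of the histogram table.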

	Measure of Center	Measure of Spread
Symmetric Distribution	Mean	Mean absolute deviation
Non-Symmetric Distribution	Median	Interquartile range
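
The mean and mean absolute deviation from the table can be computed as in the following Python sketch. It reuses the $20$ values from the box plot example. The function name `mean_absolute_deviation` is a hypothetical helper written for this illustration.

```python
from statistics import mean, median

def mean_absolute_deviation(values):
    """Average distance of the data points from their mean."""
    center = mean(values)
    return mean([abs(v - center) for v in values])

# Data set from the box plot example.
data = [7, 9, 12, 14, 18, 22, 25, 26, 32, 36,
        38, 42, 45, 48, 52, 55, 60, 63, 69, 71]

data_mean = mean(data)
mad = mean_absolute_deviation(data)
data_median = median(data)

print(f"Mean: {data_mean:.1f}, MAD: {mad:.1f}, Median: {data_median}")
```

For this data set the mean ($37.2$) and the median ($37$) are close, which is consistent with a roughly symmetric distribution. When the two differ noticeably, the table suggests using the median and interquartile range instead.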

	Plant Growth
Week	$1$	$2$	$3$	$4$	$5$
Height (in.)	$1.5$	$2.3$	$4$	$6.2$	$8$

Hour	Distance Traveled (mi)
$1$	$70$
$2$	$135$
$3$	$203$
$4$	$278$
$5$	$348$
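
A line graph shows change over time, and that change can be read off a table like this one by subtracting consecutive rows. The Python sketch below assumes that the distances are cumulative totals and that the trip starts at hour $0.$ The table does not state either assumption explicitly.

```python
# Cumulative distance (mi) at the end of each hour, from the table above.
distance = {1: 70, 2: 135, 3: 203, 4: 278, 5: 348}

hours = sorted(distance)

# Distance covered during each later hour: difference between consecutive rows.
per_hour = {h: distance[h] - distance[prev] for prev, h in zip(hours, hours[1:])}

# Average speed over the whole trip: total miles divided by total hours.
average_speed = distance[hours[-1]] / hours[-1]

print(per_hour)
print(f"Average speed: {average_speed:.1f} mi/h")
```

On a line graph these differences appear as the steepness of each segment, so the steepest segment here is the one between hours $3$ and $4.$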

Type of Display	Best Used to...
Bar Graph	$\dots$ show values corresponding to specific categories
Box Plot	$\dots$ show measures of spread for a data set
Dot Plot	$\dots$ show how many times each value occurs in the set
Histogram	$\dots$ show the frequency of data divided into equal intervals
Line Graph	$\dots$ show change over a period of time or in respect to a different quantity

Country	Life Expectancy
United States	$76.3$
Japan	$84.5$
Germany	$80.9$
Brazil	$77.3$
China	$78.2$
India	$68.3$
Australia	$83.3$
South Africa	$62.4$
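
Because each value in this table belongs to a specific category (a country), a bar graph is a natural fit. The Python sketch below draws a plain-text version. The values are assumed to be in years, which the table does not state. The sorting and the scale of one `#` per $5$ units are choices made for this illustration.

```python
# Values copied from the Country/Life Expectancy table above (assumed to be years).
life_expectancy = {
    "United States": 76.3, "Japan": 84.5, "Germany": 80.9, "Brazil": 77.3,
    "China": 78.2, "India": 68.3, "Australia": 83.3, "South Africa": 62.4,
}

# Sort the categories from highest to lowest value to make comparison easier.
ranked = sorted(life_expectancy.items(), key=lambda item: item[1], reverse=True)

# Plain-text bar graph, roughly one '#' per 5 units.
for country, value in ranked:
    print(f"{country:<14}{'#' * round(value / 5)} {value}")

highest, lowest = ranked[0][0], ranked[-1][0]
print(f"Highest: {highest}, lowest: {lowest}")
```

Sorting is optional for a bar graph, since the categories have no natural order. It only makes the highest and lowest bars easier to spot.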

Situation $1$	A tech company recently launched a new smartphone model. They conduct a survey where customers rate their satisfaction with the phone on a scale of $1$ to $10.$
Situation $2$	A survey is conducted to gather data on the distribution of ages among participants in a community event.
Situation $3$	A teacher wants to analyze the distribution of test scores in her class by finding the median and quartiles of the scores.
Situation $4$	A study tracks the temperature variations over the course of a week in a particular city.

Type of Display	Best Used to...
Box Plot	$\dots$ show measures of spread for a data set
Line Graph	$\dots$ show change over a period of time or in respect to a different quantity
