Exploring Data Distributions: Histograms, Skewed and Symmetric Frequency Distribution

Average Main Dish Price (Dollars)
$10.12$	$9.29$	$8.29$	$9.78$	$10.69$
$9.68$	$12.09$	$8.94$	$10.81$	$8.62$
$11.39$	$12.62$	$8.71$	$10.74$	$10.52$
$10.77$	$10.15$	$9.18$	$8.45$	$9.52$
$11.89$	$9.77$	$9.44$	$13.24$	$11.01$
$10.62$	$9.38$	$12.15$	$9.68$	$9.60$
$10.32$	$11.31$	$11.41$	$8.62$	$9.27$
$10.96$	$9.18$	$10.28$	$10.71$	$10.02$

Skewed Distribution	Description
Skewed Left / Negatively Skewed	The distribution has a long left tail and the median is greater than the mean.
Skewed Right / Positively Skewed	The distribution has a long right tail and the median is less than the mean.

Game $1$				Game $2$
$32$	$21$	$27$	$46$	$114$	$87$	$96$	$92$
$9$	$16$	$19$	$19$	$101$	$111$	$80$	$106$
$40$	$28$	$42$	$36$	$85$	$112$	$117$	$94$
$11$	$38$	$23$	$28$	$62$	$43$	$106$	$66$
$8$	$18$	$26$	$59$	$104$	$51$	$76$	$91$
$62$	$40$			$111$	$78$
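
The mean and median comparison from the skewness table can be checked on the two games' scores. The following is a minimal Python sketch, not part of the original lesson. The helper name `skew_direction` is hypothetical, and comparing the mean with the median is only a rule of thumb, not a formal test of skewness.

```python
from statistics import mean, median

# Runs scored, transcribed row by row from the Game 1 and Game 2 columns.
game1 = [32, 21, 27, 46, 9, 16, 19, 19, 40, 28, 42, 36,
         11, 38, 23, 28, 8, 18, 26, 59, 62, 40]
game2 = [114, 87, 96, 92, 101, 111, 80, 106, 85, 112, 117, 94,
         62, 43, 106, 66, 104, 51, 76, 91, 111, 78]


def skew_direction(values):
    """Rule of thumb (hypothetical helper): compare the mean with the median."""
    m, med = mean(values), median(values)
    if m > med:
        return "skewed right"
    if m < med:
        return "skewed left"
    return "no skew indicated"


for name, scores in (("Game 1", game1), ("Game 2", game2)):
    print(name, round(mean(scores), 2), median(scores), skew_direction(scores))
```

Under this rule of thumb, Game 1 (mean about 29.45, median 27.5) comes out as skewed right, and Game 2 (mean about 90.14, median 93) as skewed left.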

Cricket Runs Scored in Game $1$
Number of Runs Scored	Frequency
$0 - 9$	$2$
$10 - 19$	$5$
$20 - 29$	$6$
$30 - 39$	$3$
$40 - 49$	$4$
$50 - 59$	$1$
$60 - 69$	$1$

Cricket Runs Scored in Game $2$
Number of Runs Scored	Frequency
$40 - 49$	$1$
$50 - 59$	$1$
$60 - 69$	$2$
$70 - 79$	$2$
$80 - 89$	$3$
$90 - 99$	$4$
$100 - 109$	$4$
$110 - 119$	$5$
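
As an illustration of how the two frequency tables can be built from the raw scores, here is a short Python sketch. It is an assumption-laden example rather than the lesson's own method: the function name `frequency_table` and the class width of 10 runs are choices made here to match the tables.

```python
from collections import Counter

# Runs scored, transcribed from the Game 1 and Game 2 data table.
game1 = [32, 21, 27, 46, 9, 16, 19, 19, 40, 28, 42, 36,
         11, 38, 23, 28, 8, 18, 26, 59, 62, 40]
game2 = [114, 87, 96, 92, 101, 111, 80, 106, 85, 112, 117, 94,
         62, 43, 106, 66, 104, 51, 76, 91, 111, 78]


def frequency_table(values, width=10):
    """Count values in classes of the given width: 0 - 9, 10 - 19, ..."""
    counts = Counter((v // width) * width for v in values)
    lows = range(min(counts), max(counts) + width, width)
    return {f"{low} - {low + width - 1}": counts.get(low, 0) for low in lows}


# Print a rough text histogram for Game 1; '#' marks one score.
for label, count in frequency_table(game1).items():
    print(f"{label:>9}  {'#' * count}")
```

Calling `frequency_table(game2)` gives the second table in the same way. The text histogram makes the long right tail of Game 1 visible without a plotting library.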

Retirement Age of NFL Players
Age	Frequency
$25 - 26$	$33$
$27 - 28$	$67$
$29 - 30$	$93$
$31 - 32$	$109$
$33 - 34$	$127$
$35 - 36$	$114$
$37 - 38$	$80$
$39 - 40$	$59$
$41 - 42$	$43$
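
One way to check whether this grouped data is roughly symmetric is to see if the estimated mean, the median class, and the modal class coincide. The sketch below is a hedged illustration: it assumes every player in a class retired at the class midpoint, which is an approximation introduced here and not stated in the lesson.

```python
# (lowest age, highest age, frequency) rows from the retirement-age table.
classes = [(25, 26, 33), (27, 28, 67), (29, 30, 93), (31, 32, 109),
           (33, 34, 127), (35, 36, 114), (37, 38, 80), (39, 40, 59),
           (41, 42, 43)]

total = sum(f for _, _, f in classes)

# Estimated mean: weight each class midpoint by its frequency (assumption).
estimated_mean = sum((lo + hi) / 2 * f for lo, hi, f in classes) / total

# Modal class: the class with the highest frequency.
modal_class = max(classes, key=lambda c: c[2])[:2]

# Median class: first class whose cumulative frequency reaches half the total.
running = 0
for lo, hi, f in classes:
    running += f
    if running >= total / 2:
        median_class = (lo, hi)
        break

print(total, round(estimated_mean, 2), modal_class, median_class)
```

With these assumptions the estimated mean is about 33.49, and both the median class and the modal class are 33 - 34, which is consistent with a roughly symmetric distribution.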

Color	Frequency
Red	$164$
Blue	$168$
Yellow	$168$
Pink	$166$
Green	$165$
Orange	$169$
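
A quick way to judge how close these color counts are to a uniform distribution is to compare each count with an equal share of the total. The snippet below is a minimal sketch. The variable names are illustrative, and the lesson does not specify any threshold for "close enough".

```python
# Frequencies from the color table.
color_counts = {"Red": 164, "Blue": 168, "Yellow": 168,
                "Pink": 166, "Green": 165, "Orange": 169}

total = sum(color_counts.values())
expected = total / len(color_counts)  # equal share per color

# Largest relative gap between an observed count and the equal share.
max_relative_gap = max(abs(c - expected) / expected
                       for c in color_counts.values())

print(total, round(expected, 2), f"{max_relative_gap:.1%}")
```

Every count lies within 2% of the equal share of about 166.67, so the bars of the corresponding histogram would all have nearly the same height.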

Exam Score	Frequency
$70 - 71$	$3$
$72 - 73$	$8$
$74 - 75$	$11$
$76 - 77$	$9$
$78 - 79$	$4$
$80 - 81$	$2$
$82 - 83$	$2$
$84 - 85$	$4$
$86 - 87$	$8$
$88 - 89$	$11$
$90 - 91$	$9$
$92 - 93$	$7$
$94 - 95$	$2$
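
The two peaks of this table can be located by scanning for classes whose frequency is higher than that of both neighbors. The following Python sketch assumes that simple definition of a peak. The helper `local_peaks` is hypothetical, and noisier data would need a more careful approach.

```python
# Score classes 70 - 71, 72 - 73, ..., 94 - 95 and their frequencies.
exam_classes = [f"{low} - {low + 1}" for low in range(70, 96, 2)]
frequencies = [3, 8, 11, 9, 4, 2, 2, 4, 8, 11, 9, 7, 2]


def local_peaks(counts):
    """Indices whose count is strictly greater than both neighbors."""
    # Pad both ends so the first and last classes have a neighbor to compare.
    padded = [float("-inf")] + list(counts) + [float("-inf")]
    return [i for i in range(len(counts))
            if padded[i] < padded[i + 1] > padded[i + 2]]


peaks = [exam_classes[i] for i in local_peaks(frequencies)]
print(peaks)
```

The scan finds two peaks, at 74 - 75 and 88 - 89, each with a frequency of 11, which matches the shape of a bimodal distribution.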

Ages of People Who Enter the Italian Restaurant on a Typical Day
$15$	$53$	$55$	$60$	$38$	$56$
$62$	$14$	$44$	$24$	$32$	$10$
$42$	$54$	$47$	$67$	$60$	$50$
$61$	$30$	$30$	$62$	$62$	$65$
$56$	$52$	$35$	$25$	$34$	$32$

Five Number Summary
Minimum Value	$10$
First Quartile	$32$
Median	$48.5$
Third Quartile	$60$
Maximum Value	$67$
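
The five-number summary can be reproduced from the list of ages. The sketch below assumes the common classroom convention in which the quartiles are the medians of the lower and upper halves of the sorted data. Other quartile conventions, such as interpolation, can give slightly different values. The function name `five_number_summary` is chosen here for illustration.

```python
from statistics import median

# Ages of customers, transcribed from the restaurant table.
ages = [15, 53, 55, 60, 38, 56, 62, 14, 44, 24, 32, 10,
        42, 54, 47, 67, 60, 50, 61, 30, 30, 62, 62, 65,
        56, 52, 35, 25, 34, 32]


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]
    # For an odd number of values, leave the middle value out of both halves.
    upper = data[half:] if len(data) % 2 == 0 else data[half + 1:]
    return (data[0], median(lower), median(data), median(upper), data[-1])


print(five_number_summary(ages))
```

For the 30 ages this returns `(10, 32, 48.5, 60, 67)`, in agreement with the table.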

Appropriate Measures of Center
Girls' Data Set	Boys' Data Set
Mean	Median

	Women	Men
Survey Size	$100$	$100$
Minimum	$\$18$	$\$8$
Maximum	$\$60$	$\$28$
$1^{st}$ Quartile	$\$30$	$\$14$
Median	$\$34$	$\$18$
$3^{rd}$ Quartile	$\$40$	$\$22$
Mean	$\$36$	$\$18$
Standard Deviation	$\$8$	$\$4$

a This data can be represented with a double box plot, which helps identify which of the given graphs accurately represents the data set. First, draw a number line that includes the minimum and maximum values of each gender's data set. Next, plot points above the number line for the given values of the five-number summary.

Next, draw the box for each plot using the first and third quartiles. Finally, draw a line through the median and the whiskers from the box to the minimum and maximum values of each data set.

Notice that this corresponds to the box plot in option D.

b In order to identify which of the given statements is correct, compare the center and spread of the data sets. Note that for the women's data set, the right whisker is longer than the left one and that the median is closer to the left whisker. This means that the data is skewed right, and the median best describes the center of the data.

Median of Women's Data Set: $\$34$

Conversely, for the men's data set, the whiskers are approximately equal and the median falls in the middle of the box. Therefore, this data is modeled by a symmetric distribution, and the mean best describes the center of the data.

Mean of Men's Data Set: $\$18$
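
The whisker and median comparison used in this part can be written out numerically. The sketch below is an illustration under simplifying assumptions: the helper `box_plot_shape` is hypothetical, and it tests for exact equality where the text only requires the whiskers to be approximately equal.

```python
# Five-number summaries (min, Q1, median, Q3, max) from the survey table.
survey = {"Women": (18, 30, 34, 40, 60), "Men": (8, 14, 18, 22, 28)}


def box_plot_shape(minimum, q1, med, q3, maximum):
    """Classify the shape from whisker lengths and the median's position."""
    left_whisker = q1 - minimum
    right_whisker = maximum - q3
    left_box, right_box = med - q1, q3 - med
    if right_whisker > left_whisker and left_box < right_box:
        return "skewed right"
    if left_whisker > right_whisker and left_box > right_box:
        return "skewed left"
    if left_whisker == right_whisker and left_box == right_box:
        return "symmetric"  # exact equality is a simplification
    return "unclear"


for group, summary in survey.items():
    print(group, box_plot_shape(*summary))
```

For the women, the whiskers have lengths 12 and 20 and the median sits 4 above the first quartile but 6 below the third, so the data is classified as skewed right. For the men, both whiskers have length 6 and the median is centered in the box, so the data is classified as symmetric.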

Notice that the median amount of money spent by women on clothes each month is almost twice the mean amount of money spent on clothes by men. Recall that the range of a data set is the difference between its maximum and minimum values. Using this information, compare the range and standard deviation of the data sets.

	Standard Deviation	Range
Women	$\$8$	$\$60 - \$18 = \$42$
Men	$\$4$	$\$28 - \$8 = \$20$

Both the standard deviation and the range are greater for women. This means that there is more variability in the amount of money spent by women.
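
As a cross-check of the spread comparison, both the range (maximum minus minimum) and the interquartile range (third quartile minus first quartile) can be computed from the survey table. This is a minimal sketch, and the dictionary layout is an assumption made for the example.

```python
# Values in dollars, taken from the survey table.
survey = {
    "Women": {"min": 18, "q1": 30, "q3": 40, "max": 60, "std": 8},
    "Men": {"min": 8, "q1": 14, "q3": 22, "max": 28, "std": 4},
}

spread = {
    group: {
        "range": s["max"] - s["min"],  # maximum minus minimum
        "iqr": s["q3"] - s["q1"],      # width of the box in a box plot
        "std": s["std"],
    }
    for group, s in survey.items()
}

for group, measures in spread.items():
    print(group, measures)
```

The women's data has a range of 42 and an interquartile range of 10, compared with 20 and 8 for the men, so every measure of spread considered here is greater for women.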

c To calculate how many of the women surveyed are expected to spend between $\$30$ and $\$40$ on clothes per month, consider the structure of a box plot. Each whisker represents $25\%$ of the data, and the box represents the middle $50\%$.

With this information in mind, the following statements are true.

$25\%$ of the women surveyed are expected to spend between $\$18$ and $\$30$.
$50\%$ of the women surveyed are expected to spend between $\$30$ and $\$40$.
$25\%$ of the women surveyed are expected to spend between $\$40$ and $\$60$.

This means that $50\%$ of the survey size needs to be calculated in order to determine the number of women who are expected to spend between $\$30$ and $\$40$ on clothes. Recall that $100$ women participated in the survey.

$100 \cdot 0.5 = 50$

Therefore, $50$ out of the $100$ women surveyed are expected to spend between $\$30$ and $\$40$ on clothes per month.
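
The same arithmetic extends to every section of the women's box plot. The short sketch below assumes the usual reading that each whisker covers about 25% of the data and the box about 50%. The labels are written here for illustration only.

```python
survey_size = 100  # women surveyed

# Share of the data covered by each part of the women's box plot.
sections = {
    "$18 to $30 (left whisker)": 0.25,
    "$30 to $40 (box)": 0.50,
    "$40 to $60 (right whisker)": 0.25,
}

expected = {label: round(survey_size * share)
            for label, share in sections.items()}

for label, count in expected.items():
    print(f"{label}: about {count} women")
```

The three expected counts are 25, 50, and 25, which add up to the 100 women surveyed.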

Average Main Dish Price (Dollars)
Price Range	Frequency
$8.00 - 8.99$	$6$
$9.00 - 9.99$	$12$
$10.00 - 10.99$	$13$
$11.00 - 11.99$	$5$
$12.00 - 12.99$	$3$
$13.00 - 13.99$	$1$
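
The frequency table can be derived from the 40 average main dish prices listed in this lesson by grouping on the whole-dollar part of each price. The following is a small Python sketch of that grouping. The variable names and the one-dollar class width are choices made here to match the table.

```python
import math
from collections import Counter

# The 40 average main dish prices (dollars), transcribed row by row.
prices = [10.12, 9.29, 8.29, 9.78, 10.69, 9.68, 12.09, 8.94, 10.81, 8.62,
          11.39, 12.62, 8.71, 10.74, 10.52, 10.77, 10.15, 9.18, 8.45, 9.52,
          11.89, 9.77, 9.44, 13.24, 11.01, 10.62, 9.38, 12.15, 9.68, 9.60,
          10.32, 11.31, 11.41, 8.62, 9.27, 10.96, 9.18, 10.28, 10.71, 10.02]

# Group each price by its whole-dollar part: 8.29 falls in 8.00 - 8.99.
counts = Counter(math.floor(p) for p in prices)
table = {f"{d}.00 - {d}.99": counts[d] for d in sorted(counts)}

for price_range, frequency in table.items():
    print(f"{price_range:>13}  {'#' * frequency}")
```

The resulting counts are 6, 12, 13, 5, 3, and 1, matching the table, and the printed bars give a rough text version of the histogram.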


Distribution	Measure of Center	Measure of Variation
Symmetric	Mean	Standard deviation
Skewed	Median	Five-number summary
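
The pairing in this table can be expressed as a small helper that reports the matching measures once the shape of the distribution has been judged. This is a sketch under stated assumptions: the function `summarize` is hypothetical, the quartiles use the median-of-halves convention, and `statistics.stdev` computes the sample standard deviation (use `pstdev` if the population version is intended).

```python
from statistics import mean, median, stdev


def summarize(values, shape):
    """Return the center and variation suited to the given shape."""
    data = sorted(values)
    if shape == "symmetric":
        return {"center": mean(data), "variation": stdev(data)}
    # Skewed: median plus five-number summary (min, Q1, median, Q3, max).
    half = len(data) // 2
    lower, upper = data[:half], data[-half:]
    five = (data[0], median(lower), median(data), median(upper), data[-1])
    return {"center": median(data), "variation": five}


# Game 1 cricket scores from this lesson, used here as skewed example data.
game1 = [32, 21, 27, 46, 9, 16, 19, 19, 40, 28, 42, 36,
         11, 38, 23, 28, 8, 18, 26, 59, 62, 40]
print(summarize(game1, "skewed"))
```

For the Game 1 scores, treated as skewed, this reports a median of 27.5 and the five-number summary `(8, 19, 27.5, 40, 62)`.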


Types of Distributions of Data

Catch-Up and Review

Giving Meaning to a Data Set

Frequency Distribution

Symmetric Frequency Distribution

Skewed Frequency Distribution

Finding the Distribution of Runs Scored in Cricket

Analyzing the Retirement Ages of NFL Players

Uniform and Bimodal Distributions

Uniform Frequency Distribution

Bimodal Distribution

Frequency Distributions of Data From Different Sources

Box-and-Whisker Plots as Distributions

Analyzing the Ages of Customers at an Italian Restaurant

Comparing Data From Two Groups

Using Box Plots to Interpret Results From a Survey

Finding Insights and Drawing Conclusions From Histograms
