Mastering Statistical Measures: Mean, Median, and Range

Lifespan of Cats (in years)
15	11	14	15
14	17	13

Lifespan of Cats (in years)

Actor	Height
Madzia	5ft 4in.
Magda	5ft 2in.
Ignacio	6ft 1.6in.
Henrik	5ft 10in.
Ali	6ft 1in.
Diego	5ft 2in.
Miłosz	5ft 2in.
Paulina	5ft 3in.
Aybuke	5ft 7in.
Mateusz	6ft 1.2in.
Gamze	5ft 3in.
Marcin	5ft 7in.
Marcial	5ft 8in.
Heichi	5ft 5in.
Arkadiusz	5ft 6in.
Enrique	5ft 10.5in.
Aleksandra	5ft 4in.
Ashli	5ft 4in.
Jordan	5ft 5in.
Paula	5ft 2in.
MacKenzie	5ft 6in.
Joe	6ft 1in.
Flavio	5ft 10in.
Jeremy	5ft 4in.
Umut	6ft 1in.

Actor

Height

Madzia

5ft 4in.

Magda

5ft 2in.

Ignacio

6ft 1.6in.

Henrik

5ft 10in.

Ali

6ft 1in.

Diego

5ft 2in.

Miłosz

5ft 2in.

Paulina

5ft 3in.

Aybuke

5ft 7in.

Mateusz

6ft 1.2in.

Gamze

5ft 3in.

Marcin

5ft 7in.

Marcial

5ft 8in.

Heichi

5ft 5in.

Arkadiusz

5ft 6in.

Enrique

5ft 10.5in.

Aleksandra

5ft 4in.

Ashli

5ft 4in.

Jordan

5ft 5in.

Paula

5ft 2in.

MacKenzie

5ft 6in.

Joe

6ft 1in.

Flavio

5ft 10in.

Jeremy

5ft 4in.

Umut

6ft 1in.

Lifespan of Dogs (in years)
10	21	16	15
13	15	17	11

Lifespan of Dogs (in years)

	Range	IQR
With Outliers	68	15
Without Outliers	44	17

Range

IQR

With Outliers

Without Outliers

Data Value	Absolute Value of Difference
82	\|82- 84\| = 2
85	\|85- 84\| = 1
90	\|90- 84\| = 6
75	\|75- 84\| = 9
95	\|95- 84\| = 11
85	\|85- 84\| = 1
90	\|90- 84\| = 6
70	\|70- 84\| = 14

Data Value

Absolute Value of Difference

|82- 84| = 2

|85- 84| = 1

|90- 84| = 6

|75- 84| = 9

|95- 84| = 11

|85- 84| = 1

|90- 84| = 6

|70- 84| = 14

Mean Absolute Deviation
An average of how much data values differ from the mean.

Data Value	Absolute Value of Difference
9	\|9- 13\| = 4
9	\|9- 13\| = 4
12	\|12- 13\| = 1
15	\|15- 13\| = 2
14	\|14- 13\| = 1
11	\|11- 13\| = 2
14	\|14- 13\| = 1
15	\|15- 13\| = 2
15	\|15- 13\| = 2
10	\|10- 13\| = 3
16	\|16- 13\| = 3
16	\|16- 13\| = 3

Data Value

Absolute Value of Difference

|9- 13| = 4

|12- 13| = 1

|15- 13| = 2

|14- 13| = 1

|11- 13| = 2

|14- 13| = 1

|15- 13| = 2

|10- 13| = 3

|16- 13| = 3

Measures of Center	Measures of Spread
Mean Mode Median	Range Interquartile Range Mean Absolute Deviation Standard Deviation

Measures of Center

Measures of Spread

Mean
Mode
Median

Range
Interquartile Range
Mean Absolute Deviation
Standard Deviation

What is the effect of an outlier on the range of a data set?

We are asked to determine how an outlier affects the range of a data set. First, let's recall that the range is the difference of the maximum and minimum values Range = Max Value-Min Value Now, the data value that is significantly different from other values is an outlier. Algebraically, the value is an outlier if it is less than the first quartile minus 1.5 times the interquartile range. Also, the value is an outlier if it is greater than the third quartile plus 1.5 times the interquartile range.

This means that outliers are either the least data values or the greatest data values. Therefore, they affect range. Let's take a look at the example data set and find its median and both quartiles.

The interquartile range (IQR) for this data set is 25- 15= 10. Let's find what numbers would be outliers for this data set using this information. Q_1-1.5(IQR) &= 15-1.5( 10)=0 Q_3+1.5(IQR) &= 25+1.5( 10) = 40 For our example data set, each data value that is less than 0 or greater than 40 would be considered an outlier. Now, let's compare the range of the original data set with the ranges of the data sets in which we change the least or the greatest values so that they are outliers.

Data Set	Outliers	Range
14,15,17,19,23,25, 26	No Outliers	26- 14=12
-1,15,17,19,23,25, 26	-1	26-( -1)=27
14,15,17,19,23,25, 42	42	42- 14=28
-2,15,17,19,23,25, 41	-2 and 41	41-( -2)=43

We can see that each time we have outliers in our data set, the range is greater than the range of the original data set. In general, when we have an outlier in a data set, it increases the range of this set. The answer is A.

Vincenzo is assigned to create a set of data with 7 values that has a mean of 40, a median of 35, a range of 50, and an interquartile range of 35. The steps Vincenzo followed when creating this data set are shown below.

In which step, if any, did Vincenzo make a mistake?

We know that Vincenzo wants to create a data set with seven values that have defined measures.

Measure	Value
Mean	40
Median	35
Range	50
Interquartile Range	35

Vincenzo draws seven horizontal lines, each representing a data value. We can imagine that the lines represent the ordered data set.

We will examine each step one by one.

Step I

Let's remember what the median is.

Median |- For a set with an odd number of values, the median is the middle value. For a set with an even number of values, the median is the mean of the two middle values.

We have seven values, which is an odd number. This means that the median is the middle value. We also know that the median is equal to 35. Therefore, the middle value must be 35.

Vincenzo's first step is correct. Let's move on the next step.

Step II

We know that the range of the data set is 50. This means that the greatest value is 50 more than the least value. Range=Max Value−Min Value Vincenzo set the smallest value of the data to 20. Then, we greatest value must be 50 more, which is 70.

There is no error in the second step either.

Step III

Next, let's remember what the interquartile range is.

Interquartile Range |- The interquartile range, or the IQR is the difference of the third quartile and the first quartile.

This means that the difference between the third quartile, Q_3 and the first quartile, Q_1 is 35. Q_3 - Q_1 = 35 In this case, the second value is the first quartile and the sixth value is the third quartile.

Vincenzo choose a value for Q_1, which is 25. Notice that Q_1 is greater than the least value and less than the median. Since the interquartile range is 35, the third quartile must be 25+35 = 60.

There is no error in this step either.

Step IV

The last condition is that the mean should be equal to 40. Let's remember what the mean is.

Mean |- The mean of the data set is the sum of the data divided by the number of data values.

Let's name the missing values x, and y.

Now, we can add all values and divide the sum by 7. The result should be equal to 40.

The sum of these values must be 70. However, Vincenzo chose these values 30 and 50, which add up to 80. Third Value && Sixth Value && Sum 30 & +& 50 &=& 80 Therefore, the last step is incorrect. The answer is IV. Vincenzo chose the third data value correctly because it is between 25 and 35. The value of y should be between 35 and 60 and x plus y must be 70.

We can see that for x = 30, the value of y is 40.

This data meets the specified conditions. It is important to note that there can be numerous different data sets that meet these conditions.

	24 Theory slides
	13 Exercises - Grade E - A
	Each lesson is meant to take 1-2 classroom sessions

Statistical Measures

Catch-Up and Review

Hint

Solution

Hint

Solution

Range for the Weights of Cats

Range for the Weights of Dogs

Interquartile Range of Cat Weights

Interquartile Range of Dog Weights

What Does Significantly Different Mean?

Hint

Solution

Finding Range

Finding Interquartile Range

Hint

Solution

Hint

Solution

Step I

Step II

Step III

Step IV

Statistical Measures

Recommended exercises