Sign In
When is a value considered an outlier?
| Data set | Mean | Median | Mode |
|---|---|---|---|
| With Outlier | 103 | 85 | 85 |
| Without Outlier (211) | 85 | 85 | 85 |
The Most Affected Measure: Mean
If a value in a data set is more than 1.5 times the interquartile range away from the lower or upper quartiles, it is considered an outlier. This is why, to identify any outliers, we first have to find these statistical measures, including any outliers.
We want to find the mean, median, and mode of the given data set. 85, 77, 211, 88, 91, 84, 85 Let's begin by calculating the mean.
The mean of a data set is the sum of the values divided by the total number of values in the set. Let's start by calculating the sum of the given values. 85+ 77+ 211+ 88+ 91+ 84+ 85 = 721 There are 7 values in our set, so we have to divide the sum by 7. Mean: 721/7=103 The mean is 103. We can continue by finding the median.
Median : 85 The last measure we need is the mode. Let's find it!
The mode is the value or values that appear most often in a set of data. Arranging the data set from least to greatest makes it easier to see how often each value appears. Let's arrange the values before we find the mode. 77, 84, 85, 85, 88, 91, 211 We can see that the most common value in the given data set is 85. This is the mode of our data set. Mode: 85
To identify any outliers, we have to calculate the interquartile range (IQR). To do this let's recall some information about the quartiles first!
Q_1= 84, IQR= 7
Multiply
Subtract term
Q_1= 57.5, IQR= 7
Multiply
Add terms
Let's repeat the process, this time excluding 211 from the data set.
Let's start by calculating the sum of the given values without 211. 77+ 84+ 85+ 85+ 88+ 91 = 510 There are 6 values in our set, because we excluded an outlier — 211. Mean: 510/6=85
Recall the ordered set with 211 excluded. 77, 84, 85, 85, 88, 91 This time the number of values in our set is 6. This is why the median is the mean of the two middle values. Median : 85+ 85/2 = 85
Let's take a look at the ordered set with outlier excluded one more time! 77, 84, 85, 85, 88, 91 We can see that the most common value in the given data set is still 85. This is the mode of our data set. Mode: 85
Finally, we summarize our findings in the table below so it is easier to compare the results.
| Data set | Mean | Median | Mode |
|---|---|---|---|
| With Outlier | 103 | 85 | 85 |
| Without Outlier (211) | 85 | 85 | 85 |
We can see that removing the outlier modified only the mean. The mean without outlier decreased. This is because the outlier was excluded with the maximum value.