Sign In
When is a value considered an outlier?
| Data set | Mean | Median | Mode |
|---|---|---|---|
| With Outlier | 48.5 | 53 | No mode |
| Without Outlier (17) | 53 | 54 | No mode |
The Most Affected Measure: Mean
If a value in a data set is more than 1.5 times the interquartile range away from the lower or upper quartiles, it is considered an outlier. This is why, to identify any outliers, we first have to find these statistical measures, including any outliers.
We want to find the mean, median, and mode of the given data set. 45, 52, 17, 63, 57, 42, 54, 58 Let's begin by calculating the mean.
The mean of a data set is the sum of the values divided by the total number of values in the set. Let's start by calculating the sum of the given values. 45+ 52+ 17+ 63+ 57+ 42+ 54+ 58 = 388 There are 8 values in our set, so we have to divide the sum by 8. Mean: 388/8=48.5 The mean is 48.5. We can continue by finding the median.
Median : 52+ 54/2 = 53 The last measure we need is the mode. Let's find it!
The mode is the value or values that appear most often in a set of data. Arranging the data set from least to greatest makes it easier to see how often each value appears. Let's take a look at the ordered set one more time! 17, 42, 45, 52, 54, 57, 58, 63 Since the data set does not contain any repeated values, there is no mode. Mode: This set does not have a mode
To identify any outliers, we have to calculate the interquartile range (IQR). To do this, let's recall some information about the quartiles first!
Q_1= 43.5, IQR= 14
Multiply
Subtract term
Q_1= 57.5, IQR= 14
Multiply
Add terms
Let's repeat the process, this time excluding 17 from the data set.
Let's start by calculating the sum of the given values without 17. 42+ 45+ 52+ 54+ 57+ 58+ 63 = 371 There are 7 values in our set, because we excluded an outlier — 17. Mean: 371/7=53
Recall the ordered set with 17 excluded. 42, 45, 52, 54, 57, 58, 63 This time the number of values in our set is 7. This is why there is a middle value. The median is 54. Median : 54
Since the data set still does not contain any repeated values, there is no mode. Mode: This set does not have a mode
Finally, we summarize our findings in the table below so it is easier to compare the results.
| Data set | Mean | Median | Mode |
|---|---|---|---|
| With Outlier | 48.5 | 53 | No mode |
| Without Outlier (17) | 53 | 54 | No mode |
We can see that removing the outlier modified the mean and the median. They both increased but the mean increased more than the median. This set without the outlier still does not have a mode.