{{ item.displayTitle }}

No history yet!

equalizer

rate_review

{{ r.avatar.letter }}

{{ u.avatar.letter }}

+

{{ item.displayTitle }}

{{ item.subject.displayTitle }}

{{ searchError }}

{{ courseTrack.displayTitle }} {{ statistics.percent }}% Sign in to view progress

{{ printedBook.courseTrack.name }} {{ printedBook.name }} {{ greeting }} {{userName}}

{{ 'ml-article-collection-banner-text' | message }}

{{ 'ml-article-collection-your-statistics' | message }}

{{ 'ml-article-collection-overall-progress' | message }}

{{ 'ml-article-collection-challenge' | message }} Coming Soon!

{{ 'ml-article-collection-randomly-challenge' | message }}

{{ 'ml-article-collection-community' | message }}

{{ 'ml-article-collection-follow-latest-updates' | message }}

mean $=number of valuessum of values $

Suppose a data set represents the heights of different towers. The mean of this data set gives an idea of a **typical** height. Calculating the mean could be seen as rearragning the blocks so that all the towers have the same height.

Animate the mean value

Reset

After the blocks are rearranged, the towers each have a height of 4. Therefore, the mean is 4. If the heights are written as x, then the mean is sometimes written as $xˉ.$ The towers' mean height can then be written as $xˉ=4.$

When on vacation in Mexico, Peter finds a rose species he has never seen before. He decides to study how many petals each flower has. The result of his study is this.
How many petals do the flowers on average have?

Show Solution

A measure of spread is a way of quantifying how spread out, or different, the points in a data set are. A small spread means data points are similar, while a large spread means they are different. This is illustrated by the two data sets below. Both have a mean, median and mode of 3, but, we can assume the second data set has a larger spread because of how different its data points are.

Some commonly used measures of spread are range, mean absolute deviation, standard deviation, and interquartile range. These are often used together with a measure of center, to give an idea both of what a typical value is and how much the data can be expected to deviate from it.

The standard deviation, $σ,$ of a data set is calculated using the rule
where n is the number of values in the data set and $xˉ$ is the mean of the set. Performing this calculation in one step makes for a convoluted expression. Therefore, it is best divided into a few, smaller steps. Consider the following data set as an example.

### 2

### 3

### 4

### 5

Finally, take the square root of the just found quotient to get the standard deviation. Here, the fraction is used instead of the quotient, to avoid rounding errors.
Thus, a randomly chosen value from this data set is expected to deviate roughly 4 units from the mean.

1,5,3,8,3,12

Find the deviation of each data value, $x−xˉ$

For each data value, $x−xˉ$ can now be calculated and added to a table. This shows how much each data point varies from the mean.

x | $x−xˉ$ |
---|---|

1 | 1−4=-3 |

5 | 5−4=1 |

3 | 3−4=-1 |

8 | 8−4=4 |

3 | 3−4=-1 |

12 | 12−4=8 |

Square the deviations

Square the deviations, and add them to a new column in the table.

x | $x−xˉ$ | $(x−xˉ)_{2}$ |
---|---|---|

1 | -3 | (-3)2=9 |

5 | 1 | 12=1 |

3 | -1 | (-1)2=1 |

8 | 4 | 42=16 |

3 | -1 | (-1)2=1 |

12 | 8 | 82=64 |

Find the mean of the squared deviations

Square-root the mean of the squared deviations

{{ 'mldesktop-placeholder-grade' | message }} {{ article.displayTitle }}!

{{ focusmode.exercise.exerciseName }}