{{ 'ml-label-loading-course' | message }}
{{ toc.name }}
{{ toc.signature }}
{{ tocHeader }} {{ 'ml-btn-view-details' | message }}
{{ tocSubheader }}
{{ 'ml-toc-proceed-mlc' | message }}
{{ 'ml-toc-proceed-tbs' | message }}
Lesson
Exercises
Recommended
Tests
An error ocurred, try again later!
Chapter {{ article.chapter.number }}
{{ article.number }}. 

{{ article.displayTitle }}

{{ article.intro.summary }}
Show less Show more expand_more
{{ ability.description }} {{ ability.displayTitle }}
Lesson Settings & Tools
{{ 'ml-lesson-number-slides' | message : article.intro.bblockCount }}
{{ 'ml-lesson-number-exercises' | message : article.intro.exerciseCount }}
{{ 'ml-lesson-time-estimation' | message }}
 Normal Distribution
Reference

Normal Distribution

Concept

Normal Distribution

A normal distribution is a type of probability distribution where the mean, the median, and the mode are all equal to each other. The graph that represents a normal distribution is called a normal curve and it is a continuous, bell-shaped curve that is symmetric with respect to the mean of the data set.

A normal curve with mean 950 and standard deviation 25.

This type of distribution is the most common continuous probability distribution that can be observed in real life. When a normal distribution has a mean of and standard deviation of it is called a standard normal distribution. A normal distribution can be standardized by transforming each of its values into their corresponding scores.

Normal Curve with mean 0 and standard deviation 1

The total area under the normal curve is or Because of this, the area under the normal curve in a certain interval represents the percentage of data within that interval or the probability of randomly selecting a value that belongs to that interval. The Empirical Rule can be used to determine the area under the normal curve at specific intervals.

Example

Consider the weights of oranges as an example of normally distributed data. The mean weight of an orange is about grams and the standard deviation is approximately grams. The distribution of a sample of weights of randomly chosen oranges is described by the following histogram.

Histogram
The histogram will look more and more like a normal curve as more and more observations are made. In addition to this example, topics such as human height or income also tend to be normally distributed. It is also worth noting that not all data sets are normally distributed. If the mean and median are not equal, then the data set is skewed.
Concept

Empirical Rule

In statistics, the Empirical Rule, also known as the rule, is a shorthand used to remember the percentage of values that lie within certain intervals in a normal distribution. The rule states the following three facts.

  • About of the values lie within one standard deviation of the mean.
  • About of the values lie within two standard deviations of the mean.
  • About of the values lie within three standard deviations of the mean.
These three facts can be confirmed by observing the area under the normal curve that corresponds to a normal distribution with mean and standard deviation
empirical rule button graph
According to this rule, almost all the values observed lie within three standard deviations of the mean. For this reason, the rule is also called the three-sigma rule. It is worth noting that these facts were observed based on empirical evidence, which is why it is called the Empirical Rule.
Concept

-Score

The score, also known as the value, represents the number of standard deviations that a given value is from the mean of a data set. The following formula can be used to convert any value into its corresponding score.


Here, represents the mean and the standard deviation of the distribution. The value corresponding to a sample mean is called a statistic and is calculated using a similar formula.


In this formula, is the standard deviation of the sample, is the sample size, and is the population mean.

Example

Consider a distribution with mean and standard deviation The score corresponding to is computed as follows.
Consequently, is standard deviations to the left of the mean. The scores can be used to standardize a normal distribution. Then, for a random value of a standard normal distribution, the Standard Normal Table can be used to determine the corresponding area under the curve.
Memo

Standard Normal Table

The left-hand column gives the whole part of while the top row gives the decimal part of

The applet calculates the area below the standard normal curve and to the left of the entered score. It accepts scores up to two decimal places.

Area below the standard normal curve and to the left of a z-score
Loading content