Sign In
| 13 Theory slides |
| 8 Exercises - Grade E - A |
| Each lesson is meant to take 1-2 classroom sessions |
Here are a few recommended readings before getting started with this lesson.
Understanding Probability
Understanding Descriptive Measures
Understanding Types of Data
Other Recommended Readings
A random variable assigns a numerical value to an outcome of a probability experiment. In many situations, it is important to know how likely it is that a random variable will take a specific value. This can be represented by listing or graphing the probability of each value of a random variable. This is called a probability distribution.
A probability distribution of a random variable X is a function that gives the probability of each outcome in the sample space. It can be represented by tables, equations, or graphs. A probability distribution needs to satisfy two conditions to be valid.
Consider the roll of a pair of standard dice. Let X be the random variable that represents the sum of the two dice. By the fundamental counting principle, since rolling each die has 6 possible outcomes, there are a total of 6⋅6=36 possible results. Additionally, the possible values of X are integers from 2 to 12.
A table that represents the theoretical probability distribution of X will now be created. Frequencies represent the number of dice roll results that add up to the given values x of the random variable X. The frequency is divided by 36 to determine the theoretical probability of each outcome.
X=Sum of Two Dice | ||
---|---|---|
x | Frequency | P(X=x) |
2 | 1 | 361≈0.028 |
3 | 2 | 362≈0.056 |
4 | 3 | 363≈0.083 |
5 | 4 | 364≈0.111 |
6 | 5 | 365≈0.139 |
7 | 6 | 366≈0.167 |
8 | 5 | 365≈0.139 |
9 | 4 | 364≈0.111 |
10 | 3 | 363≈0.083 |
11 | 2 | 362≈0.056 |
12 | 1 | 361≈0.028 |
Izabella is a big soccer fan. One weekend, she invited her friend Dylan to watch a world championship match together at her house. During the coin toss ceremony, Izabella asked Dylan about the number of heads they will obtain if they toss a fair coin four times.
Let X be a random variable that represents the number of heads in four coin flips. Help Izabella and Dylan solve the following problems and determine whether they can predict the number of times the experiment results in heads.
Number of Heads, x | Tally | Frequency |
---|---|---|
0 | ∣∣∣∣ ∣ | 6 |
1 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣ | 22 |
2 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣ | 37 |
3 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣ | 28 |
4 | ∣∣∣∣ ∣∣ | 7 |
Use this data to find the experimental probability of each possible value of X.
X=Number of Heads | |||||
---|---|---|---|---|---|
x | 0 | 1 | 2 | 3 | 4 |
P(X=x) | 0.0625 | 0.25 | 0.375 | 0.25 | 0.0625 |
X=Number of Heads | |||||
---|---|---|---|---|---|
x | 0 | 1 | 2 | 3 | 4 |
P(X=x) | 0.06 | 0.22 | 0.37 | 0.28 | 0.07 |
X=Number of Heads | |||||
---|---|---|---|---|---|
x | 0 | 1 | 2 | 3 | 4 |
Possible Outcomes | TTTT | TTTH, TTHT, THTT, HTTT | HHTT, HTTH, TTHH, HTHT, THTH, THHT | HHHT, HHTH, HTHH, THHH | HHHH |
Frequency | 1 | 4 | 6 | 4 | 1 |
X=Number of Heads | |||||
---|---|---|---|---|---|
x | 0 | 1 | 2 | 3 | 4 |
Frequency | 1 | 4 | 6 | 4 | 1 |
P(X=x) | 161 | 164=41 | 166=83 | 164=41 | 161 |
This table describes the theoretical probabilities associated with tossing a fair coin four times. Since only the theoretical probability is required, the Frequency
column can be skipped.
X=Number of Heads | |||||
---|---|---|---|---|---|
x | 0 | 1 | 2 | 3 | 4 |
P(X=x) | 0.0625 | 0.25 | 0.375 | 0.25 | 0.0625 |
Number of Heads, x | Tally | Frequency |
---|---|---|
0 | ∣∣∣∣ ∣ | 6 |
1 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣ | 22 |
2 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣ | 37 |
3 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣ | 28 |
4 | ∣∣∣∣ ∣∣ | 7 |
Calculate the experimental probability of each possible outcome by dividing its frequency by the total number of trials, 100.
Number of Heads, x | Tally | Frequency | P(X=x) |
---|---|---|---|
0 | ∣∣∣∣ ∣ | 6 | 1006 |
1 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣ | 22 | 10022 |
2 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣ | 37 | 10037 |
3 | ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣∣ ∣∣∣ | 28 | 10028 |
4 | ∣∣∣∣ ∣∣ | 7 | 1007 |
Since only the experimental probability is required, the Tally
and Frequency
columns can be skipped. Next, write the table horizontally.
X=Number of Heads | |||||
---|---|---|---|---|---|
x | 0 | 1 | 2 | 3 | 4 |
P(X=x) | 0.06 | 0.22 | 0.37 | 0.28 | 0.07 |
The expected value of a random variable X is the average of the possible outcomes of a random variable. It is used to describe the center of a probability distribution. For a discrete random variable, the expected value E(X) is given by the weighted mean.
E(X)=i=1∑nxi⋅P(X=xi)
In this formula, xi represents a specific outcome, P(X=xi) corresponds to the associated probability of xi, and n is the number of all possible outcomes. According to the law of large numbers, when considering a sequence of random variables, its average tends to the expected value under specific conditions.
n→+∞limSn=μ
The expected value is commonly used with a measure of variation such as the variance or standard deviation to determine how outcome will differ from the expected value.
The standard deviation of a random variable is a measure of variation that describes how spread out the outcomes of a random variable X are from its expected value E(X). The standard deviation is represented by the Greek letter σ — read as sigma
— and is given by the square root of the variance of X.
In this formula, xi is a specific outcome and P(X=xi) is the probability of xi.
Let X be the random variable representing the number of cars sold on a given day in a car dealership. The table below shows the probability distribution of X.
x | 0 | 1 | 2 | 3 | 4 |
---|---|---|---|---|---|
P(X=x) | 151 | 153 | 156 | 153 | 152 |
Substitute values
a⋅cb=ca⋅b
a0=0
Identity Property of Addition
Add fractions
Use a calculator
Round to 2 decimal place(s)
xi | [xi−E(X)]2 | [xi−E(X)]2⋅P(X=xi) |
---|---|---|
0 | (0−2.13)2=4.5369 | 4.5364⋅151≈0.3025 |
1 | (1−2.13)2=1.2769 | 1.2769⋅153≈0.2554 |
2 | (2−2.13)2=0.0169 | 0.0169⋅156≈0.0068 |
3 | (3−2.13)2=0.7569 | 0.7569⋅153≈0.1514 |
4 | (4−2.13)2=3.4969 | 3.4969⋅152≈0.4663 |
Variance σ2 | ≈1.1824 |
Finally, calculate the square root of the variance to get the standard deviation of X.
Izabella's aunt Magdalena owns a clothing store. She needs to increase stock in her shop and plans to invest $15000 in one of the two collections that were offered to her by well-known brands. Each brand claims that they have a great expected rate of return. Their probability distributions are described below.
Dylan and Izabella want to help Magdalena make the best decision. They decided to use their recently acquired knowledge about the expected value and standard deviation of a probability distribution to analyze the offers. Help them answer the following questions and give the best advice to Magdalena.
Substitute values
Multiply
Add and subtract terms
xi | [xi−E(X)]2 | [xi−E(X)]2⋅P(X=xi) |
---|---|---|
1200 | (1200−1125)2=5625 | 5625⋅0.5=2812.5 |
1800 | (1800−1125)2=455625 | 455625⋅0.2=91125 |
900 | (900−1125)2=50625 | 50625⋅0.2=10125 |
-150 | (-150−1125)2=1625625 | 1625625⋅0.1=162562.5 |
Sum of Values | 266625 |
Substitute values
Multiply
Add and subtract terms
xi | [xi−E(X)]2 | [xi−E(X)]2⋅P(X=xi) |
---|---|---|
3600 | (3600−1145)2=6027025 | 6027025⋅0.3=1808107.5 |
2850 | (2850−1145)2=2907025 | 2907025⋅0.1=290702.5 |
-300 | (-300−1145)2=2088025 | 2088025⋅0.4=835210 |
-500 | (-500−1145)2=2706025 | 2706025⋅0.2=541205 |
Sum of Values | 3475225 |
The expected value and the standard distribution of each probability distribution have been calculated. The following table summarizes these measures.
Measures of the Probability Distributions | ||
---|---|---|
Expected Value of Collection I | 1125 | |
Expected Value of Collection II | 1145 | |
Standard Deviation of Collection I | ≈516.23 | |
Standard Deviation of Collection II | ≈1864.20 |
The outcomes of many experiments can be reduced to two possibilities, success or failure. If two more conditions are satisfied, these experiments can be modeled by a binomial experiment.
A binomial experiment is a probability experiment that has the following three properties.
Note that many probability experiments can be reduced so that they satisfy the conditions of a binomial experiment. In case of rolling a die, there are six possible outcomes. However, they can be divided into two groups — even numbers and odd numbers, for example.
Is there a fixed number of trials for each experiment? How many outcomes are possible? Does the probability of each outcome remain constant for each trial? Are trials independent?
Start by recalling the conditions that a binomial experiment should satisfy.
Analyze each situation one at a time to see if it follows all of the conditions of a binomial experiment.
This situation has eight trials of selecting one scratch-off cards at random.
Note that height can vary for every people surveyed.
Because there are 100 possible answers, it is likely that more than 2 different outcomes will occur. This means that this situation does not represent a binomial experiment.
This situation has a fixed number of trials because it involves asking 20 people if their blood type is O.
All four situations have already been analyzed and the results are summarized in the following table.
Experiment | Is It a Binomial Experiment? |
---|---|
The Scratch-off Cards | ✓ |
Heights of 100 People | × |
Rolling a Die | × |
Blood Type O? | ✓ |
Because binomial experiments can simplify many complex situations, it is essential to determine how likely it is to obtain a specific number of successes out of n trials in a given experiment. Also, the expected value, or center of the distribution, will be presented.
The binomial distribution is the probability distribution that describes the number of successes x out of n binomial trials. The trials must satisfy three conditions.
Let X be a random variable representing the total number of successes among n trials. The possible values of X are x=0, 1, 2, …, n. The binomial probability formula can be used to determine the probability of x successes among n trials P(X=x).
P(X=x)=nCxpxqn−x
In this formula, nCx is the binomial coefficient and p and q are the probabilities of success and failure, respectively. Additionally, the expected value of X can be determined by the product of the number of trials n and the probability of success p.
E(X)=np
This means that the expected number of successes in n trials is given by np.
Consider the experiment of drawing 1 card from a standard deck of cards with replacement.
x | 5Cx(41)x(43)5−x | P(X=x)=5Cx(41)x(43)5−x |
---|---|---|
0 | 5C0(41)0(43)5−0=1024243 | ≈0.237 |
1 | 5C1(41)1(43)5−1=1024405 | ≈0.396 |
2 | 5C2(41)2(43)5−2=1024270 | ≈0.264 |
3 | 5C3(41)3(43)5−3=102490 | ≈0.088 |
4 | 5C4(41)4(43)5−4=102415 | ≈0.015 |
5 | 5C5(41)5(43)5−5=10241 | ≈0.001 |
C(n,k)=k!(n−k)!n!
Write as a product
Cancel out common factors
Simplify quotient
Rewrite x as (x−1)+1
a1+m=a⋅am
ca⋅b=a⋅cb
Commutative Property of Multiplication
Factor out np
Rewrite n−x as n−1−(x−1)
Substitute values
C(n,k)=k!(n−k)!n!
Let X be the random variable representing the number of successes in n binomial trials. The binomial probability P(X=x) can be calculated by using the following formula.
P(X=x)=nCxpxqn−x
In this formula, P(X=x) is the probability that the random variable X is equal to x, which means that there are exactly x successes. Additionally, p and q are the probabilities of success and failure, respectively, and nCx is the binomial coefficient.
This proof will begin by calculating the probability of obtaining a fixed sequence with x successes in n independent trials. The number of all possible sequences with x successes will then be calculated. Finally, joining the first and second parts will give the formula.
n=4, r=2
Subtract term
Write as a product
Cross out common factors
Cancel out common factors
Multiply
2!=2
Calculate quotient
Dylan is taking a test that consists of five multiple-choice questions. Each question has five answer choices, only one of which is correct. He did not have time to study, so he decides to guess every answer.
Consider using the Addition Rule of Probability.
To find the probability of guessing at least three answers correctly in a five question multiple-choice test, first, verify if this experiment satisfies the three conditions of a binomial experiment.
Condition | Given Experiment | Is Satisfied? |
---|---|---|
There is a fixed number of independent trials. | There are five trials — five questions. | ✓ |
Each trial has two possible outcomes. | The answer for each question is either correct or incorrect. | ✓ |
The probability of success is constant for each trial. | The probability of guessing the correct answer for each question is 51=0.2, where 5 is the number of options. | ✓ |
x=3
C(n,k)=k!(n−k)!n!
Subtract terms
Write as a product
Cancel out common factors
Simplify quotient
1!=1
Multiply
Calculate quotient
Calculate power and product
x | 5Cx(0.2)x(0.8)5−x | P(X=x)=5Cx(0.2)x(0.8)5−x |
---|---|---|
3 | 5C3(0.2)3(0.8)5−3=3125160 | P(X=3)=0.0512 |
4 | 5C4(0.2)4(0.8)5−4=312520 | P(X=4)=0.0064 |
5 | 5C5(0.2)5(0.8)5−5=31251 | P(X=5)≈0.00032 |
Substitute values
Add terms
Convert to percent
Condition | Given Experiment | Is Satisfied? |
---|---|---|
There is a fixed number of independent trials. | There are 10 trials — the 10 customers that will enter the store. | ✓ |
Each trial has two possible outcomes. | Whether a customer makes a purchase or not. | ✓ |
The probability of success is constant for each trial. | There is a 0.35 probability that a customer will make a purchase. | ✓ |
x=0
C(n,k)=k!(n−k)!n!
Subtract terms
0!=1
a⋅1=a
aa=1
Calculate power and product
Round to 4 decimal place(s)
x | 10Cx(0.35)x(0.65)10−x | P(X=x)=10Cx(0.35)x(0.65)10−x |
---|---|---|
0 | 10C0(0.35)0(0.65)10−0 | P(X=0)≈0.01346 |
1 | 10C1(0.35)1(0.65)10−1 | P(X=1)≈0.07249 |
2 | 10C2(0.35)2(0.65)10−2 | P(X=2)≈0.17565 |
3 | 10C3(0.35)3(0.65)10−3 | P(X=3)≈0.25222 |
4 | 10C4(0.35)4(0.65)10−4 | P(X=4)≈0.23767 |
5 | 10C5(0.35)5(0.65)10−5 | P(X=5)≈0.15357 |
6 | 10C6(0.35)6(0.65)10−6 | P(X=6)≈0.06891 |
7 | 10C7(0.35)7(0.65)10−7 | P(X=7)≈0.02120 |
8 | 10C8(0.35)8(0.65)10−8 | P(X=8)≈0.00428 |
9 | 10C9(0.35)9(0.65)10−9 | P(X=9)≈0.00051 |
10 | 10C10(0.35)10(0.65)10−10 | P(X=10)≈0.00003 |
x=3
C(n,k)=k!(n−k)!n!
Subtract terms
am⋅an=am+n
Split into factors
Cancel out common factors
Simplify quotient
Multiply
Calculate quotient
Calculate power and product
Round to 2 decimal place(s)
The binomial distribution is a discrete distribution that helps find the probability of the number of successes in a binomial experiment. In some cases, these calculations may be too complex. In such instances, a continuous distribution called normal distribution might approximate the calculations.
Read each situation carefully.
Description | |
---|---|
Situation I | Determining the number of girls in a family with 6 children if every newborn baby has a probability of about 0.45 of being male and 0.55 of being female. |
Situation II | Removing cards from a standard deck of cards and recording whether each one is red or black. No replacement occurs. |
Situation III | Ali records the number of days that it rains in September if there is a 37% of probability that it will rain each day in Salt Lake City. |
We want to identify whether we can model the given situations with binomial experiments. Before we analyze each situation individually, let's review the conditions a binomial experiment must satisfy.
Let's begin by analyzing Situation I.
In this situation, we want to determine the number of girls in a family with 6 children.
Because each child can be a girl or a boy, we have two possible outcomes. The success of this experiment will be a child being a girl. By assuming that the probability of a newborn baby being a girl is independent from any previous pregnancies, we can set the probability of success to be 0.55. P(Success)=0.55 Finally, because the family has 6 children, we have a fixed number of independent trials. This means that this situation can be modeled as a binomial experiment.
This situation involves removing cards from a standard deck without replacing them and recording whether each card is red or black.
This situation might be able to be considered as a binomial experiment because there are only two possible outcomes — each card will be either black or red. Also, the number of trials is fixed because a standard deck has exactly 52 cards.
Number of Trials | Possible Outcomes |
---|---|
52 | Red or Black |
However, let's see if the probability of success in each trial is constant. There are 52 cards in the deck, half of which are red. Therefore, the probability of the first drawn card being red is as follows. P(first card is red)=26/52=1/2 After we remove the first card from the deck, there are 51 cards left. Suppose that the first removed card was, in fact, red and that are now 26−1=25 red cards remaining in the deck. We can use these values to calculate the probability of the second card being red. P(second card is red)=25/51 Since 2551≠ 12, the probability of success in the second trial is different from the probability of success in the first trial. Therefore, the third condition is not satisfied. We can conclude that this situation does not represent a binomial experiment.
Finally, let's analyze whether recording the number of days that it rains during September in Salt Lake City can be reduced to a binomial experiment.
We can see that the possible outcomes are that it rains or it does not. If we consider a rainy day a success, its probability of rain each day is 37 %, or 0.37. Moreover, the number of trials is given by the number of days in September.
Number of Trials | Possible Outcomes |
---|---|
30 | Rain and no rain |
Finally, if we assume that the probability of rain each day is independent from any other day, we can say that this situation can be represented as a binomial experiment.
We have analyzed each situation individually. The table summarizes whether or not each situation can be represented by a binomial experiment.
Binomial? | |
---|---|
Situation I | Yes ✓ |
Situation II | No * |
Situation III | Yes ✓ |
From historical data, a company knows there is a 45% chance that a person visiting their website will make a purchase.
We want to determine the probability that 10 out of the next 20 visitors to the company's website will make a purchase. We can see that this situation seems to be a binomial experiment because there are only two outcomes — a visitor will either make a purchase or will not. Let's see if this situation meets the other conditions of a binomial experiment.
Condition | Given Experiment | Is Satisfied? |
---|---|---|
There is a fixed number of independent trials. | There are 20 trials — the 20 visitors at the website. | ✓ |
Each trial has two possible outcomes. | Whether a visitor makes a purchase or not. | ✓ |
The probability of success is constant for each trial. | There is a 0.45 probability that a visitor will make a purchase. | ✓ |
We have determined that this situation can be modeled as a binomial experiment. This means that we can use the Binomial Probability Formula to calculate the probability that exactly 10 out of the 20 customers will make a purchase. P(X=x)=_nC_xp^xq^(n-x) In this case, X is the random variable representing the number of customers that will make a purchase. The number of trials n is 20. The probability of success p, which is a visitor making a purchase, is 0.45, and the probability of failure q is 1− 0.45= 0.55, and x= 10. P(X=x)=_nC_xp^xq^(n-x) ⇓ P(X= 10)=_(20)C_(10)( 0.45)^(10)( 0.55)^(20- 10) Now we can evaluate the formula to determine the value of P(X= 10).
Therefore, the probability that 10 out of the next 20 visitors to the website make a purchase is about 0.159, or 15.9 %.
A bag contains 20 red marbles and 25 blue marbles.
We want to determine the probability of getting at most 3 blue marbles when drawing 6 marbles, with replacement, from the bag. Let X be the random variable representing this situation. X=Number of blue marbles There are two possible outcomes: the drawn marble is either red or blue. Because we replace every marble we draw before we take out the next one, the number of marbles of each color remains the same. As a result, the probability of choosing a red or a blue marble is constant in every trial. To calculate these probabilities, let's first calculate the total number of marbles. Total Number of Marbles= 25+ 20 ⇕ Total Number of Marbles= 45 This means that the probability of success p, drawing a blue marble, is 25 45, and the probability of failure q is 20 45. Since there are 6 trials in this experiment, we can use the Binomial Probability Formula to determine the probability of getting at most 3 blue marbles. P(X≤ 3)=? To find this probability, we can use the Addition Rule of Probability to find and add the probabilities of P(X=0), P(X=1), P(X=2), and P(X=3). Let's first find the probability of P(X=0).
We can follow a similar process to determine the probabilities of P(X=1), P(X=2), and P(X=3).
x | _6C_x(25/45)^x(20/45)^(6-x) | P(X=x)= _6C_x(25/45)^x(20/45)^(6-x) |
---|---|---|
0 | _6C_0( 25/45)^0( 20/45)^(6- 0) | P(X= 0)≈ 0.0077 |
1 | _6C_1( 25/45)^1( 20/45)^(6- 1) | P(X= 1)≈ 0.0578 |
2 | _6C_2( 25/45)^2( 20/45)^(6- 2) | P(X= 2)≈ 0.1806 |
3 | _6C_3( 25/45)^3( 20/45)^(6- 3) | P(X= 3)≈ 0.3011 |
We can add the values of the last column to get the probability of P(X≤3).
This means that the probability of getting at most three blue marbles is about 54.72 %.
The probability of flipping heads on a biased coin is 0.58. Ignacio calculated the probability of getting four heads in six coin tosses of this biased coin.
To find the mistake in the given procedure, we will evaluate the Binomial Probability Formula step by step and compare it with the ones made by Ignacio. Consider the Binomial Probability Formula. P(X=x)=_nC_xp^xq^(n-x) In this case, X is the random variable representing the number of heads results in six coin tosses. We want to determine the probability of P(X= 4) given that there are 6 trials. Because the probability of success p, getting heads, is 0.58, the probability of failure is 0.42. Let's substitute this information into the formula.
Now let's compare our expression with the one Ignacio wrote in Step 1.
Our Expression | Ignacio's Expression |
---|---|
P(X=4)=_6C_4(0.58)^4*(0.42)^2 | P(X=4)=_6C_4(0.58)^2*(0.42)^4 |
We can see that the exponents are switched. This means that Ignacio made a mistake in Step 1 when calculating the exponents. Now, let's calculate the probability using the correct expression.
Therefore, the probability Ignacio was looking for is about 0.30, or 30 %. Ignacio's calculations were wrong because of the mistake was made in Step 1 when he switched the exponents.