| {{ 'ml-lesson-number-slides' | message : article.intro.bblockCount}} |
| {{ 'ml-lesson-number-exercises' | message : article.intro.exerciseCount}} |
| {{ 'ml-lesson-time-estimation' | message }} |
Here are a few recommended readings before getting started with this lesson.
Understanding Probability
Understanding Descriptive Measures
Understanding Types of Data
Other Recommended Readings
A random variable assigns a numerical value to an outcome of a probability experiment. In many situations, it is important to know how likely it is that a random variable will take a specific value. This can be represented by listing or graphing the probability of each value of a random variable. This is called a probability distribution.
The expected value of a random variable X is the average of the possible outcomes of a random variable. It is used to describe the center of a probability distribution. For a discrete random variable, the expected value E(X) is given by the weighted mean.
E(X)=i=1∑nxi⋅P(X=xi)
In this formula, xi represents a specific outcome, P(X=xi) corresponds to the associated probability of xi, and n is the number of all possible outcomes. According to the law of large numbers, when considering a sequence of random variables, its average tends to the expected value under specific conditions.
The expected value is commonly used with a measure of variation such as the variance or standard deviation to determine how outcome will differ from the expected value.
The standard deviation of a random variable is a measure of variation that describes how spread out the outcomes of a random variable X are from its expected value E(X). The standard deviation is represented by the Greek letter σ — read as sigma
— and is given by the square root of the variance of X.
In this formula, xi is a specific outcome and P(X=xi) is the probability of xi.
Let X be the random variable representing the number of cars sold on a given day in a car dealership. The table below shows the probability distribution of X.
x | 0 | 1 | 2 | 3 | 4 |
---|---|---|---|---|---|
P(X=x) | 151 | 153 | 156 | 153 | 152 |
Substitute values
a⋅cb=ca⋅b
a0=0
\IdPropAdd
Add fractions
Use a calculator
Round to 2 decimal place(s)
xi | [xi−E(X)]2 | [xi−E(X)]2⋅P(X=xi) |
---|---|---|
0 | (0−2.13)2=4.5369 | 4.5364⋅151≈0.3025 |
1 | (1−2.13)2=1.2769 | 1.2769⋅153≈0.2554 |
2 | (2−2.13)2=0.0169 | 0.0169⋅156≈0.0068 |
3 | (3−2.13)2=0.7569 | 0.7569⋅153≈0.1514 |
4 | (4−2.13)2=3.4969 | 3.4969⋅152≈0.4663 |
Variance σ2 | ≈1.1824 |
Finally, calculate the square root of the variance to get the standard deviation of X.
Izabella's aunt Magdalena owns a clothing store. She needs to increase stock in her shop and plans to invest $15000 in one of the two collections that were offered to her by well-known brands. Each brand claims that they have a great expected rate of return. Their probability distributions are described below.
Dylan and Izabella want to help Magdalena make the best decision. They decided to use their recently acquired knowledge about the expected value and standard deviation of a probability distribution to analyze the offers. Help them answer the following questions and give the best advice to Magdalena.
Substitute values
Multiply
Add and subtract terms
xi | [xi−E(X)]2 | [xi−E(X)]2⋅P(X=xi) |
---|---|---|
1200 | (1200−1125)2=5625 | 5625⋅0.5=2812.5 |
1800 | (1800−1125)2=455625 | 455625⋅0.2=91125 |
900 | (900−1125)2=50625 | 50625⋅0.2=10125 |
-150 | (-150−1125)2=1625625 | 1625625⋅0.1=162562.5 |
Sum of Values | 266625 |
Substitute values
Multiply
Add and subtract terms
xi | [xi−E(X)]2 | [xi−E(X)]2⋅P(X=xi) |
---|---|---|
3600 | (3600−1145)2=6027025 | 6027025⋅0.3=1808107.5 |
2850 | (2850−1145)2=2907025 | 2907025⋅0.1=290702.5 |
-300 | (-300−1145)2=2088025 | 2088025⋅0.4=835210 |
-500 | (-500−1145)2=2706025 | 2706025⋅0.2=541205 |
Sum of Values | 3475225 |
The expected value and the standard distribution of each probability distribution have been calculated. The following table summarizes these measures.
Measures of the Probability Distributions | ||
---|---|---|
Expected Value of Collection I | 1125 | |
Expected Value of Collection II | 1145 | |
Standard Deviation of Collection I | ≈516.23 | |
Standard Deviation of Collection II | ≈1864.20 |
The outcomes of many experiments can be reduced to two possibilities, success or failure. If two more conditions are satisfied, these experiments can be modeled by a binomial experiment.
Is there a fixed number of trials for each experiment? How many outcomes are possible? Does the probability of each outcome remain constant for each trial? Are trials independent?
Start by recalling the conditions that a binomial experiment should satisfy.
Analyze each situation one at a time to see if it follows all of the conditions of a binomial experiment.
This situation has eight trials of selecting one scratch-off cards at random.
Each card could win a prize or not, which means there are two possible outcomes for each trial. Moreover, the probability of success, which is winning a prize, is 25% or 0.25, for every card.Note that height can vary for every people surveyed.
Because there are 100 possible answers, it is likely that more than 2 different outcomes will occur. This means that this situation does not represent a binomial experiment.
This situation has a fixed number of trials because it involves asking 20 people if their blood type is O.
Each trial has only two possible solutions, blood type O or another type. Moreover, the probability of having blood type O in each trial is 0.4, which represents the probability of success.All four situations have already been analyzed and the results are summarized in the following table.
Experiment | Is It a Binomial Experiment? |
---|---|
The Scratch-off Cards | ✓ |
Heights of 100 People | × |
Rolling a Die | × |
Blood Type O? | ✓ |
Because binomial experiments can simplify many complex situations, it is essential to determine how likely it is to obtain a specific number of successes out of n trials in a given experiment. Also, the expected value, or center of the distribution, will be presented.
Let X be the random variable representing the number of successes in n binomial trials. The binomial probability P(X=x) can be calculated by using the following formula.
P(X=x)=nCxpxqn−x
In this formula, P(X=x) is the probability that the random variable X is equal to x, which means that there are exactly x successes. Additionally, p and q are the probabilities of success and failure, respectively, and nCx is the binomial coefficient.
This proof will begin by calculating the probability of obtaining a fixed sequence with x successes in n independent trials. The number of all possible sequences with x successes will then be calculated. Finally, joining the first and second parts will give the formula.
n=4, r=2
Subtract term
Write as a product
Cross out common factors
Cancel out common factors
Multiply
2!=2
Calculate quotient
Dylan is taking a test that consists of five multiple-choice questions. Each question has five answer choices, only one of which is correct. He did not have time to study, so he decides to guess every answer.
What is the probability of guessing at least three answers correctly? Write the answer as a percentage. Round to two decimal places.Consider using the Addition Rule of Probability.
To find the probability of guessing at least three answers correctly in a five question multiple-choice test, first, verify if this experiment satisfies the three conditions of a binomial experiment.
Condition | Given Experiment | Is Satisfied? |
---|---|---|
There is a fixed number of independent trials. | There are five trials — five questions. | ✓ |
Each trial has two possible outcomes. | The answer for each question is either correct or incorrect. | ✓ |
The probability of success is constant for each trial. | The probability of guessing the correct answer for each question is 51=0.2, where 5 is the number of options. | ✓ |
x=3
C(n,k)=k!(n−k)!n!
Subtract terms
Write as a product
Cancel out common factors
Simplify quotient
1!=1
Multiply
Calculate quotient
Calculate power and product
x | 5Cx(0.2)x(0.8)5−x | P(X=x)=5Cx(0.2)x(0.8)5−x |
---|---|---|
3 | 5C3(0.2)3(0.8)5−3=3125160 | P(X=3)=0.0512 |
4 | 5C4(0.2)4(0.8)5−4=312520 | P(X=4)=0.0064 |
5 | 5C5(0.2)5(0.8)5−5=31251 | P(X=5)≈0.00032 |
Substitute values
Add terms
Convert to percent
Condition | Given Experiment | Is Satisfied? |
---|---|---|
There is a fixed number of independent trials. | There are 10 trials — the 10 customers that will enter the store. | ✓ |
Each trial has two possible outcomes. | Whether a customer makes a purchase or not. | ✓ |
The probability of success is constant for each trial. | There is a 0.35 probability that a customer will make a purchase. | ✓ |
x=0
C(n,k)=k!(n−k)!n!
Subtract terms
0!=1
a⋅1=a
aa=1
Calculate power and product
Round to 4 decimal place(s)
x | 10Cx(0.35)x(0.65)10−x | P(X=x)=10Cx(0.35)x(0.65)10−x |
---|---|---|
0 | 10C0(0.35)0(0.65)10−0 | P(X=0)≈0.01346 |
1 | 10C1(0.35)1(0.65)10−1 | P(X=1)≈0.07249 |
2 | 10C2(0.35)2(0.65)10−2 | P(X=2)≈0.17565 |
3 | 10C3(0.35)3(0.65)10−3 | P(X=3)≈0.25222 |
4 | 10C4(0.35)4(0.65)10−4 | P(X=4)≈0.23767 |
5 | 10C5(0.35)5(0.65)10−5 | P(X=5)≈0.15357 |
6 | 10C6(0.35)6(0.65)10−6 | P(X=6)≈0.06891 |
7 | 10C7(0.35)7(0.65)10−7 | P(X=7)≈0.02120 |
8 | 10C8(0.35)8(0.65)10−8 | P(X=8)≈0.00428 |
9 | 10C9(0.35)9(0.65)10−9 | P(X=9)≈0.00051 |
10 | 10C10(0.35)10(0.65)10−10 | P(X=10)≈0.00003 |
x=3
C(n,k)=k!(n−k)!n!
Subtract terms
am⋅an=am+n
Split into factors
Cancel out common factors
Simplify quotient
Multiply
Calculate quotient
Calculate power and product
Round to 2 decimal place(s)