Sign In
| 8 Theory slides |
| 10 Exercises - Grade E - A |
| Each lesson is meant to take 1-2 classroom sessions |
Here are a few recommended readings before getting started with this lesson.
A line of best fit, also known as a regression line, is a line of fit that estimates the relationship between the values of a data set. The equation of the line of best fit has been determined using a strict mathematical method.
One commonly used method to determine a line of best fit is the method of least squares. The methods used to find the line of best fit are usually hard to do by hand. Therefore, a line of best fit can be found by performing a linear regression on a graphing calculator. As an example, consider the data set graphed above.
x | y |
---|---|
0.6 | 1.5 |
1.2 | 3.6 |
2.6 | 5.2 |
3.6 | 6.3 |
4.5 | 8.7 |
6 | 10.3 |
6.6 | 11.8 |
7.1 | 11.7 |
For a school project, Ramsha wants to investigate if there is a correlation between the width of a tree and its height. To do so, she measured the diameter at chest height and the height of some trees in a local park. Her findings are shown in the following table.
Diameter at chest (cm) | Height (m) |
---|---|
8 | 7 |
10 | 10 |
15 | 14 |
18 | 15 |
20 | 18 |
22 | 21 |
25 | 15 |
30 | 20 |
Edit.
Then the data values are written in the columns.
By pressing the STAT button and then selecting the CALC menu, the option LinReg(ax+b)
can be found. This option gives the line of best fit, expressed as a linear function in slope-intercept form.
Then, to graph the scatter plot push the buttons 2nd and Y=. Choose one of the plots in the list. Select the option ON,
choose the type to be a scatter plot, and assign L1 and L2 as XList
and Ylist,
respectively.
The plot can be made by pressing the button GRAPH. It is possible that after drawing the plot the window-size is not adequate for seeing all the information.
To fix this press ZOOM and select the option ZoomStat.
After doing so the window will resize to show the important information.
Use the linear regression feature of a graphing calculator to find the equation of the line of best fit for the given data set. Compare the obtained equation with the equations shown in the applet, and choose the closest one.
The following table displays some values of atmospheric pressures at different altitudes.
Altitude (thousand feet) | 0 | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|---|
Pressure (PSI) | 14.71 | 14.18 | 13.75 | 13.21 | 12.69 | 12.20 |
Edit.
Then the data values can be written in the columns.
Finally, by pressing the STAT button and then selecting the menu item CALC, the option LinReg(ax+b)
can be found. This option gives a line of best fit, expressed as a linear function in slope-intercept form.
To graph the scatter plot, first push the buttons 2nd and Y=. Then, choose one of the plots in the list. Select the option ON,
choose the type to be a scatter plot, and assign L1 and L2 as XList
and Ylist,
respectively.
The plot can be made by pressing the button GRAPH. It is possible that after drawing the plot the window-size is not large enough to see all of the information.
To fix this, press ZOOM and select the option ZoomStat.
After doing that, the window will resize to show the important information.
To find the value of y when x=6, press CALC (2ND and TRACE). Then press ENTER to insert the value of 6 for x. Finally, press ENTER again.
The value of the pressure at 6000 feet is about 11.7 PSI. Since all the data values are close to the line of best fit and the data is strongly correlated, it can be said that this is a good approximation of the actual value.
Davontay has a math assignment that consists of eight different exercises. He registered the time (in minutes) in which he completed the first seven exercises.
Exercise | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
---|---|---|---|---|---|---|---|
Time (minutes) | 4 | 15 | 7 | 16 | 8 | 15 | 5 |
Edit.
The data values can be written in the columns.
Finally, by pressing the STAT button and then selecting the menu item CALC, the option LinReg(ax+b)
can be found. This option gives a line of best fit, expressed as a linear function in slope-intercept form.
To graph the scatter plot, first push the buttons 2nd and Y=. Then, choose one of the plots in the list. Select the option ON,
choose the type to be a scatter plot, and assign L1 and L2 as XList
and Ylist,
respectively.
The plot can be made by pressing the button GRAPH. It is possible that after drawing the plot the window-size is not large enough to see all of the information.
To fix this, press ZOOM and select the option ZoomStat.
After doing that, the window will resize to show the important information.
Looking at the graph, it can be seen that the line of best fit is not close to any of the provided data points.
From Part C it can be noted that the line is not representative of the given data points. This means that these measures do not reflect the reality of the exercises.
Then, to find the value of y when x=8, press CALC (2ND and TRACE). Then press ENTER to insert the value of 8 for x. Finally, press ENTER again.
The value of y when x=8 is about 10.57. This means that Davontay will finish the eighth exercise in less than 11 minutes. Since none of the given data values are really close to the line of best fit and the data is not correlated, it can be said that this is a not good approximation for the actual value.
In this lesson it was shown how to find the line of best fit for data sets and how to make predictions using these lines. Considering the examples discussed throughout the lesson, it is possible to make two conclusions.
The table shows some data for two variables.
x | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|---|
y | 1 | 3 | 3 | 2 | 4 | 3 | 2.5 | 5 |
We have been given a table with data for x and y.
x | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|---|
y | 1 | 3 | 3 | 2 | 4 | 3 | 2.5 | 5 |
In finding the line of line of fit using our calculator, we will begin by entering the values. Let's press the STAT button.
Then we will choose the first option in the menu, Edit,
and fill in the values from the lists L1 and L2.
We can perform a regression analysis on the data by pressing the STAT button again, followed by using the right-arrow key to select the CALC menu.
This menu lists the various regressions that are available. If we choose the fourth option in the menu LinReg(ax+b)
and press ENTER, the calculator performs a linear regression using the data that was entered.
Just before completing the process, we can round the values of a and b, then substitute them into the equation y= ax+ b. This gives us the equation for the line of best fit. y= 0.33x+ 1.46 We can see how the line fits with the data by plotting the data points and graphing the line on the same coordinate plane.
Consider the following data set.
x | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|---|
y | 1 | -4 | -2 | -5 | -4 | -9 | -8 | -11 |
We have been given a table with data for x and y.
x | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
---|---|---|---|---|---|---|---|---|
y | 1 | -4 | -2 | -5 | -4 | -9 | -8 | -11 |
In order to find a line of fit using our calculator, we need to first enter the values. Let's press the STAT button.
We will then choose the first option in the menu, Edit,
and fill in the values in the lists L1 and L2.
We can perform a regression analysis about the data by pressing the STAT button again, followed by using the right-arrow key to select the CALC menu.
This menu lists the various regressions that are available. If we choose the fourth option in the menu LinReg(ax+b)
and press ENTER, the calculator performs a linear regression using the data that was entered.
The correlation coefficient is the value of r displayed on the output screen. Finally, by rounding the value we get that the correlation coefficient r is about -0.92.
We found the correlation coefficient in the previous part. r ≈ - 0.92 Since the value of the correlation coefficient is negative, the data values have a negative correlation. Also importantly, the correlation coefficient is really close to -1. Thanks to that fact, the data have a strong negative correlation.
When doing homework, Maya obtained the following display on her graphing calculator.
Let's begin by looking at the given display screen.
The correlation coefficient is the value of r on the display screen. Looking at the display, we can see that the correlation coefficient is r=- 0.998972... Therefore, the data have a strong negative correlation. The mistake was that Maya interchanged r with r^2. r = - 0.998972 ... ⇓ Strong Negative Correlation On the other hand, if the correlation coefficient were 0.997945, it would mean that the data have a strong positive correlation. Therefore, Maya's interpretation was correct but that value was not the correlation coefficient.
Consider the following display of a graphing calculator.
We are asked to find and correct the error that Tearrik made. We will first create the line of best fit ourselves. Then we will describe the error.
Let's consider the graphing calculator display.
We will substitute the values of a and b into the equation y= ax+ b to find the equation of the line of best fit. y= 7.19x+ 25.32
The written equation has interchanged the values of a and b. According to the display, we have a= 7.19 and b= 25.32. The coefficient of x should be 7.19 and the constant 25.32.
Ali is the founder and face of the brand of sunglasses named Daytas.
Ali wants to gain a better understanding of how the company did in the past and how that is relevant to the future. The table indicates the sales of Daytas from 2010 to 2015.
Year | 2010 | 2011 | 2012 | 2013 | 2014 | 2015 |
---|---|---|---|---|---|---|
Sales (Thousands of Sunglasses) | 4.42 | 5.4 | 8.7 | 13.2 | 15.4 | 19.3 |
We have been given a table that shows the sales of sunglasses.
Year | 2010 | 2011 | 2012 | 2013 | 2014 | 2015 |
---|---|---|---|---|---|---|
Sales (Thousands of Sunglasses) | 4.42 | 5.4 | 8.7 | 13.2 | 15.4 | 19.3 |
In order to find a line of fit using our graphing calculator, we will first enter the values. Let's going and press the STAT button.
Then we choose the first option in the menu, Edit,
and fill in the values in lists L1 and L2.
We can perform a regression analysis about the data by pressing the STAT button again, followed by using the right-arrow key to select the CALC menu.
This menu lists the various regressions that are available. If we choose the fourth option in the menu LinReg(ax+b)
and press ENTER, the calculator performs a linear regression using the data that was entered.
We will substitute the rounded values of a and b into the equation y= ax+ b to find the equation of the line of best fit. y= 3.11x -6250.68
To find the sales of the year 2025, we will substitute 2025 for x in the equation of the line of best fit found previously.
Therefore, Ali can expect to sale about 47.07 thousand sunglasses in 2025.
Tiffaniqua is buying her first car, which she wants to be a used car. After determining the model and year of the car she wants to buy, she finds a table online that shows the relationship between the mileage and prices of the cars.
Mileage, x | 23 | 15 | 19 | 31 | 9 | 25 |
---|---|---|---|---|---|---|
Price, y | 11 | 12 | 13 | 10 | 13 | 12 |
We have been given a table with the car's price for different mileages.
Mileage, x | 23 | 15 | 19 | 31 | 9 | 25 |
---|---|---|---|---|---|---|
Price, y | 11 | 12 | 13 | 10 | 13 | 12 |
In order to find a line of fit using our calculator, we need to first enter the values. Let's press the STAT button.
Then we choose the first option in the menu, Edit,
and fill in the values in lists L1 and L2.
We can perform a regression analysis on the data by pressing the STAT button again, followed by using the right-arrow key to select the CALC menu.
This menu lists the various regressions that are available. If we choose the fourth option in the menu LinReg(ax+b)
and press ENTER, the calculator performs a linear regression using the data that was entered.
We will substitute the rounded values of a and b into the equation y= ax+ b to find the equation of the line of best fit. y= - 0.12x+ 14.31
The correlation coefficient is the value of r in the linear regression output.
Looking at the screen, we can find and round this value. r ≈ - 0.81
The correlation coefficient r is always between -1≤ r≤ 1, where positive values represent a positive correlation and negative values represent a negative correlation. Additionally, a value close to 0 represents a weak correlation. Therefore, with our value of -0.81 we can say that the data have a strong negative correlation.