A professor is attempting to identify trends among final exam scores. His class has a mixture of students, so he wonders if there is any relationship between age and final exam scores. One way for him to analyze the scores is by creating a diagram that relates the age of each student to the exam score received. In this section, we will examine one such diagram known as a scatter plot.
Drawing and Interpreting Scatter Plots
A scatter plot is a graph of plotted points that may show a relationship between two sets of data. If the relationship is from a linear model, or a model that is nearly linear, the professor can draw conclusions using his knowledge of linear functions. [link] shows a sample scatter plot.
Notice this scatter plot does not indicate a linear relationship. The points do not appear to follow a trend. In other words, there does not appear to be a relationship between the age of the student and the score on the final exam.
[link] shows the number of cricket chirps in 15 seconds, for several different air temperatures, in degrees Fahrenheit
Chirps | 44 | 35 | 20.4 | 33 | 31 | 35 | 18.5 | 37 | 26 |
Temperature | 80.5 | 70.5 | 57 | 66 | 68 | 72 | 52 | 73.5 | 53 |
Plotting this data, as depicted in [link] suggests that there may be a trend. We can see from the trend in the data that the number of chirps increases as the temperature increases. The trend appears to be roughly linear, though certainly not perfectly so.
Finding the Line of Best Fit
Once we recognize a need for a linear function to model that data, the natural follow-up question is “what is that linear function?” One way to approximate our linear function is to sketch the line that seems to best fit the data. Then we can extend the line until we can verify the y-intercept. We can approximate the slope of the line by extending it until we can estimate the $\frac{\text{rise}}{\text{run}}.$
Find a linear function that fits the data in [link] by “eyeballing” a line that seems to fit.
On a graph, we could try sketching a line.
Using the starting and ending points of our hand drawn line, points (0, 30) and (50, 90), this graph has a slope of
and a y-intercept at 30. This gives an equation of
where $c$ is the number of chirps in 15 seconds, and $T\left(c\right)$ is the temperature in degrees Fahrenheit. The resulting equation is represented in [link].
Recognizing Interpolation or Extrapolation
While the data for most examples does not fall perfectly on the line, the equation is our best guess as to how the relationship will behave outside of the values for which we have data. We use a process known as interpolation when we predict a value inside the domain and range of the data. The process of extrapolation is used when we predict a value outside the domain and range of the data.
[link] compares the two processes for the cricket-chirp data addressed in [link]. We can see that interpolation would occur if we used our model to predict temperature when the values for chirps are between 18.5 and 44. Extrapolation would occur if we used our model to predict temperature when the values for chirps are less than 18.5 or greater than 44.
There is a difference between making predictions inside the domain and range of values for which we have data and outside that domain and range. Predicting a value outside of the domain and range has its limitations. When our model no longer applies after a certain point, it is sometimes called model breakdown. For example, predicting a cost function for a period of two years may involve examining the data where the input is the time in years and the output is the cost. But if we try to extrapolate a cost when $x=\mathrm{50,}$ that is in 50 years, the model would not apply because we could not account for factors fifty years in the future.
Different methods of making predictions are used to analyze data.
- The method of interpolation involves predicting a value inside the domain and/or range of the data.
- The method of extrapolation involves predicting a value outside the domain and/or range of the data.
- Model breakdown occurs at the point when the model no longer applies.
Use the cricket data from [link] to answer the following questions:
- Would predicting the temperature when crickets are chirping 30 times in 15 seconds be interpolation or extrapolation? Make the prediction, and discuss whether it is reasonable.
- Would predicting the number of chirps crickets will make at 40 degrees be interpolation or extrapolation? Make the prediction, and discuss whether it is reasonable.
- The number of chirps in the data provided varied from 18.5 to 44. A prediction at 30 chirps per 15 seconds is inside the domain of our data, so would be interpolation. Using our model:
Based on the data we have, this value seems reasonable. - The temperature values varied from 52 to 80.5. Predicting the number of chirps at 40 degrees is extrapolation because 40 is outside the range of our data. Using our model:
We can compare the regions of interpolation and extrapolation using [link].
Our model predicts the crickets would chirp 8.33 times in 15 seconds. While this might be possible, we have no reason to believe our model is valid outside the domain and range. In fact, generally crickets stop chirping altogether below around 50 degrees.
According to the data from [link], what temperature can we predict it is if we counted 20 chirps in 15 seconds?
$54\xb0\text{F}$
Finding the Line of Best Fit Using a Graphing Utility
While eyeballing a line works reasonably well, there are statistical techniques for fitting a line to data that minimize the differences between the line and data values
Given data of input and corresponding outputs from a linear function, find the best fit line using linear regression.
- Enter the input in List 1 (L1).
- Enter the output in List 2 (L2).
- On a graphing utility, select Linear Regression (LinReg).
Find the least squares regression line using the cricket-chirp data in [link].
- Enter the input (chirps) in List 1 (L1).
- Enter the output (temperature) in List 2 (L2). See [link].
L1 44 35 20.4 33 31 35 18.5 37 26 L2 80.5 70.5 57 66 68 72 52 73.5 53 - On a graphing utility, select Linear Regression (LinReg). Using the cricket chirp data from earlier, with technology we obtain the equation:
Notice that this line is quite similar to the equation we “eyeballed” but should fit the data better. Notice also that using this equation would change our prediction for the temperature when hearing 30 chirps in 15 seconds from 66 degrees to:
The graph of the scatter plot with the least squares regression line is shown in [link].
Will there ever be a case where two different lines will serve as the best fit for the data?
No. There is only one best fit line.
Distinguishing Between Linear and Non-Linear Models
As we saw above with the cricket-chirp model, some data exhibit strong linear trends, but other data, like the final exam scores plotted by age, are clearly nonlinear. Most calculators and computer software can also provide us with the correlation coefficient, which is a measure of how closely the line fits the data. Many graphing calculators require the user to turn a ”diagnostic on” selection to find the correlation coefficient, which mathematicians label as $r.$ The correlation coefficient provides an easy way to get an idea of how close to a line the data falls.
We should compute the correlation coefficient only for data that follows a linear pattern or to determine the degree to which a data set is linear. If the data exhibits a nonlinear pattern, the correlation coefficient for a linear regression is meaningless. To get a sense for the relationship between the value of $r$ and the graph of the data, [link] shows some large data sets with their correlation coefficients. Remember, for all plots, the horizontal axis shows the input and the vertical axis shows the output.
The correlation coefficient is a value,$r,$ between –1 and 1.
- r > 0 suggests a positive (increasing) relationship
- r < 0 suggests a negative (decreasing) relationship
- The closer the value is to 0, the more scattered the data.
- The closer the value is to 1 or –1, the less scattered the data is.
Calculate the correlation coefficient for cricket-chirp data in [link].
Because the data appear to follow a linear pattern, we can use technology to calculate $r.$ Enter the inputs and corresponding outputs and select the Linear Regression. The calculator will also provide you with the correlation coefficient, $r=\mathrm{0.9509.}$ This value is very close to 1, which suggests a strong increasing linear relationship.
Note: For some calculators, the Diagnostics must be turned "on" in order to get the correlation coefficient when linear regression is performed: [2nd]>[0]>[alpha][x–1], then scroll to DIAGNOSTICSON.
Predicting with a Regression Line
Once we determine that a set of data is linear using the correlation coefficient, we can use the regression line to make predictions. As we learned above, a regression line is a line that is closest to the data in the scatter plot, which means that only one such line is a best fit for the data.
Gasoline consumption in the United States has been steadily increasing. Consumption data from 1994 to 2004 is shown in [link]
Year | '94 | '95 | '96 | '97 | '98 | '99 | '00 | '01 | '02 | '03 | '04 |
Consumption (billions of gallons) | 113 | 116 | 118 | 119 | 123 | 125 | 126 | 128 | 131 | 133 | 136 |
The scatter plot of the data, including the least squares regression line, is shown in [link].
We can introduce new input variable, $t,$representing years since 1994.
The least squares regression equation is:
Using technology, the correlation coefficient was calculated to be 0.9965, suggesting a very strong increasing linear trend.
Using this to predict consumption in 2008 $(t=14),$
The model predicts 144.244 billion gallons of gasoline consumption in 2008.
Use the model we created using technology in [link] to predict the gas consumption in 2011. Is this an interpolation or an extrapolation?
150.871 billion gallons; extrapolation
Access these online resources for additional instruction and practice with fitting linear models to data.
Visit this website for additional practice questions from Learningpod.
Key Concepts
- Scatter plots show the relationship between two sets of data. See [link].
- Scatter plots may represent linear or non-linear models.
- The line of best fit may be estimated or calculated, using a calculator or statistical software. See [link].
- Interpolation can be used to predict values inside the domain and range of the data, whereas extrapolation can be used to predict values outside the domain and range of the data. See [link].
- The correlation coefficient, $r,$ indicates the degree of linear relationship between data. See [link].
- A regression line best fits the data. See [link].
- The least squares regression line is found by minimizing the squares of the distances of points from a line passing through the data and may be used to make predictions regarding either of the variables. See [link].
Section Exercises
Verbal
Describe what it means if there is a model breakdown when using a linear model.
When our model no longer applies, after some value in the domain, the model itself doesn’t hold.
What is interpolation when using a linear model?
What is extrapolation when using a linear model?
We predict a value outside the domain and range of the data.
Explain the difference between a positive and a negative correlation coefficient.
Explain how to interpret the absolute value of a correlation coefficient.
The closer the number is to 1, the less scattered the data, the closer the number is to 0, the more scattered the data.
Algebraic
A regression was run to determine whether there is a relationship between hours of TV watched per day $(x)$ and number of sit-ups a person can do $(y).$ The results of the regression are given below. Use this to predict the number of sit-ups a person who watches 11 hours of TV can do.
A regression was run to determine whether there is a relationship between the diameter of a tree ($x,$in inches) and the tree’s age ($y,$in years). The results of the regression are given below. Use this to predict the age of a tree with diameter 10 inches.
61.966 years
For the following exercises, draw a scatter plot for the data provided. Does the data appear to be linearly related?
0 | 2 | 4 | 6 | 8 | 10 |
–22 | –19 | –15 | –11 | –6 | –2 |
1 | 2 | 3 | 4 | 5 | 6 |
46 | 50 | 59 | 75 | 100 | 136 |
No.
100 | 250 | 300 | 450 | 600 | 750 |
12 | 12.6 | 13.1 | 14 | 14.5 | 15.2 |
1 | 3 | 5 | 7 | 9 | 11 |
1 | 9 | 28 | 65 | 125 | 216 |
No.
For the following data, draw a scatter plot. If we wanted to know when the population would reach 15,000, would the answer involve interpolation or extrapolation? Eyeball the line, and estimate the answer.
Year | 1990 | 1995 | 2000 | 2005 | 2010 |
Population | 11,500 | 12,100 | 12,700 | 13,000 | 13,750 |
For the following data, draw a scatter plot. If we wanted to know when the temperature would reach 28 °F, would the answer involve interpolation or extrapolation? Eyeball the line and estimate the answer.
Temperature, °F | 16 | 18 | 20 | 25 | 30 |
Time, seconds | 46 | 50 | 54 | 55 | 62 |
Interpolation. About $60\xb0\text{F}.$
Graphical
For the following exercises, match each scatterplot with one of the four specified correlations in [link] and [link].
$r=0.\text{95}$
$r=-0.\text{89}$
C
$r=0.26$
$r=-0.39$
B
For the following exercises, draw a best-fit line for the plotted data.
Numeric
The U.S. Census tracks the percentage of persons 25 years or older who are college graduates. That data for several years is given in [link]
Year | 1990 | 1992 | 1994 | 1996 | 1998 | 2000 | 2002 | 2004 | 2006 | 2008 |
Percent Graduates | 21.3 | 21.4 | 22.2 | 23.6 | 24.4 | 25.6 | 26.7 | 27.7 | 28 | 29.4 |
The U.S. import of wine (in hectoliters) for several years is given in [link]. Determine whether the trend appears linear. If so, and assuming the trend continues, in what year will imports exceed 12,000 hectoliters?
Year | 1992 | 1994 | 1996 | 1998 | 2000 | 2002 | 2004 | 2006 | 2008 | 2009 |
Imports | 2665 | 2688 | 3565 | 4129 | 4584 | 5655 | 6549 | 7950 | 8487 | 9462 |
Yes, trend appears linear because $r=0.\text{985}$ and will exceed 12,000 near midyear, 2016, 24.6 years since 1992.
[link] shows the year and the number of people unemployed in a particular city for several years. Determine whether the trend appears linear. If so, and assuming the trend continues, in what year will the number of unemployed reach 5?
Year | 1990 | 1992 | 1994 | 1996 | 1998 | 2000 | 2002 | 2004 | 2006 | 2008 |
Number Unemployed | 750 | 670 | 650 | 605 | 550 | 510 | 460 | 420 | 380 | 320 |
Technology
For the following exercises, use each set of data to calculate the regression line using a calculator or other technology tool, and determine the correlation coefficient to 3 decimal places of accuracy.
$x$ | 8 | 15 | 26 | 31 | 56 |
$y$ | 23 | 41 | 53 | 72 | 103 |
$y=\text{1}.\text{64}0x+\text{13}.\text{8}00$, $r=0.\text{987}$
$x$ | 5 | 7 | 10 | 12 | 15 |
$y$ | 4 | 12 | 17 | 22 | 24 |
$x$ | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
$y$ | 21.9 | 22.22 | 22.74 | 22.26 | 20.78 | 17.6 | 16.52 | 18.54 |
$x$ | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 |
$y$ | 15.76 | 13.68 | 14.1 | 14.02 | 11.94 | 12.76 | 11.28 | 9.1 |
$$y=-0.962x+26.86,r=-0.965$$
$x$ | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
$$y$$ | 44.8 | 43.1 | 38.8 | 39 | 38 | 32.7 | 30.1 | 29.3 | 27 | 25.8 |
$x$ | 21 | 25 | 30 | 31 | 40 | 50 |
$y$ | 17 | 11 | 2 | $\mathrm{-1}$ | $\mathrm{-18}$ | $\mathrm{-40}$ |
$y=-\text{1}.\text{981}x+\text{6}0.\text{197}$; $r=-0.\text{998}$
$x$ | 100 | 80 | 60 | 55 | 40 | 20 |
$y$ | 2000 | 1798 | 1589 | 1580 | 1390 | 1202 |
$x$ | 900 | 988 | 1000 | 1010 | 1200 | 1205 |
$y$ | 70 | 80 | 82 | 84 | 105 | 108 |
$y=0.\text{121}x-38.841,\text{\hspace{0.17em}}r=0.998$
Extensions
Graph $f(x)=0.5x+10$. Pick a set of 5 ordered pairs using inputs $x=\text{\u22122},\text{1},\text{5},\text{6},\text{9}$ and use linear regression to verify that the function is a good fit for the data.
Graph $f(x)=-2x-10$. Pick a set of 5 ordered pairs using inputs $x=\text{\u22122},\text{1},\text{5},\text{6},\text{9}$ and use linear regression to verify the function.
$\left(\text{\u22122},\mathrm{-6}\right),\left(\text{1},\text{\u221212}\right),\left(\text{5},\text{\u22122}0\right),\left(\text{6},\text{\u221222}\right),\left(\text{9},\text{\u221228}\right)$; $y=\mathrm{-2}x\mathrm{-10}$
For the following exercises, consider this scenario: The profit of a company decreased steadily over a ten-year span. The following ordered pairs shows dollars and the number of units sold in hundreds and the profit in thousands of over the ten-year span, (number of units sold, profit) for specific recorded years:
$\left(\text{46},\text{1},\text{6}00\right),\left(\text{48},\text{1},\text{55}0\right),\left(\text{5}0,\text{1},\text{5}0\text{5}\right),\left(\text{52},\text{1},\text{54}0\right),\left(\text{54},\text{1},\text{495}\right)$.
Use linear regression to determine a function $P$ where the profit in thousands of dollars depends on the number of units sold in hundreds.
Find to the nearest tenth and interpret the x-intercept.
$\left(\text{189}.\text{8},0\right)$ If 18,980 units are sold, the company will have a profit of zero dollars.
Find to the nearest tenth and interpret the y-intercept.
Real-World Applications
For the following exercises, consider this scenario: The population of a city increased steadily over a ten-year span. The following ordered pairs shows the population and the year over the ten-year span, (population, year) for specific recorded years:
$\text{(2500,2000),(2650,2001),(3000,2003),(3500,2006),(4200,2010)}$
Use linear regression to determine a function $y,$ where the year depends on the population. Round to three decimal places of accuracy.
$y=0.00587x+\text{1985}.4\text{1}$
Predict when the population will hit 8,000.
For the following exercises, consider this scenario: The profit of a company increased steadily over a ten-year span. The following ordered pairs show the number of units sold in hundreds and the profit in thousands of over the ten year span, (number of units sold, profit) for specific recorded years:
$\left(\text{46},\text{25}0\right),\left(\text{48},\text{3}0\text{5}\right),\left(\text{5}0,\text{35}0\right),\left(\text{52},\text{39}0\right),\left(\text{54},\text{41}0\right)$.
Use linear regression to determine a function y, where the profit in thousands of dollars depends on the number of units sold in hundreds .
$y=\text{2}0.\text{25}x-\text{671}.\text{5}$
Predict when the profit will exceed one million dollars.
For the following exercises, consider this scenario: The profit of a company decreased steadily over a ten-year span. The following ordered pairs show dollars and the number of units sold in hundreds and the profit in thousands of over the ten-year span (number of units sold, profit) for specific recorded years:
$\text{(46,250),(48,225),(50,205),(52,180),(54,165)}\text{.}$
Use linear regression to determine a function y, where the profit in thousands of dollars depends on the number of units sold in hundreds .
$y=-\text{1}0.\text{75}x+\text{742}.\text{5}0$
Predict when the profit will dip below the $25,000 threshold.
Chapter Review Exercises
Linear Functions
Determine whether the algebraic equation is linear. $2x+3y=7$
Yes
Determine whether the algebraic equation is linear. $6{x}^{2}-y=5$
Determine whether the function is increasing or decreasing.
$f\left(x\right)=7x-2$
Increasing.
Determine whether the function is increasing or decreasing.
$g(x)=-x+2$
Given each set of information, find a linear equation that satisfies the given conditions, if possible.
Passes through $\left(\text{7},\text{5}\right)$ and $\left(\text{3},\text{17}\right)$
$y=-\text{3}x+\text{26}$
Given each set of information, find a linear equation that satisfies the given conditions, if possible.
x-intercept at $\left(\text{6},0\right)$ and y-intercept at $\left(0,\text{1}0\right)$
Find the slope of the line shown in the line graph.
3
Find the slope of the line graphed.
Write an equation in slope-intercept form for the line shown.
$y=\text{2}x-\text{2}$
Does the following table represent a linear function? If so, find the linear equation that models the data.
$x$ | –4 | 0 | 2 | 10 |
$g(x)$ | 18 | –2 | –12 | –52 |
Does the following table represent a linear function? If so, find the linear equation that models the data.
$x$ | 6 | 8 | 12 | 26 |
$g(x)$ | –8 | –12 | –18 | –46 |
Not linear.
On June 1^{st}, a company has $4,000,000 profit. If the company then loses 150,000 dollars per day thereafter in the month of June, what is the company’s profit n^{th}day after June 1^{st}?
Graphs of Linear Functions
For the following exercises, determine whether the lines given by the equations below are parallel, perpendicular, or neither parallel nor perpendicular:
$\begin{array}{l}2x-6y=12\hfill \\ -x+3y=1\hfill \end{array}$
parallel
$\begin{array}{l}\begin{array}{l}\\ y=\frac{1}{3}x-2\end{array}\hfill \\ 3x+y=-9\hfill \end{array}$
For the following exercises, find the x- and y- intercepts of the given equation
$7x+9y=\mathrm{-63}$
$(\mathrm{\u20139},0);(0,\mathrm{\u20137})$
$f(x)=2x-1$
For the following exercises, use the descriptions of the pairs of lines to find the slopes of Line 1 and Line 2. Is each pair of lines parallel, perpendicular, or neither?
Line 1: $m=-2;$ Line 2: $m=-2;$ Parallel
Write an equation for a line perpendicular to $f(x)=5x-1$ and passing through the point (5, 20).
$y=-0.2x+21$
Find the equation of a line with a y- intercept of $\left(0,\text{}2\right)$ and slope $-\frac{1}{2}$.
Sketch a graph of the linear function $f(t)=2t-5$.
Find the point of intersection for the 2 linear functions: $\begin{array}{l}x=y+6\\ 2x-y=13\end{array}$
A car rental company offers two plans for renting a car.
How many miles would you need to drive for plan B to save you money?
250.
Modeling with Linear Functions
Find the area of a triangle bounded by the y axis, the line $f\left(x\right)=10-2x$, and the line perpendicular to $f$ that passes through the origin.
A town’s population increases at a constant rate. In 2010 the population was 55,000. By 2012 the population had increased to 76,000. If this trend continues, predict the population in 2016.
118,000.
The number of people afflicted with the common cold in the winter months dropped steadily by 50 each year since 2004 until 2010. In 2004, 875 people were inflicted.
Find the linear function that models the number of people afflicted with the common cold C as a function of the year, $t.$ When will no one be afflicted?
For the following exercises, use the graph in [link] showing the profit, $y,$in thousands of dollars, of a company in a given year, $x,$where $x$ represents years since 1980.
Find the linear function y, where y depends on $x,$ the number of years since 1980.
$y=-\text{3}00x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}\text{11},\text{5}00$
Find and interpret the y-intercept.
For the following exercise, consider this scenario: In 2004, a school population was 1,700. By 2012 the population had grown to 2,500.
Assume the population is changing linearly.
- How much did the population grow between the year 2004 and 2012?
- What is the average population growth per year?
- Find an equation for the population, P, of the school t years after 2004.
For the following exercises, consider this scenario: In 2000, the moose population in a park was measured to be 6,500. By 2010, the population was measured to be 12,500. Assume the population continues to change linearly.
Find a formula for the moose population, $P.$
What does your model predict the moose population to be in 2020?
18,500
For the following exercises, consider this scenario: The median home values in subdivisions Pima Central and East Valley (adjusted for inflation) are shown in [link]. Assume that the house values are changing linearly.
Year | Pima Central | East Valley |
1970 | 32,000 | 120,250 |
2010 | 85,000 | 150,000 |
In which subdivision have home values increased at a higher rate?
If these trends were to continue, what would be the median home value in Pima Central in 2015?
$91,625
Fitting Linear Models to Data
Draw a scatter plot for the data in [link]. Then determine whether the data appears to be linearly related.
0 | 2 | 4 | 6 | 8 | 10 |
–105 | –50 | 1 | 55 | 105 | 160 |
Draw a scatter plot for the data in [link]. If we wanted to know when the population would reach 15,000, would the answer involve interpolation or extrapolation?
Year | 1990 | 1995 | 2000 | 2005 | 2010 |
Population | 5,600 | 5,950 | 6,300 | 6,600 | 6,900 |
Extrapolation.
Eight students were asked to estimate their score on a 10-point quiz. Their estimated and actual scores are given in [link]. Plot the points, then sketch a line that fits the data.
Predicted | 6 | 7 | 7 | 8 | 7 | 9 | 10 | 10 |
Actual | 6 | 7 | 8 | 8 | 9 | 10 | 10 | 9 |
Draw a best-fit line for the plotted data.
For the following exercises, consider the data in [link], which shows the percent of unemployed in a city of people 25 years or older who are college graduates is given below, by year.
Year | 2000 | 2002 | 2005 | 2007 | 2010 |
Percent Graduates | 6.5 | 7.0 | 7.4 | 8.2 | 9.0 |
Determine whether the trend appears to be linear. If so, and assuming the trend continues, find a linear regression model to predict the percent of unemployed in a given year to three decimal places.
In what year will the percentage exceed 12%?
Midway through 2024.
Based on the set of data given in [link], calculate the regression line using a calculator or other technology tool, and determine the correlation coefficient to three decimal places.
$x$ | 17 | 20 | 23 | 26 | 29 |
$y$ | 15 | 25 | 31 | 37 | 40 |
Based on the set of data given in [link], calculate the regression line using a calculator or other technology tool, and determine the correlation coefficient to three decimal places.
$x$ | 10 | 12 | 15 | 18 | 20 |
$y$ | 36 | 34 | 30 | 28 | 22 |
$y=-1.294x+49.412;\text{}r=-0.974$
For the following exercises, consider this scenario: The population of a city increased steadily over a ten-year span. The following ordered pairs show the population and the year over the ten-year span (population, year) for specific recorded years:
$\text{(3,600,2000);(4,000,2001);(4,700,2003);(6,000,2006)}$
Use linear regression to determine a function $y,$where the year depends on the population, to three decimal places of accuracy.
Predict when the population will hit 12,000.
Early in 2022
What is the correlation coefficient for this model to three decimal places of accuracy?
According to the model, what is the population in 2014?
7,660
Practice Test
Determine whether the following algebraic equation can be written as a linear function. $2x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}3y=7$
Yes.
Determine whether the following function is increasing or decreasing. $f\left(x\right)=-2x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}5$
Determine whether the following function is increasing or decreasing. $f\left(x\right)=7x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}9$
Increasing
Given the following set of information, find a linear equation satisfying the conditions, if possible.
Passes through (5, 1) and (3, –9)
Given the following set of information, find a linear equation satisfying the conditions, if possible.
x intercept at (–4, 0) and y-intercept at (0, –6)
$y\text{\hspace{0.17em}}=\mathrm{-1.5}x\text{\hspace{0.17em}}-6$
Find the slope of the line in [link].
Write an equation for line in [link].
$y=-2x\text{\hspace{0.17em}}-\text{\hspace{0.17em}}1$
Does [link] represent a linear function? If so, find a linear equation that models the data.
$x$ | –6 | 0 | 2 | 4 |
$g\left(x\right)$ | 14 | 32 | 38 | 44 |
Does [link] represent a linear function? If so, find a linear equation that models the data.
$x$ | 1 | 3 | 7 | 11 |
$g(x)$ | 4 | 9 | 19 | 12 |
No.
At 6 am, an online company has sold 120 items that day. If the company sells an average of 30 items per hour for the remainder of the day, write an expression to represent the number of items that were sold $n$ after 6 am.
For the following exercises, determine whether the lines given by the equations below are parallel, perpendicular, or neither parallel nor perpendicular:
$\begin{array}{l}\begin{array}{l}\\ y=\frac{3}{4}x-9\end{array}\hfill \\ -4x-3y=8\hfill \end{array}$
Perpendicular
$\begin{array}{l}\begin{array}{l}\\ -2x+y=3\end{array}\hfill \\ 3x+\frac{3}{2}y=5\hfill \end{array}$
Find the x- and y-intercepts of the equation $2x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}7y=-14.$
$\left(-\text{7},0\right)$; $\left(0,-\text{2}\right)$
Given below are descriptions of two lines. Find the slopes of Line 1 and Line 2. Is the pair of lines parallel, perpendicular, or neither?
Line 1: Passes through $(\mathrm{-2},\mathrm{-6})$ and $(3,14)$
Line 2: Passes through $(2,6)$ and $(4,14)$
Write an equation for a line perpendicular to $f(x)=4x+3$ and passing through the point $\left(8,10\right).$
$y=-0.25x+12$
Sketch a line with a y-intercept of $\left(0,\text{5}\right)$ and slope $-\frac{5}{2}$.
Graph of the linear function $f(x)=\mathrm{-x}+6$.
For the two linear functions, find the point of intersection: $\begin{array}{l}x=y+2\\ 2x-3y=\mathrm{-1}\end{array}$
A car rental company offers two plans for renting a car.
How many miles would you need to drive for plan B to save you money?
150
Find the area of a triangle bounded by the y axis, the line $f\left(x\right)=12-4x$, and the line perpendicular to $f$ that passes through the origin.
A town’s population increases at a constant rate. In 2010 the population was 65,000. By 2012 the population had increased to 90,000. Assuming this trend continues, predict the population in 2018.
165,000
The number of people afflicted with the common cold in the winter months dropped steadily by 25 each year since 2002 until 2012. In 2002, 8,040 people were inflicted. Find the linear function that models the number of people afflicted with the common cold $C$ as a function of the year, $t.$ When will less than 6,000 people be afflicted?
For the following exercises, use the graph in [link], showing the profit, $y$, in thousands of dollars, of a company in a given year, $x$, where $x$ represents years since 1980.
Find the linear function $y$, where $y$ depends on $x$, the number of years since 1980.
$y=875x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}10,675$
Find and interpret the y-intercept.
In 2004, a school population was 1250. By 2012 the population had dropped to 875. Assume the population is changing linearly.
- How much did the population drop between the year 2004 and 2012?
- What is the average population decline per year?
- Find an equation for the population, P, of the school t years after 2004.
Draw a scatter plot for the data provided in [link]. Then determine whether the data appears to be linearly related.
0 | 2 | 4 | 6 | 8 | 10 |
–450 | –200 | 10 | 265 | 500 | 755 |
Draw a best-fit line for the plotted data.
For the following exercises, use [link], which shows the percent of unemployed persons 25 years or older who are college graduates in a particular city, by year.
Year | 2000 | 2002 | 2005 | 2007 | 2010 |
Percent Graduates | 8.5 | 8.0 | 7.2 | 6.7 | 6.4 |
Determine whether the trend appears linear. If so, and assuming the trend continues, find a linear regression model to predict the percent of unemployed in a given year to three decimal places.
In what year will the percentage drop below 4%?
Early in 2018
Based on the set of data given in [link], calculate the regression line using a calculator or other technology tool, and determine the correlation coefficient. Round to three decimal places of accuracy.
$x$ | 16 | 18 | 20 | 24 | 26 |
$y$ | 106 | 110 | 115 | 120 | 125 |
For the following exercises, consider this scenario: The population of a city increased steadily over a ten-year span. The following ordered pairs shows the population (in hundreds) and the year over the ten-year span, (population, year) for specific recorded years:
$\text{(4,500,2000);(4,700,2001);(5,200,2003);(5,800,2006)}$
Use linear regression to determine a function y, where the year depends on the population. Round to three decimal places of accuracy.
$y=0.00455x\text{\hspace{0.17em}}+\text{\hspace{0.17em}}1979.5$
Predict when the population will hit 20,000.
What is the correlation coefficient for this model?
$r=0.999$
- Precalculus
- Preface
- Functions
- Linear Functions
- Polynomial and Rational Functions
- Exponential and Logarithmic Functions
- Trigonometric Functions
- Periodic Functions
- Trigonometric Identities and Equations
- Further Applications of Trigonometry
- Systems of Equations and Inequalities
- Introduction to Systems of Equations and Inequalities
- Systems of Linear Equations: Two Variables
- Systems of Linear Equations: Three Variables
- Systems of Nonlinear Equations and Inequalities: Two Variables
- Partial Fractions
- Matrices and Matrix Operations
- Solving Systems with Gaussian Elimination
- Solving Systems with Inverses
- Solving Systems with Cramer's Rule
- Analytic Geometry
- Sequences, Probability and Counting Theory
- Introduction to Calculus
- Appendix
This linear equation can then be used to approximate answers to various questions we might ask about the trend.