
(a)
To find:
The sum of squared errors, SSE.

Answer to Problem 9E
Solution:
The equation of the least squares regression line is
Squared error (SSE) is
Explanation of Solution
Given Information:
The numbers of parking tickets students received during one semester and their monthly
Parking Tickets and Monthly Income | ||||||||||
Number of Tickets, ![]() |
10 | 8 | 3 | 2 | 0 | 5 | 4 | 2 | 1 | 0 |
Monthly Income (in Dollars), ![]() |
4000 | 3800 | 1500 | 2000 | 870 | 2500 | 1800 | 1200 | 1400 |
The least Squares regression line is the line for which the average variation from the data is the smallest, also called the line of best fit, given by
Where is the slope of the least-squares regression line for paired data from a sample and
is the
-intercept for the regression line.
The sum of squared errors (SSE) for a regression line is the sum of the squares of the residuals.
Formula used:
The equation of least-squares regression line is given by
Where, is the slope of the least-squares regression line given as,
intercept given as,
Here n is the number of data pairs in the sample, is the
value of the explanatory variable and
is the
value of response variable.
The sum of squared errors (SSE) for a regression line is the sum of the squares of the residuals is given by,
Here is the
observed value of the response variable and
is the predicted variable of
using the list square regression model.
Calculation:
The table of the numbers of parking tickets students received during one semester and
their monthly Incomes is given by,
Number of Tickets, |
Monthly Income (in Dollars), |
|||
10 | 4000 | 40000 | 100 | 16000000 |
8 | 3800 | 30400 | 64 | 14440000 |
3 | 1500 | 4500 | 9 | 2250000 |
2 | 2000 | 4000 | 4 | 4000000 |
0 | 870 | 0 | 0 | 756900 |
5 | 2500 | 12500 | 25 | 6250000 |
4 | 1800 | 7200 | 16 | 3240000 |
2 | 1000 | 2000 | 4 | 1000000 |
1 | 1200 | 1200 | 1 | 1440000 |
0 | 1400 | 0 | 0 | 1960000 |
Where,
Substitute 10 for, 8 for
, 3 for
… 0 for
in the above equation,
Substitute for
,
for
,
for
…
for
in the above equation,
In order to calculate the value of and 4000 for
in
Proceed in the same manner to calculate
In order to calculate the value of, substitute 10 for
in
.
Proceed in the same manner to calculate for the rest of the data and refer table for the rest of the
values calculated.
In order to calculate the value of substitute 4000 for
in
.
Proceed in the same manner to calculate for the rest of the data and refer table for the rest of the
values calculated.
The slope of the least-squares regression line is given as,
Substitute 35 for in the above equation of
,
The -intercept of regression line is given as,
Substitute 35 for
The equation of least-squares regression line is,
The table of the numbers of parking tickets students received during one semester and
their monthly incomes is given by,
Number of Tickets, |
Monthly Income (in Dollars), |
Predicted value, |
Residual, |
Squared error, |
10 | 4000 | 4047.87 | -47.87 | 2291.5369 |
8 | 3800 | 3419.91 | 380.09 | 144468.4081 |
3 | 1500 | 1850.01 | -350.01 | 122507.0001 |
2 | 2000 | 1536.03 | 463.97 | 215268.1609 |
0 | 870 | 908.07 | -38.07 | 1449.3249 |
5 | 2500 | 2477.97 | 22.03 | 485.3209 |
4 | 1800 | 2163.99 | -363.99 | 132488.7201 |
2 | 1000 | 1536.03 | -536.03 | 287328.1609 |
1 | 1200 | 1222.05 | -22.05 | 486.2025 |
0 | 1400 | 908.07 | 491.93 | 241995.1249 |
The predicted values are obtained by substituting the values of
in the fitted regression line,
For, substitute 10 for
in the above equation,
Similarly the other values of are obtained and are shown in the third column of the table:
The residuals and
.
Substitute 4000 for and 4047.87 for
in above equation,
Similarly the other values of
The squared errors are obtained as,
Substitute for
in
Thus,
Similarly the other values of
The sum of squared errors (SSE) for a regression line is,
From the above table SSE is given by,
Thus, the sum of squared error (SSE) is 114767.96.
Conclusion:
The equation of least-squares regression line is,
The sum of squared errors (SSE) is 1148767.96.
(b)
To find:
The Standard error of estimate, .

Answer to Problem 9E
Solution:
The Standard error of estimate is 378.94.
Explanation of Solution
Given Information:
The numbers of parking tickets students received during one semester and their monthly
Parking Tickets and Monthly Income | ||||||||||
Number of Tickets, ![]() |
10 | 8 | 3 | 2 | 0 | 5 | 4 | 2 | 1 | 0 |
Monthly Income (in Dollars), ![]() |
4000 | 3800 | 1500 | 2000 | 870 | 2500 | 1800 | 1000 | 1200 | 1400 |
Incomes is,
Formula used:
The standard error of estimate is a measure of the deviation of the sample data points from the regression line and is given by:
Here SSE is the sum of squared error and is the number of paired data set in the sample,
is the
observed value of the response variable,
is the predicted variable of
using the list square regression model.
Calculation:
The formula of Standard error of estimate is,
From part a substitute for SSE and
for
in the above equation of Standard error of estimate.
Conclusion:
Thus, the Standard error of estimate is 378.94.
(c)
To Construct:
A 95% prediction interval for the given value of explanatory variable, .

Answer to Problem 9E
Solution:
The prediction interval is (2032.14, 3551.76).
Explanation of Solution
The prediction interval is a confidence interval for an individual value of the response
Variable y, at a given fixed value of the explanatory variable, x.
Formula used:
The formula to calculate the margin of error of a prediction interval for an individual value of the response variable, y, is given by,
n is the number of data pairs in the sample.
The formula to calculate the Prediction interval is given by,
or
Where
margin of error
The formula to calculate arithmetic mean is given by,
Where
Variables.
Calculation:
Arithmetic Mean is given as,
From part a substitute 10 for, 10 for
, 8 for
…..0 for
in the above equation of arithmetic mean,
Since the level of confidence is 95%,
Then using the t-distribution table, the critical value for this test
For -distribution with 8 degrees of freedom,
The formula to calculate the margin of error of a prediction interval is,
Substitute 1.86 for, 3.5 for
, 10 for
, 35 for
, 223 for
, 6 for
and
for
in the above equation of margin of error,
Predicted value for
is
The prediction interval is:
Substitute 2791.95for and 759.81 for
in the above formula,
The prediction interval is
Conclusion:
Thus, the prediction interval is
(d)
To Construct:
A 95% confidence interval for the intercept of regression line.

Answer to Problem 9E
Solution:
The 95% confidence interval for the intercept
is (496.45, 1319.69).
Explanation of Solution
Given Information:
The numbers of parking tickets students received during one semester and their monthly
Incomes is,
Parking Tickets and Monthly Income | ||||||||||
Number of Tickets, x | 10 | 8 | 3 | 2 | 0 | 5 | 4 | 2 | 1 | 0 |
Monthly Income | 4000 | 3800 | 1500 | 2000 | 870 | 2500 | 1800 | 1000 | 1200 | 1400 |
(in Dollars), y |
Calculation:
To generate the regression table in excel follow the given steps:
1. Under data tab, choose data analytics and then select regression.
2. Select the input Y and select the input X range and enter the range of the given
data.
3.Choose 95% confidence interval and click OK.
The following table will appear.
SUMMARY OUTPUT
Regression Statistics | |||||
Multiple R | 0.94662523 | ||||
R Square | 0.89609937 | ||||
Adjusted R Square | 0.88311172 | ||||
Standard Error | 378.940622 | ||||
Observations | 10 | ||||
ANOVA |
|||||
df | SS | MS | F | Significance F | |
Regression | 1 | 9907642.04 | 9907642 | 68.99665 | 3.32835E-05 |
Residual | 8 | 1148767.96 | 143596 | ||
Total | 9 | 11056410 |
Coefficients | Standard Error | t Stat | P-value | Lower 95.0% | Upper 95.0% | |
Intercept | 908.0696517 | 178.5009634 | 5.087197 | 0.000945 | 496.4456924 | 1319.69361 |
X | 313.9800995 | 37.79968083 | 8.306422 | 3.33E-05 | 226.8138793 | 401.14632 |
From the above result,
1. Multiple R is the absolute value of the .
2.
3. Standard Error is the standard error of the estimate, .
4. The intersection of the Residual row and the SS column is the sum of squared errors, SSE.
5. The Lower 95.0% and the Upper 95.0% columns give the lower and upper endpoints of the 95% confidence intervals for the intercept and slope.
6. The coefficient columns gives the values for the coefficients, that is, the
intercept and slope, of the regression line.
The row labeled Intercept is the row for the values corresponding to the intercept. The last two values in this row are the lower and upper endpoints for a 95% confidence for the
intercept of the regression line,
Thus, the 95% confidence interval for
Confidence Interval=(496.45, 1319.69).
Conclusion:
Thus, the 95% confidence interval for the intercept
is(496.45, 1319.69).
(e)
To Construct:
A 95% confidence interval for the slope of regression line.

Answer to Problem 9E
Solution:
Thus, the 95% confidence interval for the slope
Explanation of Solution
Given Information:
The numbers of parking tickets students received during one semester and their monthly
Incomes is,
Parking Tickets and Monthly Income | ||||||||||
Number of Tickets, x | 10 | 8 | 3 | 2 | 0 | 5 | 4 | 2 | 1 | 0 |
Monthly Income | 4000 | 3800 | 1500 | 2000 | 870 | 2500 | 1800 | 1000 | 1200 | 1400 |
(in Dollars), y |
Calculation:
To generate the regression table in excel follow the given steps:
1. Under data tab, choose data analytics and then select regression.
2. Select the input Y range and enter the range of the given and select the input X range and enter the range of the given
data.
3.Choose 95% confidence interval and click OK.
The following table will appear.
SUMMARY OUTPUT
Regression Statistics | ||||||
Multiple R | 0.946625253 | |||||
R Square | 0.89609937 | |||||
Adjusted R Square | 0.883111792 | |||||
Standard Error | 378.940622 | |||||
Observations | 10 | |||||
ANOVA | ||||||
df | SS | MS | F | Significance F | ||
Regression | 1 | 9907642.04 | 9907642 | 68.99665 | 3.32835E-05 | |
Residual | 8 | 1148767.96 | 143596 | |||
Total | 9 | 11056410 | ||||
Coefficients | Standard Error | t Stat | P-value | Lower 95.0% | Upper 95.0% | |
Intercept | 908.0696517 | 178.5009634 | 5.087197 | 0.000945 | 496.4456924 | 1319.69361 |
X | 313.9800995 | 37.79968083 | 8.306422 | 3.33E-05 | 226.8138793 | 401.14632 |
From the above result,
1. Multiple R is the absolute value of the .
2.
3. Standard Error is the standard error of the estimate, .
4. The intersection of the Residual row and the SS column is the sum of squared errors, SSE.
5. The Lower 95.0% and the Upper 95.0% columns give the lower and upper endpoints of the 95% confidence intervals for the intercept and slope.
6. The coefficient columns gives the values for the coefficients, that is, the
intercept and slope, of the regression line.
The row labeled is the row for the values corresponding to the slope of regression line. The last two values in this row are the lower and upper endpoints for a 95% confidence for the slope of the regression line,
Thus, the 95% confidence interval for
Confidence Interval=(226.81, 401.15).
Conclusion:
Thus, the 95% confidence interval for the slope is(226.81, 401.15).
Want to see more full solutions like this?
Chapter 12 Solutions
Beginning Statistics, 2nd Edition
- This problem is based on the fundamental option pricing formula for the continuous-time model developed in class, namely the value at time 0 of an option with maturity T and payoff F is given by: We consider the two options below: Fo= -rT = e Eq[F]. 1 A. An option with which you must buy a share of stock at expiration T = 1 for strike price K = So. B. An option with which you must buy a share of stock at expiration T = 1 for strike price K given by T K = T St dt. (Note that both options can have negative payoffs.) We use the continuous-time Black- Scholes model to price these options. Assume that the interest rate on the money market is r. (a) Using the fundamental option pricing formula, find the price of option A. (Hint: use the martingale properties developed in the lectures for the stock price process in order to calculate the expectations.) (b) Using the fundamental option pricing formula, find the price of option B. (c) Assuming the interest rate is very small (r ~0), use Taylor…arrow_forwardDiscuss and explain in the picturearrow_forwardBob and Teresa each collect their own samples to test the same hypothesis. Bob’s p-value turns out to be 0.05, and Teresa’s turns out to be 0.01. Why don’t Bob and Teresa get the same p-values? Who has stronger evidence against the null hypothesis: Bob or Teresa?arrow_forward
- Review a classmate's Main Post. 1. State if you agree or disagree with the choices made for additional analysis that can be done beyond the frequency table. 2. Choose a measure of central tendency (mean, median, mode) that you would like to compute with the data beyond the frequency table. Complete either a or b below. a. Explain how that analysis can help you understand the data better. b. If you are currently unable to do that analysis, what do you think you could do to make it possible? If you do not think you can do anything, explain why it is not possible.arrow_forward0|0|0|0 - Consider the time series X₁ and Y₁ = (I – B)² (I – B³)Xt. What transformations were performed on Xt to obtain Yt? seasonal difference of order 2 simple difference of order 5 seasonal difference of order 1 seasonal difference of order 5 simple difference of order 2arrow_forwardCalculate the 90% confidence interval for the population mean difference using the data in the attached image. I need to see where I went wrong.arrow_forward
- Microsoft Excel snapshot for random sampling: Also note the formula used for the last column 02 x✓ fx =INDEX(5852:58551, RANK(C2, $C$2:$C$51)) A B 1 No. States 2 1 ALABAMA Rand No. 0.925957526 3 2 ALASKA 0.372999976 4 3 ARIZONA 0.941323044 5 4 ARKANSAS 0.071266381 Random Sample CALIFORNIA NORTH CAROLINA ARKANSAS WASHINGTON G7 Microsoft Excel snapshot for systematic sampling: xfx INDEX(SD52:50551, F7) A B E F G 1 No. States Rand No. Random Sample population 50 2 1 ALABAMA 0.5296685 NEW HAMPSHIRE sample 10 3 2 ALASKA 0.4493186 OKLAHOMA k 5 4 3 ARIZONA 0.707914 KANSAS 5 4 ARKANSAS 0.4831379 NORTH DAKOTA 6 5 CALIFORNIA 0.7277162 INDIANA Random Sample Sample Name 7 6 COLORADO 0.5865002 MISSISSIPPI 8 7:ONNECTICU 0.7640596 ILLINOIS 9 8 DELAWARE 0.5783029 MISSOURI 525 10 15 INDIANA MARYLAND COLORADOarrow_forwardSuppose the Internal Revenue Service reported that the mean tax refund for the year 2022 was $3401. Assume the standard deviation is $82.5 and that the amounts refunded follow a normal probability distribution. Solve the following three parts? (For the answer to question 14, 15, and 16, start with making a bell curve. Identify on the bell curve where is mean, X, and area(s) to be determined. 1.What percent of the refunds are more than $3,500? 2. What percent of the refunds are more than $3500 but less than $3579? 3. What percent of the refunds are more than $3325 but less than $3579?arrow_forwardA normal distribution has a mean of 50 and a standard deviation of 4. Solve the following three parts? 1. Compute the probability of a value between 44.0 and 55.0. (The question requires finding probability value between 44 and 55. Solve it in 3 steps. In the first step, use the above formula and x = 44, calculate probability value. In the second step repeat the first step with the only difference that x=55. In the third step, subtract the answer of the first part from the answer of the second part.) 2. Compute the probability of a value greater than 55.0. Use the same formula, x=55 and subtract the answer from 1. 3. Compute the probability of a value between 52.0 and 55.0. (The question requires finding probability value between 52 and 55. Solve it in 3 steps. In the first step, use the above formula and x = 52, calculate probability value. In the second step repeat the first step with the only difference that x=55. In the third step, subtract the answer of the first part from the…arrow_forward
- If a uniform distribution is defined over the interval from 6 to 10, then answer the followings: What is the mean of this uniform distribution? Show that the probability of any value between 6 and 10 is equal to 1.0 Find the probability of a value more than 7. Find the probability of a value between 7 and 9. The closing price of Schnur Sporting Goods Inc. common stock is uniformly distributed between $20 and $30 per share. What is the probability that the stock price will be: More than $27? Less than or equal to $24? The April rainfall in Flagstaff, Arizona, follows a uniform distribution between 0.5 and 3.00 inches. What is the mean amount of rainfall for the month? What is the probability of less than an inch of rain for the month? What is the probability of exactly 1.00 inch of rain? What is the probability of more than 1.50 inches of rain for the month? The best way to solve this problem is begin by a step by step creating a chart. Clearly mark the range, identifying the…arrow_forwardClient 1 Weight before diet (pounds) Weight after diet (pounds) 128 120 2 131 123 3 140 141 4 178 170 5 121 118 6 136 136 7 118 121 8 136 127arrow_forwardClient 1 Weight before diet (pounds) Weight after diet (pounds) 128 120 2 131 123 3 140 141 4 178 170 5 121 118 6 136 136 7 118 121 8 136 127 a) Determine the mean change in patient weight from before to after the diet (after – before). What is the 95% confidence interval of this mean difference?arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





