
(a)
To calculate:
The sum of squared errors, SSE.

Answer to Problem 9CR
Solution:
The required SSE is 16.7246.
Explanation of Solution
Given Information:
The following data give the thread count and the price (in dollars) of eight sets of sheets.
Thread Count | 150 | 200 | 225 | 250 | 275 | 300 | 350 | 400 |
Price (in Dollars) | 18 | 21 | 25 | 28 | 30 | 31 | 35 | 45 |
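The least-squares regression line, also called the line of best fit, is the line for which the sum of the squared vertical deviations from the data points is the smallest.
Formula used:
The equation of the least-squares regression line is given by,
$\hat{y} = b_0 + b_1 x$
Where the slope is
$b_1 = \dfrac{n\sum x_i y_i - (\sum x_i)(\sum y_i)}{n\sum x_i^2 - (\sum x_i)^2}$
And the y-intercept is
$b_0 = \dfrac{\sum y_i}{n} - b_1 \dfrac{\sum x_i}{n}$
Where n is the number of data pairs in the sample.
The sum of squared errors (SSE) for a regression line is calculated as,
$SSE = \sum (y_i - \hat{y}_i)^2$
Where $y_i$ is the observed value and $\hat{y}_i$ is the value predicted by the regression line.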
Calculation:
Thread Count (x) | Price in Dollars (y) | xy | x² | y² |
150 | 18 | 2700 | 22500 | 324 |
200 | 21 | 4200 | 40000 | 441 |
225 | 25 | 5625 | 50625 | 625 |
250 | 28 | 7000 | 62500 | 784 |
275 | 30 | 8250 | 75625 | 900 |
300 | 31 | 9300 | 90000 | 961 |
350 | 35 | 12250 | 122500 | 1225 |
400 | 45 | 18000 | 160000 | 2025 |
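Let x be the thread count and y be the price in dollars.
The last three columns of the table hold the products $xy$ and the squares $x^2$ and $y^2$.
Substitute 150 for x and 18 for y in the first row,
$xy = 150 \times 18 = 2700, \quad x^2 = 150^2 = 22500, \quad y^2 = 18^2 = 324$
Proceed in the same manner to calculate the remaining rows. The column totals are
$\sum x_i = 2150, \quad \sum y_i = 233, \quad \sum x_i y_i = 67325, \quad \sum x_i^2 = 623750, \quad \sum y_i^2 = 7285$
The slope of the least-squares regression line is calculated as,
$b_1 = \dfrac{n\sum x_i y_i - (\sum x_i)(\sum y_i)}{n\sum x_i^2 - (\sum x_i)^2}$
Substitute 2150 for $\sum x_i$, 233 for $\sum y_i$, 67325 for $\sum x_i y_i$, 623750 for $\sum x_i^2$ and 8 for n,
$b_1 = \dfrac{8(67325) - (2150)(233)}{8(623750) - (2150)^2} = \dfrac{37650}{367500} \approx 0.1024$
The y-intercept of the regression line is calculated as,
$b_0 = \dfrac{\sum y_i}{n} - b_1 \dfrac{\sum x_i}{n}$
Substitute 2150 for $\sum x_i$, 233 for $\sum y_i$, 0.1024 for $b_1$ and 8 for n,
$b_0 = \dfrac{233}{8} - 0.1024\left(\dfrac{2150}{8}\right) = 29.125 - 27.52 = 1.605$
The equation of the least-squares regression line is given by,
$\hat{y} = b_0 + b_1 x$
Substitute 1.605 for $b_0$ and 0.1024 for $b_1$,
$\hat{y} = 1.605 + 0.1024x$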
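As a cross-check on the hand calculation, the slope and intercept formulas can be sketched in Python. The function name `least_squares` is illustrative only and does not come from the textbook; the data are the eight (thread count, price) pairs given above.

```python
# Thread counts (x) and prices in dollars (y) from the given data.
x = [150, 200, 225, 250, 275, 300, 350, 400]
y = [18, 21, 25, 28, 30, 31, 35, 45]


def least_squares(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    sum_xy = sum(a * b for a, b in zip(xs, ys))
    sum_x2 = sum(a * a for a in xs)
    # b1 = (n*Sxy - Sx*Sy) / (n*Sxx - Sx^2)
    slope = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # b0 = mean(y) - b1 * mean(x)
    intercept = sum_y / n - slope * sum_x / n
    return slope, intercept


b1, b0 = least_squares(x, y)
print(f"slope = {b1:.6f}, intercept = {b0:.6f}")
```

Without intermediate rounding the intercept comes out near 1.5918 rather than 1.605. The difference arises because the hand calculation rounds the slope to 0.1024 before computing the intercept.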
Thread Count (x) | Price in Dollars (y) | Predicted value (ŷ) | Residual (y − ŷ) | Squared residual (y − ŷ)² |
150 | 18 | 16.965 | 1.035 | 1.071225 |
200 | 21 | 22.085 | -1.085 | 1.177225 |
225 | 25 | 24.645 | 0.355 | 0.126025 |
250 | 28 | 27.205 | 0.795 | 0.632025 |
275 | 30 | 29.765 | 0.235 | 0.055225 |
300 | 31 | 32.325 | -1.325 | 1.755625 |
350 | 35 | 37.445 | -2.445 | 5.978025 |
400 | 45 | 42.565 | 2.435 | 5.929225 |
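The predicted values are calculated from the regression line,
$\hat{y} = 1.605 + 0.1024x$
Substitute 150 for x,
$\hat{y} = 1.605 + 0.1024(150) = 16.965$
Proceed in the same manner to calculate the remaining predicted values.
The residual is calculated as,
$y - \hat{y}$
Substitute 18 for y and 16.965 for $\hat{y}$,
$y - \hat{y} = 18 - 16.965 = 1.035$
Square both sides of the equation.
$(y - \hat{y})^2 = (1.035)^2 = 1.071225$
Proceed in the same manner to calculate the remaining squared residuals. The SSE is the sum of the last column,
$SSE = 1.071225 + 1.177225 + 0.126025 + 0.632025 + 0.055225 + 1.755625 + 5.978025 + 5.929225 = 16.7246$
Conclusion:
Thus, the SSE is 16.7246.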
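The residual table can be reproduced with a short Python sketch. The helper name `sse` is an assumption for illustration. The first call uses the rounded coefficients from the hand calculation (1.605 and 0.1024), and the second uses unrounded coefficients computed from the column totals.

```python
x = [150, 200, 225, 250, 275, 300, 350, 400]
y = [18, 21, 25, 28, 30, 31, 35, 45]


def sse(xs, ys, intercept, slope):
    """Sum of squared residuals (y - y_hat)^2 for the line y_hat = intercept + slope*x."""
    return sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(xs, ys))


# Rounded coefficients, as used in the worked solution.
sse_rounded = sse(x, y, 1.605, 0.1024)

# Unrounded coefficients from the column totals (n = 8).
slope = (8 * 67325 - 2150 * 233) / (8 * 623750 - 2150 ** 2)
intercept = 233 / 8 - slope * 2150 / 8
sse_exact = sse(x, y, intercept, slope)

print(round(sse_rounded, 4), round(sse_exact, 4))
```

The two values differ only in the fourth decimal place, which is why the Excel output in part (d) reports a residual sum of squares of 16.7245 rather than 16.7246.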
(b)
To calculate:
The standard error of estimate, $S_e$.

Answer to Problem 9CR
Solution:
The required standard error of estimate is $S_e \approx 1.6696$.
Explanation of Solution
Given Information:
The following data give the thread count and the price (in dollars) of eight sets of sheets.
Thread Count | 150 | 200 | 225 | 250 | 275 | 300 | 350 | 400 |
Price (in Dollars) | 18 | 21 | 25 | 28 | 30 | 31 | 35 | 45 |
Formula used:
The standard error of estimate, which measures how much the sample data points deviate from the regression line, is given by,
$S_e = \sqrt{\dfrac{SSE}{n - 2}}$
Where,
n is the number of data pairs in the sample,
And SSE is the sum of squared errors.
Calculation:
The standard error of estimate is calculated as,
$S_e = \sqrt{\dfrac{SSE}{n - 2}}$
Substitute 16.7246 for SSE and 8 for n in the above formula.
$S_e = \sqrt{\dfrac{16.7246}{8 - 2}} = \sqrt{2.7874} \approx 1.6696$
Conclusion:
Thus, the standard error of estimate is $S_e \approx 1.6696$.
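A minimal Python sketch of this step follows, assuming the SSE of 16.7246 from part (a). The function name is illustrative.

```python
import math


def standard_error_of_estimate(sse, n):
    """S_e = sqrt(SSE / (n - 2)) for a simple linear regression on n data pairs."""
    return math.sqrt(sse / (n - 2))


s_e = standard_error_of_estimate(16.7246, 8)
print(round(s_e, 4))
```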
(c)
The 95% prediction interval for the price of 350-thread count sheets.

Answer to Problem 9CR
Solution:
The required prediction interval is approximately $32.843 < y < 42.047$.
Explanation of Solution
Given Information:
The following data give the thread count and the price (in dollars) of eight sets of sheets.
Thread Count | 150 | 200 | 225 | 250 | 275 | 300 | 350 | 400 |
Price (in Dollars) | 18 | 21 | 25 | 28 | 30 | 31 | 35 | 45 |
Formula used:
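The margin of error of a prediction interval for an individual y-value is calculated as,
$E = t_{\alpha/2}\, S_e \sqrt{1 + \dfrac{1}{n} + \dfrac{n(x_0 - \bar{x})^2}{n\sum x_i^2 - (\sum x_i)^2}}$
With degrees of freedom $df = n - 2$.
Where,
n is the number of data pairs in the sample,
SSE is the sum of squared errors,
And $S_e = \sqrt{SSE/(n - 2)}$ is the standard error of estimate, $x_0$ is the fixed value of x, and $\bar{x}$ is the sample mean of the x-values.
Then the prediction interval for an individual y-value is,
$\hat{y} - E < y < \hat{y} + E$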
Calculation:
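It is given that the level of confidence is 0.95, so the level of significance is calculated as,
$\alpha = 1 - 0.95 = 0.05$
Then, with $df = 8 - 2 = 6$,
$t_{\alpha/2} = t_{0.025} = 2.447$
The mean thread count is calculated as,
$\bar{x} = \dfrac{\sum x_i}{n}$
Substitute 2150 for $\sum x_i$ and 8 for n,
$\bar{x} = \dfrac{2150}{8} = 268.75$
The margin of error of a prediction interval for an individual y-value is calculated as,
$E = t_{\alpha/2}\, S_e \sqrt{1 + \dfrac{1}{n} + \dfrac{n(x_0 - \bar{x})^2}{n\sum x_i^2 - (\sum x_i)^2}}$
Substitute 2.447 for $t_{\alpha/2}$, 1.6696 for $S_e$, 8 for n, 350 for $x_0$, 268.75 for $\bar{x}$, 623750 for $\sum x_i^2$ and 2150 for $\sum x_i$,
$E = 2.447(1.6696)\sqrt{1 + \dfrac{1}{8} + \dfrac{8(350 - 268.75)^2}{8(623750) - (2150)^2}} \approx 4.602$
The regression line is,
$\hat{y} = 1.605 + 0.1024x$
Substitute 350 for x,
$\hat{y} = 1.605 + 0.1024(350) = 37.445$
The prediction interval is,
$37.445 - 4.602 < y < 37.445 + 4.602$
$32.843 < y < 42.047$
Conclusion:
The required prediction interval is approximately $32.843 < y < 42.047$, so a set of 350-thread count sheets is predicted, with 95% confidence, to cost between about $32.84 and $42.05.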
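The prediction-interval arithmetic can be checked with the Python sketch below. It assumes the rounded regression line $\hat{y} = 1.605 + 0.1024x$ and the SSE of 16.7246 from part (a), and it hard-codes the critical value 2.447 quoted above rather than looking it up from a t-distribution. All variable names are illustrative.

```python
import math

x = [150, 200, 225, 250, 275, 300, 350, 400]
n = len(x)

t_crit = 2.447                       # t_{0.025} with n - 2 = 6 degrees of freedom, as quoted in the text
s_e = math.sqrt(16.7246 / (n - 2))   # standard error of estimate from part (b)
x0 = 350                             # thread count at which the price is predicted

sum_x = sum(x)
sum_x2 = sum(v * v for v in x)
x_bar = sum_x / n

# Margin of error for an individual y-value.
margin = t_crit * s_e * math.sqrt(
    1 + 1 / n + n * (x0 - x_bar) ** 2 / (n * sum_x2 - sum_x ** 2)
)

y_hat = 1.605 + 0.1024 * x0          # point prediction from the rounded regression line
lower, upper = y_hat - margin, y_hat + margin
print(f"{lower:.3f} < y < {upper:.3f}")
```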
(d)
The 95% confidence interval for the y-intercept of the regression line.

Answer to Problem 9CR
Solution:
The required confidence interval is $(-3.7304, 6.9141)$.
Explanation of Solution
Given Information:
The following data give the thread count and the price (in dollars) of eight sets of sheets.
Thread Count | 150 | 200 | 225 | 250 | 275 | 300 | 350 | 400 |
Price (in Dollars) | 18 | 21 | 25 | 28 | 30 | 31 | 35 | 45 |
Formula Used:
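The coefficient of determination, $r^2$, measures the proportion of the variation in the response variable that is explained by the explanatory variable; it is simply the square of r, the correlation coefficient.
The standard error of estimate,
$S_e = \sqrt{\dfrac{SSE}{n - 2}}$
In ANOVA, with k samples of sizes $n_1, \dots, n_k$, sample means $\bar{x}_1, \dots, \bar{x}_k$ and $n_T = \sum n_i$ data values in total:
Grand Mean is the weighted mean of the sample means,
$\bar{\bar{x}} = \dfrac{\sum_{i=1}^{k} n_i \bar{x}_i}{n_T}$
Sum of Squares among Treatments (SST) measures the variation between the sample means and the grand mean, given by,
$SST = \sum_{i=1}^{k} n_i(\bar{x}_i - \bar{\bar{x}})^2$
Sum of Squares for Error (SSE) measures the variation in the sample data resulting from the variability within each sample,
$SSE = \sum_{i=1}^{k}\sum_{j=1}^{n_i}(x_{ij} - \bar{x}_i)^2$
Total Variation is the sum of the squared deviations from the grand mean for all of the data values in each sample, given by
$\text{Total Variation} = \sum_{i=1}^{k}\sum_{j=1}^{n_i}(x_{ij} - \bar{\bar{x}})^2 = SST + SSE$
Mean Square for Treatments (MST) is found by dividing the sum of squares among treatments by its degrees of freedom, given by
$MST = \dfrac{SST}{k - 1}$
Mean Square for Error (MSE) is found by dividing the sum of squares for error by its degrees of freedom, given by
$MSE = \dfrac{SSE}{n_T - k}$
The test statistic for an ANOVA test is used when independent, simple random samples are taken from populations with variances that are unknown and assumed to be equal, where all of the populations are approximately normally distributed,
$F = \dfrac{MST}{MSE}$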
Calculation:
To generate the regression table in Excel, follow the given steps:
1. Under the Data tab, choose Data Analysis and then select Regression.
2. Select the Input Y Range and enter the range of the given prices, then select the Input X Range and enter the range of the given thread counts.
3. Choose a 95% confidence level and click OK.
The following table will appear.
Regression Statistics
Multiple R | 0.983094904 |
R Square | 0.96647559 |
Adjusted R Square | 0.960888189 |
Standard Error | 1.66955532 |
Observations | 8 |
ANOVA
Source | df | SS | MS | F | Significance F |
Regression | 1 | 482.1505102 | 482.1505102 | 172.9740696 | 1.19253E-05 |
Residual | 6 | 16.7244898 | 2.787414966 | | |
Total | 7 | 498.875 | | | |
Term | Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% |
Intercept | 1.591836735 | 2.17509095 | 0.73184835 | 0.491845191 | -3.730419077 | 6.914092547 |
Slope | 0.10244898 | 0.007789635 | 13.15196067 | 1.19253E-05 | 0.083388428 | 0.121509531 |
RESIDUAL OUTPUT
Observation | Predicted y | Residuals | Standard Residuals |
1 | 16.95918367 | 1.040816327 | 0.673359012 |
2 | 22.08163265 | -1.081632653 | -0.699765248 |
3 | 24.64285714 | 0.357142857 | 0.231054563 |
4 | 27.20408163 | 0.795918367 | 0.514921598 |
5 | 29.76530612 | 0.234693878 | 0.151835856 |
6 | 32.32653061 | -1.326530612 | -0.858202663 |
7 | 37.44897959 | -2.448979592 | -1.584374147 |
8 | 42.57142857 | 2.428571429 | 1.571171029 |
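The confidence interval for the y-intercept is constructed by adding the margin of error to, and subtracting it from, the point estimate; Microsoft Excel reports the endpoints directly.
Referring to the regression statistics,
Standard Error is the standard error of estimate, $S_e \approx 1.6696$.
The Lower 95% and Upper 95% columns give the confidence interval for the y-intercept.
The coefficient in the Intercept row of the table above is the y-intercept, $b_0 \approx 1.5918$, and the coefficient in the Slope row is the slope, $b_1 \approx 0.1024$. Excel does not round the slope before computing the intercept, so its intercept differs slightly from the value 1.605 found by hand in part (a).
So the regression line is,
$\hat{y} = 1.5918 + 0.1024x$
The lower and upper endpoints for a 95% confidence interval for the y-intercept of the regression line are,
$-3.7304 < \beta_0 < 6.9141$
Conclusion:
Thus, the 95% confidence interval for the y-intercept of the regression line is $(-3.7304, 6.9141)$.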
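The Excel endpoints can be approximated without the add-in. The sketch below assumes the usual textbook formula for the standard error of the intercept, $S_e\sqrt{1/n + \bar{x}^2/S_{xx}}$, which is not stated in the solution but reproduces the Standard Error column of the table. It also hard-codes the critical value 2.447 used in part (c). Variable names are illustrative.

```python
import math

x = [150, 200, 225, 250, 275, 300, 350, 400]
y = [18, 21, 25, 28, 30, 31, 35, 45]
n = len(x)

x_bar, y_bar = sum(x) / n, sum(y) / n
s_xx = sum((xi - x_bar) ** 2 for xi in x)
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

b1 = s_xy / s_xx              # slope
b0 = y_bar - b1 * x_bar       # y-intercept
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
s_e = math.sqrt(sse / (n - 2))

# Assumed textbook formula for the standard error of the intercept.
se_b0 = s_e * math.sqrt(1 / n + x_bar ** 2 / s_xx)

t_crit = 2.447                # t_{0.025}, 6 degrees of freedom, rounded table value
lower, upper = b0 - t_crit * se_b0, b0 + t_crit * se_b0
print(f"{lower:.4f} < beta_0 < {upper:.4f}")
```

Because the critical value is rounded to three decimals, the endpoints agree with Excel's Lower 95% and Upper 95% entries only to about three decimal places.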
(e)
Construct a 95% confidence interval for the slope of the regression line.

Answer to Problem 9CR
Solution:
The required confidence interval is $(0.0834, 0.1215)$.
Explanation of Solution
Given Information:
The following data give the thread count and the price (in dollars) of eight sets of sheets.
Thread Count | 150 | 200 | 225 | 250 | 275 | 300 | 350 | 400 |
Price (in Dollars) | 18 | 21 | 25 | 28 | 30 | 31 | 35 | 45 |
Formula Used:
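The coefficient of determination, $r^2$, measures the proportion of the variation in the response variable that is explained by the explanatory variable; it is simply the square of r, the correlation coefficient.
The standard error of estimate,
$S_e = \sqrt{\dfrac{SSE}{n - 2}}$
In ANOVA, with k samples of sizes $n_1, \dots, n_k$, sample means $\bar{x}_1, \dots, \bar{x}_k$ and $n_T = \sum n_i$ data values in total:
Grand Mean is the weighted mean of the sample means,
$\bar{\bar{x}} = \dfrac{\sum_{i=1}^{k} n_i \bar{x}_i}{n_T}$
Sum of Squares among Treatments (SST) measures the variation between the sample means and the grand mean, given by,
$SST = \sum_{i=1}^{k} n_i(\bar{x}_i - \bar{\bar{x}})^2$
Sum of Squares for Error (SSE) measures the variation in the sample data resulting from the variability within each sample,
$SSE = \sum_{i=1}^{k}\sum_{j=1}^{n_i}(x_{ij} - \bar{x}_i)^2$
Total Variation is the sum of the squared deviations from the grand mean for all of the data values in each sample, given by
$\text{Total Variation} = \sum_{i=1}^{k}\sum_{j=1}^{n_i}(x_{ij} - \bar{\bar{x}})^2 = SST + SSE$
Mean Square for Treatments (MST) is found by dividing the sum of squares among treatments by its degrees of freedom, given by
$MST = \dfrac{SST}{k - 1}$
Mean Square for Error (MSE) is found by dividing the sum of squares for error by its degrees of freedom, given by
$MSE = \dfrac{SSE}{n_T - k}$
The test statistic for an ANOVA test is used when independent, simple random samples are taken from populations with variances that are unknown and assumed to be equal, where all of the populations are approximately normally distributed,
$F = \dfrac{MST}{MSE}$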
Calculation:
To generate the regression table in Excel, follow the given steps:
1. Under the Data tab, choose Data Analysis and then select Regression.
2. Select the Input Y Range and enter the range of the given prices, then select the Input X Range and enter the range of the given thread counts.
3. Choose a 95% confidence level and click OK.
The following table will appear.
Regression Statistics
Multiple R | 0.983094904 |
R Square | 0.96647559 |
Adjusted R Square | 0.960888189 |
Standard Error | 1.66955532 |
Observations | 8 |
ANOVA
Source | df | SS | MS | F | Significance F |
Regression | 1 | 482.1505102 | 482.1505102 | 172.9740696 | 1.19253E-05 |
Residual | 6 | 16.7244898 | 2.787414966 | | |
Total | 7 | 498.875 | | | |
Term | Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% |
Intercept | 1.591836735 | 2.17509095 | 0.731848356 | 0.491845191 | -3.730419077 | 6.914092547 |
Slope | 0.10244898 | 0.007789635 | 13.15196067 | 1.19253E-05 | 0.083388428 | 0.121509531 |
RESIDUAL OUTPUT
Observation | Predicted y | Residuals | Standard Residuals |
1 | 16.95918367 | 1.040816327 | 0.673359012 |
2 | 22.08163265 | -1.081632653 | -0.699765248 |
3 | 24.64285714 | 0.357142857 | 0.231054563 |
4 | 27.20408163 | 0.795918367 | 0.514921598 |
5 | 29.76530612 | 0.234693878 | 0.151835856 |
6 | 32.32653061 | -1.326530612 | -0.858202663 |
7 | 37.44897959 | -2.448979592 | -1.584374147 |
8 | 42.57142857 | 2.428571429 | 1.571171029 |
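The Lower 95% and Upper 95% columns give the confidence interval for the slope.
The coefficient in the Slope row of the table above is the slope, $b_1 \approx 0.1024$, and the coefficient in the Intercept row is the y-intercept, $b_0 \approx 1.5918$.
So the regression line is,
$\hat{y} = 1.5918 + 0.1024x$
The lower and upper endpoints for a 95% confidence interval for the slope of the regression line are,
$0.0834 < \beta_1 < 0.1215$
Conclusion:
Thus, the 95% confidence interval for the slope of the regression line is $(0.0834, 0.1215)$.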
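The slope interval can be approximated in the same way. The sketch below assumes the usual textbook formula for the standard error of the slope, $S_e/\sqrt{S_{xx}}$, which the solution does not state but which matches the Standard Error entry in the Slope row. The critical value 2.447 is again hard-coded, and variable names are illustrative.

```python
import math

x = [150, 200, 225, 250, 275, 300, 350, 400]
y = [18, 21, 25, 28, 30, 31, 35, 45]
n = len(x)

x_bar, y_bar = sum(x) / n, sum(y) / n
s_xx = sum((xi - x_bar) ** 2 for xi in x)
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

b1 = s_xy / s_xx              # slope
b0 = y_bar - b1 * x_bar       # y-intercept
sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
s_e = math.sqrt(sse / (n - 2))

# Assumed textbook formula for the standard error of the slope.
se_b1 = s_e / math.sqrt(s_xx)
t_stat = b1 / se_b1           # corresponds to the "t Stat" entry in the Slope row

t_crit = 2.447                # t_{0.025}, 6 degrees of freedom, rounded table value
lower, upper = b1 - t_crit * se_b1, b1 + t_crit * se_b1
print(f"{lower:.4f} < beta_1 < {upper:.4f}")
```

The interval lies entirely above zero, which is consistent with the very small P-value reported for the slope.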