Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σ x i = 303.7, Σ y i = 52.8, S xx = 1585.230769, S v = −341.959231. and Syy = 77.270769. a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h. b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate? c. Calculate and interpret s . d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.

Question

Want to see more full solutions like this?

Answer 1

Textbook Question

Chapter 12.3, Problem 35E

Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σx_i = 303.7, Σy_i = 52.8, S_xx = 1585.230769, S_v = −341.959231. and Syy = 77.270769.

a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h.
b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate?
c. Calculate and interpret s.
d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.

a.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

b.

Expert Solution

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

c.

Expert Solution

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

d.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Students have asked these similar questions

A major company in the Montreal area, offering a range of engineering services from project preparation to construction execution, and industrial project management, wants to ensure that the individuals who are responsible for project cost estimation and bid preparation demonstrate a certain uniformity in their estimates. The head of civil engineering and municipal services decided to structure an experimental plan to detect if there could be significant differences in project evaluation. Seven projects were selected, each of which had to be evaluated by each of the two estimators, with the order of the projects submitted being random. The obtained estimates are presented in the table below. a) Complete the table above by calculating: i. The differences (A-B) ii. The sum of the differences iii. The mean of the differences iv. The standard deviation of the differences b) What is the value of the t-statistic? c) What is the critical t-value for this test at a significance level of 1%?…

Compute the relative risk of falling for the two groups (did not stop walking vs. did stop). State/interpret your result verbally.

Microsoft Excel include formulas

Answer 2

Textbook Question

Chapter 12.3, Problem 35E

Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σx_i = 303.7, Σy_i = 52.8, S_xx = 1585.230769, S_v = −341.959231. and Syy = 77.270769.

a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h.
b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate?
c. Calculate and interpret s.
d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.

a.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

b.

Expert Solution

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

c.

Expert Solution

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

d.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Answer 3

Textbook Question

Chapter 12.3, Problem 35E

Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σx_i = 303.7, Σy_i = 52.8, S_xx = 1585.230769, S_v = −341.959231. and Syy = 77.270769.

a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h.
b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate?
c. Calculate and interpret s.
d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.

a.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

b.

Expert Solution

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

c.

Expert Solution

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

d.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Answer 4

Textbook Question

Chapter 12.3, Problem 35E

Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σx_i = 303.7, Σy_i = 52.8, S_xx = 1585.230769, S_v = −341.959231. and Syy = 77.270769.

a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h.
b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate?
c. Calculate and interpret s.
d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.

a.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

b.

Expert Solution

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

c.

Expert Solution

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

d.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Answer 5

Textbook Question

Answer 6

Textbook Question

Answer 7

a.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

b.

Expert Solution

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

c.

Expert Solution

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

d.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Answer 8

a.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

Answer 9

a.

Expert Solution

Answer 10

a.

Expert Solution

Answer 11

Expert Solution

Answer 12

To determine

Find the interval estimate for the slope of the population regression.

Answer 13

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Answer 14

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, ∑i=1nxi=222.1, ∑i=1nyi=193, ∑i=1nyi2=2,975,∑i=1nxi2=3,056.69 and ∑i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2759.6−222.1×193173,056.69−222.1×222.117=2759.6−2,521.4883,056.69−2,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,975−(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2759.6−222.1×19317)23,056.69−222.1×222.117=(2759.6−2,521.488)23,056.69−2,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=783.8824−365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=418.137617−2=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.283,056.69−222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.536−(2.131×0.424)≤β1≤1.536+(2.131×0.424))=(1.536±0.903544)≃(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632≤β1≤2.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

Answer 15

b.

Expert Solution

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer 16

b.

Expert Solution

Answer 17

b.

Expert Solution

Answer 18

Expert Solution

Answer 19

To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer 20

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer 21

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

H1:β1≠0

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1−β1sβ^1∼t(n−2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n−2=17−2=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1−β1sβ^1=1.536−00.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(test statistic > critical value), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer 22

c.

Expert Solution

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer 23

c.

Expert Solution

Answer 24

c.

Expert Solution

Answer 25

Expert Solution

Answer 26

To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer 27

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer 28

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

The y-intercept of the regression model is obtained as follows:

β^0=∑iyi−β^1∑ixin=193−1.536×222.117=−8.715

Thus, the y-intercept of the regression model is β^0=−8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y⌢=−8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y⌢=−8.715+1.536x=−8.715+1.536×0.5=−7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer 29

d.

Expert Solution

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Answer 30

d.

Expert Solution

Answer 31

d.

Expert Solution

Answer 32

Expert Solution

Answer 33

To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer 34

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719≤β1≤2.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Answer 35

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=17−1=16.

Sum of the variable:

∑i=1nxi=222.1−6=216.1,∑i=1nyi=193−2.50=191.5.

Sum of squares of the variable:

∑i=1nxi2=3,056.69−62=3,020.69,∑i=1nyi2=2,975−2.52=2,968.75,and ∑i=1nxiyi=2,759.6−6×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯−β^1x¯=∑iyi−β^1∑ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[∑ixiyi−(∑ixi)(∑iyi)n]∑ixi2−(∑ixi)2n=2,744.6−216.1×191.5163,020.69−216.1×216.116=2,744.6−2,586.4473,020.69−2,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=∑i(yi−y¯)2 where yi be the i^th observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=∑iyi2−(∑iyi)2n=2,968.75−(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=∑i(y^i−y¯)2 where y^i be the predicted value of the i^th observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[∑ixiyi−(∑ixi)(∑iyi)n]2∑ixi2−(∑ixi)2n=(2,744.6−216.1×191.516)23,020.69−216.1×216.116=(2,744.6−2,586.447)23,020.69−2,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

The general formula to obtain error sum of square is,

SSE=SST−SSR.

The error sum of squares is obtained as,

SSE=SST−SSR=676.7344−245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn−2.

The estimate of error standard deviation is obtained as,

s=SSEn−2=431.489116−2=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=∑i(yi−y^)2 where yi be the predicted value of the i^th observation and y¯ be the predicted value for the i^th observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=∑ixi2−(∑ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=s∑ixi2−(∑ixi)2n=5.5523,020.69−216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n−2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1−α=1−0.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n−2=16−2=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1−(ta/2×sβ^1)≤β1≤β^1+(ta/2×sβ^1)=(1.551−(2.145×0.5497)≤β1≤1.551+(2.145×0.5497))=(1.551±1.1791)≃(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719≤β1≤2.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632≤β1≤2.440_.