Concept explainers
Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σxi = 303.7, Σyi = 52.8, Sxx = 1585.230769, Sv = −341.959231. and Syy = 77.270769.
- a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h.
- b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate?
- c. Calculate and interpret s.
- d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.
a.

Find the interval estimate for the slope of the population regression.
Answer to Problem 35E
The 95% confidence interval for the slope of the population regression is
Explanation of Solution
Given info:
The summary statistics of the data correspond to the variables motion sickness dose
Calculation:
Linear regression model:
In a linear equation
A linear regression model is given as
Y-intercept:
In a linear equation
The general formula to obtain y-intercept is,
Slope:
In a linear equation
The general formula to obtain slope is,
The slope coefficient of the simple linear regression is,
Thus, the point estimate of the slope is
Total sum of square: (SST)
The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is
The total sum of square is obtained as ,
Therefore, the total sum of squares is
Regression sum of square: (SSR)
The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is
The regression sum of squares is obtained as is,
Error sum of square: (SSE)
The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is
The general formula to obtain error sum of square is,
The error sum of squares is obtained as,
Therefore, the error sum of squares is
Estimate of error standard deviation:
The general formula for the estimate of error standard deviation is,
The estimate of error standard deviation is obtained as,
Thus, the estimate of error standard deviation is
Error sum of square: (SSE)
The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is
Estimate of error standard deviation of slope coefficient:
The general formula for the estimate of error standard deviation of slope coefficient is,
The defining formula for
The estimate of error standard deviation of slope coefficient is,
Thus, the estimate of error standard deviation of slope coefficient is
Confidence interval:
The general formula for the confidence interval for the slope of the regression line is,
Where,
Since, the level of confidence is not specified. The prior confidence level 95% can be used.
Critical value:
For 95% confidence level,
Degrees of freedom:
The sample size is
The degrees of freedom is,
From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.
Thus, the critical value is
The 95% confidence interval is,
Thus, the 95% confidence interval for the slope of the population regression is
Interpretation:
There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.
b.

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.
Answer to Problem 35E
There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.
Explanation of Solution
Calculation:
From part (a), the slope coefficient of the regression line is
The test hypotheses are given below:
Null hypothesis:
That is, there is no useful relationship between the variables motion sickness dose
Alternative hypothesis:
That is, there is useful relationship between the variables motion sickness dose
T-test statistic:
The test statistic is,
Degrees of freedom:
The sample size is
The degrees of freedom is,
Thus, the degree of freedom is 15.
Level of significance:
Here, level of significance is not given.
So, the prior level of significance
For the level of significance
From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.
Thus, the critical value is
From part (a), the estimate of error standard deviation of slope coefficient is
Test statistic under null hypothesis:
Under the null hypothesis, the test statistic is obtained as follows:
Thus, the test statistic is 3.6226.
Decision criteria for the classical approach:
If
Conclusion:
Here, the test statistic is 3.6226 and critical value is 2.131.
The t statistic is greater than the critical value.
That is,
Based on the decision rule, the null hypothesis is rejected.
Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.
Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.
c.

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.
Answer to Problem 35E
No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.
Explanation of Solution
Calculation:
Linear regression model:
A linear regression model is given as
Y-intercept:
In a linear equation
The general formula to obtain y-intercept is,
The y-intercept of the regression model is obtained as follows:
Thus, the y-intercept of the regression model is
From part (a), the slope coefficient of the regression line is
Therefore, the regression equation of the variables motion sickness dose
Predicted value of % reported nausea when the motion sickness dose is 5.0:
The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:
Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.
Here, the % reported nausea is resulted as a negative value, which is not possible in reality.
Thus, the predicted value is a flaw.
Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.
The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.
Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.
Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.
d.

Find the interval estimate for the slope of the population regression after eliminating the observation
Comment whether the observation
Answer to Problem 35E
The 95% confidence interval for the slope of the population regression after eliminating the observation
Yes, the observation
Explanation of Solution
Calculation:
Linear regression model:
In a linear equation
A linear regression model is given as
Here, the observation
That is, the value 6.0 has to be removed from the variable motion sickness dose
The results of the summary statistics after eliminating the observation
Sample size:
Sum of the variable:
Sum of squares of the variable:
Y-intercept:
In a linear equation
The general formula to obtain y-intercept is,
Slope:
In a linear equation
The general formula to obtain slope is,
The slope coefficient of the simple linear regression is,
Thus, the point estimate of the slope is
Total sum of square: (SST)
The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is
The total sum of square is obtained as ,
Therefore, the total sum of squares is
Regression sum of square: (SSR)
The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is
The regression sum of squares is obtained as is,
Error sum of square: (SSE)
The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is
The general formula to obtain error sum of square is,
The error sum of squares is obtained as,
Therefore, the error sum of squares is
Estimate of error standard deviation:
The general formula for the estimate of error standard deviation is,
The estimate of error standard deviation is obtained as,
Thus, the estimate of error standard deviation is
Error sum of square: (SSE)
The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is
Estimate of error standard deviation of slope coefficient:
The general formula for the estimate of error standard deviation of slope coefficient is,
The defining formula for
The estimate of error standard deviation of slope coefficient is,
Thus, the estimate of error standard deviation of slope coefficient is
Confidence interval:
The general formula for the confidence interval for the slope of the regression line is,
Where,
Since, the level of confidence is not specified. The prior confidence level 95% can be used.
Critical value:
For 95% confidence level,
Degrees of freedom:
The sample size is
The degrees of freedom is,
From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.
Thus, the critical value is
The 95% confidence interval is,
Thus, the 95% confidence interval for the slope of the population regression is
Interpretation:
There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.
Comparison:
The 95% confidence interval for the slope of the population regression with the observation
The 95% confidence interval for the slope of the population regression after eliminating the observation
Here, by observing both the intervals it is clear that the
Want to see more full solutions like this?
Chapter 12 Solutions
PROBABILITY & STATS FOR ENGINEERING &SCI
- Question 1 The data shown in Table 1 are and R values for 24 samples of size n = 5 taken from a process producing bearings. The measurements are made on the inside diameter of the bearing, with only the last three decimals recorded (i.e., 34.5 should be 0.50345). Table 1: Bearing Diameter Data Sample Number I R Sample Number I R 1 34.5 3 13 35.4 8 2 34.2 4 14 34.0 6 3 31.6 4 15 37.1 5 4 31.5 4 16 34.9 7 5 35.0 5 17 33.5 4 6 34.1 6 18 31.7 3 7 32.6 4 19 34.0 8 8 33.8 3 20 35.1 9 34.8 7 21 33.7 2 10 33.6 8 22 32.8 1 11 31.9 3 23 33.5 3 12 38.6 9 24 34.2 2 (a) Set up and R charts on this process. Does the process seem to be in statistical control? If necessary, revise the trial control limits. [15 pts] (b) If specifications on this diameter are 0.5030±0.0010, find the percentage of nonconforming bearings pro- duced by this process. Assume that diameter is normally distributed. [10 pts] 1arrow_forward4. (5 pts) Conduct a chi-square contingency test (test of independence) to assess whether there is an association between the behavior of the elderly person (did not stop to talk, did stop to talk) and their likelihood of falling. Below, please state your null and alternative hypotheses, calculate your expected values and write them in the table, compute the test statistic, test the null by comparing your test statistic to the critical value in Table A (p. 713-714) of your textbook and/or estimating the P-value, and provide your conclusions in written form. Make sure to show your work. Did not stop walking to talk Stopped walking to talk Suffered a fall 12 11 Totals 23 Did not suffer a fall | 2 Totals 35 37 14 46 60 Tarrow_forwardQuestion 2 Parts manufactured by an injection molding process are subjected to a compressive strength test. Twenty samples of five parts each are collected, and the compressive strengths (in psi) are shown in Table 2. Table 2: Strength Data for Question 2 Sample Number x1 x2 23 x4 x5 R 1 83.0 2 88.6 78.3 78.8 3 85.7 75.8 84.3 81.2 78.7 75.7 77.0 71.0 84.2 81.0 79.1 7.3 80.2 17.6 75.2 80.4 10.4 4 80.8 74.4 82.5 74.1 75.7 77.5 8.4 5 83.4 78.4 82.6 78.2 78.9 80.3 5.2 File Preview 6 75.3 79.9 87.3 89.7 81.8 82.8 14.5 7 74.5 78.0 80.8 73.4 79.7 77.3 7.4 8 79.2 84.4 81.5 86.0 74.5 81.1 11.4 9 80.5 86.2 76.2 64.1 80.2 81.4 9.9 10 75.7 75.2 71.1 82.1 74.3 75.7 10.9 11 80.0 81.5 78.4 73.8 78.1 78.4 7.7 12 80.6 81.8 79.3 73.8 81.7 79.4 8.0 13 82.7 81.3 79.1 82.0 79.5 80.9 3.6 14 79.2 74.9 78.6 77.7 75.3 77.1 4.3 15 85.5 82.1 82.8 73.4 71.7 79.1 13.8 16 78.8 79.6 80.2 79.1 80.8 79.7 2.0 17 82.1 78.2 18 84.5 76.9 75.5 83.5 81.2 19 79.0 77.8 20 84.5 73.1 78.2 82.1 79.2 81.1 7.6 81.2 84.4 81.6 80.8…arrow_forward
- Name: Lab Time: Quiz 7 & 8 (Take Home) - due Wednesday, Feb. 26 Contingency Analysis (Ch. 9) In lab 5, part 3, you will create a mosaic plot and conducted a chi-square contingency test to evaluate whether elderly patients who did not stop walking to talk (vs. those who did stop) were more likely to suffer a fall in the next six months. I have tabulated the data below. Answer the questions below. Please show your calculations on this or a separate sheet. Did not stop walking to talk Stopped walking to talk Totals Suffered a fall Did not suffer a fall Totals 12 11 23 2 35 37 14 14 46 60 Quiz 7: 1. (2 pts) Compute the odds of falling for each group. Compute the odds ratio for those who did not stop walking vs. those who did stop walking. Interpret your result verbally.arrow_forwardSolve please and thank you!arrow_forward7. In a 2011 article, M. Radelet and G. Pierce reported a logistic prediction equation for the death penalty verdicts in North Carolina. Let Y denote whether a subject convicted of murder received the death penalty (1=yes), for the defendant's race h (h1, black; h = 2, white), victim's race i (i = 1, black; i = 2, white), and number of additional factors j (j = 0, 1, 2). For the model logit[P(Y = 1)] = a + ß₁₂ + By + B²², they reported = -5.26, D â BD = 0, BD = 0.17, BY = 0, BY = 0.91, B = 0, B = 2.02, B = 3.98. (a) Estimate the probability of receiving the death penalty for the group most likely to receive it. [4 pts] (b) If, instead, parameters used constraints 3D = BY = 35 = 0, report the esti- mates. [3 pts] h (c) If, instead, parameters used constraints Σ₁ = Σ₁ BY = Σ; B = 0, report the estimates. [3 pts] Hint the probabilities, odds and odds ratios do not change with constraints.arrow_forward
- Solve please and thank you!arrow_forwardSolve please and thank you!arrow_forwardQuestion 1:We want to evaluate the impact on the monetary economy for a company of two types of strategy (competitive strategy, cooperative strategy) adopted by buyers.Competitive strategy: strategy characterized by firm behavior aimed at obtaining concessions from the buyer.Cooperative strategy: a strategy based on a problem-solving negotiating attitude, with a high level of trust and cooperation.A random sample of 17 buyers took part in a negotiation experiment in which 9 buyers adopted the competitive strategy, and the other 8 the cooperative strategy. The savings obtained for each group of buyers are presented in the pdf that i sent: For this problem, we assume that the samples are random and come from two normal populations of unknown but equal variances.According to the theory, the average saving of buyers adopting a competitive strategy will be lower than that of buyers adopting a cooperative strategy.a) Specify the population identifications and the hypotheses H0 and H1…arrow_forward
- You assume that the annual incomes for certain workers are normal with a mean of $28,500 and a standard deviation of $2,400. What’s the chance that a randomly selected employee makes more than $30,000?What’s the chance that 36 randomly selected employees make more than $30,000, on average?arrow_forwardWhat’s the chance that a fair coin comes up heads more than 60 times when you toss it 100 times?arrow_forwardSuppose that you have a normal population of quiz scores with mean 40 and standard deviation 10. Select a random sample of 40. What’s the chance that the mean of the quiz scores won’t exceed 45?Select one individual from the population. What’s the chance that his/her quiz score won’t exceed 45?arrow_forward
- Functions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage LearningAlgebra & Trigonometry with Analytic GeometryAlgebraISBN:9781133382119Author:SwokowskiPublisher:Cengage

