a.
Test whether the given model is useful or not at the 0.05 level of significance.
a.
Answer to Problem 46E
There is convincing evidence that the given model is useful at the 0.05 level of significance.
Explanation of Solution
Calculation:
It is given that the variable y is absorption,
1.
The model is
2.
Null hypothesis:
That is, there is no useful relationship between y and any of the predictors.
3.
Alternative hypothesis:
That is, there is a useful relationship between y and any of the predictors.
4.
Here, the significance level is
5.
Test statistic:
Here, n is the sample size, and k is the number of variables in the model.
6.
Assumptions:
Since there is no availability of original data to check the assumptions, there is a need to assume that the variables are related to the model, and the random deviation is distributed normally with mean 0 and the fixed standard deviation.
7.
Calculation:
From the MINITAB output, the value of
The value of F-test statistic is calculated as follows:
8.
P-value:
Software procedure:
Step-by-step procedure to find the P-value using the MINITAB software:
- Choose Graph > Probability Distribution Plot choose View Probability > OK.
- From Distribution, choose ‘F’ distribution.
- Enter the Numerator df as 2 and Denominator df as 25.
- Click the Shaded Area tab.
- Choose X Value and Right Tail for the region of the curve to shade.
- Enter the X value as 334.722.
- Click OK.
Output obtained using the MINITAB software is represented as follows:
From the MINITAB output, the P-value is 0.
9.
Conclusion:
If the
Therefore, the P-value of 0 is less than the 0.05 level of significance.
Hence, reject the null hypothesis.
Thus, there is convincing evidence that the given model is useful at the 0.05 level of significance.
b.
Calculate a 95% confidence interval for
b.
Answer to Problem 46E
The 95% confidence interval for
Explanation of Solution
Calculation:
Here,
Since there is no availability of original data to check the assumptions, there is a need to assume that the variables are related to the model, and the random deviation is distributed normally with mean 0 and the fixed standard deviation.
The formula for confidence interval for
Where,
Degrees of freedom:
Software procedure:
Step-by-step procedure to find the P-value using the MINITAB software:
- Choose Graph > Probability Distribution Plot choose View Probability > OK.
- From Distribution, choose ‘t’ distribution.
- Enter the Degrees of freedom as 25.
- Click the Shaded Area tab.
- Choose Probability and Both for the region of the curve to shade.
- Enter the Probability value as 0.05.
- Click OK.
Output obtained using the MINITAB software is represented as follows:
From the MINITAB output, the critical value is 2.060.
From the given MINITAB output, the value of
The confidence interval for mean change in exam score is calculated as follows:
Thus, the 95% confidence interval for
There is 95% confident that the average increase in y is associated with 1-unit increase in starch damage that is between 0.298 and 0.373, when the other predictors are fixed.
c.
- i. Test the hypothesis
H 0 : β 1 = 0 versus H a : β 1 ≠ 0 .
- ii. Test the hypothesis
H 0 : β 2 = 0 versus H a : β 2 ≠ 0
c.
Answer to Problem 46E
It can be concluded that the quadratic term should not be eliminated from the model and simple linear model should not be sufficient.
Explanation of Solution
Calculation:
i.
1.
The predictor
2.
Null hypothesis:
3.
Alternative hypothesis:
4.
Here, the common significance levels are
5.
Test statistic:
Here,
6.
Assumptions:
The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.
7.
Calculation:
From the MINITAB output, the t test statistic value of
8.
P-value:
From the MINITAB output, the P-value for
9.
Conclusion:
If the
Therefore, the P-value of 0 is less than any common levels of significance, such as 0.05, 0.01, and 0.10.
Hence, reject the null hypothesis.
Thus, there is convincing evidence that the flour protein variable is important and it is
ii.
1.
The predictor
2.
Null hypothesis:
3.
Alternative hypothesis:
4.
Here, the common significance levels are
5.
Test statistic:
Here,
6.
Assumptions:
The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.
7.
Calculation:
From the MINITAB output, the t test statistic value of
8.
P-value:
From the MINITAB output, the P-value for
9.
Conclusion:
If the
Therefore, the P-value of 0 is less than any common levels of significance, such as 0.05, 0.01, and 0.10.
Hence, reject the null hypothesis.
Thus, there is convincing evidence that the starch damage variable is important and it is
d.
Explain whether both the independent variables are important or not.
d.
Explanation of Solution
From the results of Part (c), there is evidence that the variables of flour protein and starch damage are important.
e.
Calculate a 90% confidence interval and interpret it.
e.
Answer to Problem 46E
The 90% confidence interval is
Explanation of Solution
Calculation:
It is given that the value of
Assume that the exam score is associated with the predictors according to the model. The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.
The formula for prediction interval for mean y value is as follows:
The value of
Degrees of freedom:
Software procedure:
Step-by-step procedure to find the P-value using the MINITAB software:
- Choose Graph > Probability Distribution Plot choose View Probability > OK.
- From Distribution, choose ‘t’ distribution.
- Enter the Degrees of freedom as 25.
- Click the Shaded Area tab.
- Choose Probability and Both for the region of the curve to shade.
- Enter the Probability value as 0.10.
- Click OK.
Output obtained using the MINITAB software is represented as follows:
From the MINITAB output, the critical value is 1.708.
The prediction interval for mean y value is calculated as follows:
Thus, the 90% confidence interval is
There is 90% confident that the mean water absorption for wheat with 11.7 flour protein and 57 starch damage is between 54.5084 and 56.2916.
f.
Predict the water absorption for the shipment by 90% interval.
f.
Answer to Problem 46E
The water absorption values for the particular shipment are between 53.3297 and 57.4703 by 90% prediction interval.
Explanation of Solution
Calculation:
It is given that the value of
Assume that the exam score is associated with the predictors according to the model. The random deviations from the values by the population regression equation are distributed normally with mean 0 and fixed standard deviation.
The formula for prediction interval for mean y value is as follows:
From the given MINITAB output, the standard deviation is 1.094.
From Part e., the value of
From the MINITAB output in Part e., the critical value is 1.708.
The prediction interval for water absorption for shipment is calculated as follows:
Thus, the 90% prediction interval is
By prediction at 90% interval, the water absorption values for the particular shipment are between 53.3297 and 57.4703.
Want to see more full solutions like this?
Chapter 14 Solutions
Introduction To Statistics And Data Analysis
- 2arrow_forward14.34 Gutsell (1951) measured hemoglobin in the blood of brown trout after treat- ment with four rates of sulfamerazine. Two methods of administering the sul- famerazine were used. Ten fish were measured for each rate and each method. The data are given in Table 14.7. Test for effect of rate and method and interaction. TABLE 14.7 Hemoglobin Concentration (g/mL) in Blood of Brown Trout Rate: 1 3 Method: A 6.7 7.8 5.5 8.4 7.0 7.8 8.6 7.4 5.8 7.0 278 B 7.0 7.8 6.8 7.0 7.5 6.5 5.8 7.1 6.5 5.5 A 9.9 8.4 10.4 9.3 10.7 11.9 7.1 6.4 8.6 10.6 2 B 9.9 9.6 10.2 10.4 11.3 9.1 9.0 10.6 11.7 9.6 A 10.4 8.1 10.6 8.7 10.7 9.1 8.8 8.1 7.8 8.0 B 9.9 9.6 10.4 10.4 11.3 10.9 8.0 10.2 6.1 10.7 A 9.3 9.3 7.8 7.8 9.3 10.2 8.7 8.6 9.3 7.2 4 B 11.0 9.3 11.0 9.0 8.4 8.4 6.8 7.2 8.1 11.0 "After 35 days of treatment the daily rates of 0, 5, 10, and 15g of sulfamerazine per 100 lb of fish employing two methods for each rate.arrow_forwardOne is interested in the ceteris paribus relationship between the dependent variable y, and the explanatory variable ₁1. For this, one collects data on two control variables Xi2 and Xiz and runs two LS regressions. Regression 1: yi = B₁x₁1 + Ui Regression 2: y = B₁xil+ B₂x₁2 + B3x13 + Ei Let ₁ denote the LS estimate of ₁ of Regression 1 and let ₁ denote the LS estimate of ₁ in Regression 2. Assume that the true model is given by Regression 2 and that all variables are centered. a) Would you expect a difference between ₁ and ₁ when is highly correlated with 2 and 3 and the partial effects of xi2 and xi3 on yi are also high? b) Would you expect a difference between ₁ and 3₁ when ₁ has almost no correlation with and Xi3 but X2 and 3 are highly correlated? Xi2 c) Which of the two estimators is more efficient if x₁1 is highly correlated with 2 and 13, and Xiz have small partial effects on yi? but i2 d) Which of the two estimators is more efficient if x₁1 is almost uncorrelated with 2 and…arrow_forward
- A) In the screenshot i provided B) Does either explanatory variable improve the fit of the model that uses the other? Use a test statistic for each. Use α=0.05.arrow_forward11) A simple linear regression model based on 20 observations. The F-stat for the model is 21.44 and the SSE is 1.41. The standard error for the coefficient of X is 0.2. a) Complete the ANOVA table. b) Find the t-stat of the co-efficient of X c) Find the co-efficient of X.arrow_forward1.Consider the following two simple regression models : Model I : Y, = B, + B,X, + µ; : Y, = a, +a, (X, - )+v, Model II (1) What is the estimator for B1? What is the estimator for a1? Are they the same to each other? Do they have the same variance? (2) What is the estimator for B2? What is the estimator for a2? Are they the same to each other? Do they have the same variance?arrow_forward
- 9) For the same set of observations on a specified dependent variable, two different independent variables were used to develop two separate simple linear regression models. A portion of the results is presented below. R² (R-squared) Model 1 0.92 Based on R2 of each model, which model is more useful? Why? Model 2 0.85arrow_forwardPLS ANSWERarrow_forwardYou have estimated a multiple regression model with 6 explanatory variables and an intercept from a sample with 46 observations. What is the critical value of the test statistic (tc) if you want to perform a test for the significance of a single right-hand side (explanatory) variable at α = 0.05? a.) 2.023 b.) 2.708 c.) 2.423 d.) 2.704arrow_forward
- The spotted lanternfly, Lycorma delicatula, is an invasive species to the United States that has the potential to do significant agricultural damage. An ecologist is studying the relationship between the size of the female spotted lanternflies and the number of eggs they produce. The data are summarized below. Length of insect: AVG = 1 inch, SD = 0.15 inchNumber of eggs: AVG = 40 eggs, SD = 5 eggsr = 0.2 Using regression, we can say that the average number of eggs produced by female spotted lanternflies who are 1.1 inch long is closest to... Group of answer choices 42 39 40 41arrow_forwards) Use the Durbin-Watson test to test, at 5% level of significance, for the presence of significant linear correlation among error terms in the following cases: a) Given that the sample size is n=20, the Durbin Watson (DW) statistic 0.62, when the regression model is of the form E(Y) = Bo + B,X b) Given that the sample size is #=35, the Durbin Watson (DW) statistic = 1.82, when the regression model is of the form E(Y) = Bo + B,X, + B,X2 . c) Given that the sample size is n-40, the Durbin Watson (DW) statistic =2.78, when the regression model is of the form E(Y) = Bo + BzX, + B2X2 + B,X3 +B,X4 Table Criical value for the Durbin-Watson d Statistic (a = 05)arrow_forwardBefore surgical removal of a diseased parathyroid gland, two tests are often performed: the standard intact test and the turbo test. Both tests measure parathyroid hormone (PTH, in ng/l), but the turbo test is very expensive. Researchers obtained data from both tests in a sample of 48 patients to predict turbo test results (y) from standard intact test results (x). The data ranged from roughly O to 500 ng/l, and a scatterplot showed a clear linear relationship. The published findings given are summarized exactly: y = 1.08x-4.36(r = 0.97; n = 48) What is the slope of the regression line? 48 -4.36 1.08 0.97arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman