Concept explainers
In a simulation of 30 mobile computer networks, the average speed, pause time, and number of neighbor were measured. A “neighbor” is a computer within the transmission
- a. Fit the model with Neighbors as the dependent variable, and independent variables Speed, Pause, Speed,·Pause, Speed2, and Pause2.
- b. Construct a reduced model by dropping any variables whose P-values are large, and test the plausibility of the model with an F test.
- c. Plot the residuals versus the fitted values for the reduced model. Are there any indications that the model is inappropriate? If so, what are they?
- d. Someone suggests that a model containing Pause and Pause2 as the only dependent variables is adequate. Do you agree? Why or why not?
- e. Using a best subsets software package, find the two models with the highest R2 value for each model size from one to five variables. Compute Cp and adjusted R2 for each model.
- f. Which model is selected by minimum Cp? By adjusted R2? Are they the same?
a.

Construct a multiple linear regression model with neighbor as the dependent variable, speed, pause,
Answer to Problem 5SE
A multiple linear regression model for the given data is:
Explanation of Solution
Calculation:
The data represents the values of the variables number of neighbors, average speed and pause time for a simulation of 30 mobile network computers.
Multiple linear regression model:
A multiple linear regression model is given as
Let
Regression:
Software procedure:
Step by step procedure to obtain regression using MINITAB software is given as,
- Choose Stat > Regression > General Regression.
- In Response, enter the numeric column containing the response data Y.
- In Model, enter the numeric column containing the predictor variables X1, X2, X1*X2, X1*X1 and X2*X2.
- Click OK.
Output obtained from MINITAB is given below:
The ‘Coefficient’ column of the regression analysis MINITAB output gives the slopes corresponding to the respective variables stored in the column ‘Term’.
A careful inspection of the output shows that the fitted model is:
Hence, the multiple linear regression model for the given data is:
b.

Construct a reduced model by dropping the variables with large P- values.
Check whether the reduced model is plausible or not.
Answer to Problem 5SE
A multiple linear regression model for the given data is:
Yes, there is enough evidence to conclude that the reduced model is plausible.
Explanation of Solution
Calculation:
From part (a), it can be seen that the ‘P’ column of the regression analysis MINITAB output gives the slopes corresponding to the respective variables stored in the column ‘Term’.
By observing the P- values of the MINITAB output, it is clear that the largest P-value is 0.390 corresponding to the predictor variable
Now, the new regression has to be fitted after dropping the predictor variable
Regression:
Software procedure:
Step by step procedure to obtain regression using MINITAB software is given as,
- Choose Stat > Regression > General Regression.
- In Response, enter the numeric column containing the response data Y.
- In Model, enter the numeric column containing the predictor variables X1, X2, X1*X1 and X2*X2.
- Click OK.
Output obtained from MINITAB is given below:
The ‘Coefficient’ column of the regression analysis MINITAB output gives the slopes corresponding to the respective variables stored in the column ‘Term’.
A careful inspection of the output shows that the fitted model is:
Hence, the multiple linear regression model for the given data is:
The full model is,
The reduced model is,
The test hypotheses are given below:
Null hypothesis:
That is, the dropped predictor of the full model is not significant to predict y.
Alternative hypothesis:
That is, the dropped predictor of the full model is significant to predict y.
Test statistic:
Where,
n represents the total number of observations.
p represents the number of predictors on the full model.
k represents the number of predictors on the reduced model.
From the obtained MINITAB outputs, the value of error sum of squares for full model is
The total number of observations is
Number of predictors on the full model is
Degrees of freedom of F-statistic for reduced model:
In a reduced multiple linear regression analysis, the F-statistic is
In the ratio, the numerator is obtained by dividing the quantity
Thus, the degrees of freedom for the F-statistic in a reduced multiple regression analysis are
Hence, the numerator degrees of freedom is
Test statistic under null hypothesis:
Under the null hypothesis, the test statistic is obtained as follows:
Thus, the test statistic is
Since, the level of significance is not specified. The prior level of significance
P-value:
Software procedure:
- Choose Graph > Probability Distribution Plot choose View Probability > OK.
- From Distribution, choose F, enter 1 in numerator df and 24 in denominator df.
- Click the Shaded Area tab.
- Choose X-Value and Right Tail for the region of the curve to shade.
- Enter the X-value as 0.76638.
- Click OK.
Output obtained from MINITAB is given below:
From the output, the P- value is 0.39.
Thus, the P- value is 0.39.
Decision criteria based on P-value approach:
If
If
Conclusion:
The P-value is 0.39 and
Here, P-value is greater than the
That is
By the rejection rule, fail to reject the null hypothesis.
Hence, there is sufficient evidence to conclude that the dropped predictor variable is not significant to predict the response variable y.
Thus, the reduced model is useful than the full model to predict the response variable y.
c.

Plot the residuals versus fitted line plot for the reduced model.
Check whether the model is appropriate.
Answer to Problem 5SE
Residual plot:
Yes, the model seems to be appropriate.
Explanation of Solution
Calculation:
Residual plot:
Software procedure:
Step by step procedure to obtain regression using MINITAB software is given as,
- Choose Stat > Regression > General Regression.
- In Response, enter the numeric column containing the response data Y.
- In Model, enter the numeric column containing the predictor variables X1, X2, X1*X1 and X2*X2.
- In Graphs, Under Residuals for plots, select Regular.
- Under Residual plots select box Residuals versus fits.
- Click OK.
Conditions for the appropriateness of regression model using the residual plot:
- The plot of the residuals vs. fitted values should fall roughly in a horizontal band contended and symmetric about x-axis. That is, the residuals of the data should not represent any bend.
- The plot of residuals should not contain any outliers.
- The residuals have to be scattered randomly around “0” with constant variability among for all the residuals. That is, the spread should be consistent.
Interpretation:
In residual plot there is high bend or pattern, which can violate the straight line condition and there is change in the spread of the residuals from one part to another part of the plot.
However, it is difficult to determine about the violation of the assumptions without the data.
Thus, the model seems to be appropriate.
d.

Check whether the model with only two dependent variables
Answer to Problem 5SE
No, the model with only two dependent variables
Explanation of Solution
Calculation:
Regression:
Software procedure:
Step by step procedure to obtain regression using MINITAB software is given as,
- Choose Stat > Regression > General Regression.
- In Response, enter the numeric column containing the response data Y.
- In Model, enter the numeric column containing the predictor variables X2 and X2*X2.
- Click OK.
Output obtained from MINITAB is given below:
The ‘Coefficient’ column of the regression analysis MINITAB output gives the slopes corresponding to the respective variables stored in the column ‘Term’.
A careful inspection of the output shows that the fitted model is:
Hence, the multiple linear regression model for the given data is:
The full model is,
The reduced model is,
The test hypotheses are given below:
Null hypothesis:
That is, the dropped predictors of the full model are not significant to predict y.
Alternative hypothesis:
That is, at least one of the dropped predictors of the full model are significant to predict y.
Test statistic:
Where,
n represents the total number of observations.
p represents the number of predictors on the full model.
k represents the number of predictors on the reduced model.
From the obtained MINITAB outputs, the value of error sum of squares for full model is
The total number of observations is
Number of predictors on the full model is
Degrees of freedom of F-statistic for reduced model:
In a reduced multiple linear regression analysis, the F-statistic is
In the ratio, the numerator is obtained by dividing the quantity
Thus, the degrees of freedom for the F-statistic in a reduced multiple regression analysis are
Hence, the numerator degrees of freedom is
Test statistic under null hypothesis:
Under the null hypothesis, the test statistic is obtained as follows:
Thus, the test statistic is
Since, the level of significance is not specified. The prior level of significance
P-value:
Software procedure:
- Choose Graph > Probability Distribution Plot choose View Probability > OK.
- From Distribution, choose F, enter 3 in numerator df and 24 in denominator df.
- Click the Shaded Area tab.
- Choose X-Value and Right Tail for the region of the curve to shade.
- Enter the X-value as 15.702.
- Click OK.
Output obtained from MINITAB is given below:
From the output, the P- value is
Thus, the P- value is
Decision criteria based on P-value approach:
If
If
Conclusion:
The P-value is
Here, P-value is less than the
That is
By the rejection rule, reject the null hypothesis.
Hence, there is sufficient evidence to conclude that at least one of the dropped predictors of the full model are significant to predict y.
Thus, the model with only two dependent variables
e.

Find the two models with the highest
Obtain the values of mallows
Answer to Problem 5SE
The two models with the highest
First model with
The values of M Mallows’
Predictor variables | Mallows’ | Adjusted |
92.5 | 60.1 | |
97 | 58.6 | |
47.1 | 75.2 | |
53.3 | 73 | |
7.9 | 89.2 | |
15.5 | 86.4 | |
4.8 | 90.7 | |
9.2 | 89 | |
6 | 90.6 |
Explanation of Solution
Calculation:
Coefficient of multiple determination
The coefficient of multiple determination,
The subset with larger
Regression:
Software procedure:
Step by step procedure to obtain regression using MINITAB software is given as,
- Choose Stat > Regression > Regression> Best subsets.
- In Response, enter the numeric column containing the response data Y.
- In Model, enter the numeric column containing the predictor variables X1, X2, X1*X2, X1*X1 and X2*X2.
- Click OK.
Output obtained from MINITAB is given below:
For the one predictor case, the highest value of
For the two predictor case, the highest value of
For the three predictor case, the highest value of
For the four predictor case, the highest value of
For the five predictor case, the value of
The value of
Thus, depending upon the factors affecting the analysis it would be most preferable to use the regression equation corresponding to the predictors
The second highest value of
That is, 90.6 and 90.3 are not much distinct.
Therefore, the model with
Thus, the two best models are:
First model with
From the accompanying MINITAB output, the values of Mallows’
Predictor variables | Mallows’ | Adjusted |
92.5 | 60.1 | |
97 | 58.6 | |
47.1 | 75.2 | |
53.3 | 73 | |
7.9 | 89.2 | |
15.5 | 86.4 | |
4.8 | 90.7 | |
9.2 | 89 | |
6 | 90.6 |
f.

Select the variables for the model, using the Mallows’
Check whether both the models are same.
Answer to Problem 5SE
The variables for the model using the Mallows’
The variables for the model using the adjusted-
Yes, both the models are same.
Explanation of Solution
Mallows’
An important utility of the Mallows’
Mallows’
The predictor with the lowest value of
From part (e), the values of Mallows’
Predictor variables | Mallows’ | Adjusted |
92.5 | 60.1 | |
97 | 58.6 | |
47.1 | 75.2 | |
53.3 | 73 | |
7.9 | 89.2 | |
15.5 | 86.4 | |
4.8 | 90.7 | |
9.2 | 89 | |
6 | 90.6 |
For the one predictor case, the lowest value of
For the two predictor case, the lowest value of
For the three predictor case, the lowest value of
For the four predictor case, the lowest value of
For the five predictor case, the value of
The value of
Thus, depending upon the factors affecting the analysis it would be most preferable to use the regression equation corresponding to the predictors
Hence, the variables for the model using the Mallows’
Adjusted
An important utility of the adjusted coefficient of multiple determination or
The adjusted coefficient of multiple determination,
For the one predictor case, the highest value of
For the two predictor case, the highest value of
For the three predictor case, the highest value of
For the four predictor case, the highest value of
For the five predictor case, the value of
The value of adjusted
Thus, provided other factors do not affect the analysis it could be most preferable to use the regression equation corresponding to the predictors,
Hence, the variables for the model using the adjusted-
Both Mallows’
Want to see more full solutions like this?
Chapter 8 Solutions
EBK STATISTICS FOR ENGINEERS AND SCIENT
Additional Math Textbook Solutions
Math in Our World
Elementary Statistics ( 3rd International Edition ) Isbn:9781260092561
APPLIED STAT.IN BUS.+ECONOMICS
Introductory Statistics
Elementary Statistics: Picturing the World (7th Edition)
Mathematics for the Trades: A Guided Approach (11th Edition) (What's New in Trade Math)
- A survey of 581 citizens found that 313 of them favor a new bill introduced by the city. We want to find a 95% confidence interval for the true proportion of the population who favor the bill. What is the lower limit of the interval? Enter the result as a decimal rounded to 3 decimal digits. Your Answer:arrow_forwardLet X be a continuous RV with PDF where a > 0 and 0 > 0 are parameters. verify that f-∞ /x (x)dx = 1. Find the CDF, Fx (7), of X.arrow_forward6. [20] Let X be a continuous RV with PDF 2(1), 1≤x≤2 fx(x) = 0, otherwisearrow_forward
- A survey of 581 citizens found that 313 of them favor a new bill introduced by the city. We want to find a 95% confidence interval for the true proportion of the population who favor the bill. What is the lower limit of the interval? Enter the result as a decimal rounded to 3 decimal digits. Your Answer:arrow_forwardA survey of 581 citizens found that 313 of them favor a new bill introduced by the city. We want to find a 95% confidence interval for the true proportion of the population who favor the bill. What is the lower limit of the interval? Enter the result as a decimal rounded to 3 decimal digits. Your Answer:arrow_forward2. The SMSA data consisting of 141 observations on 10 variables is fitted by the model below: 1 y = Bo+B1x4 + ẞ2x6 + ẞ3x8 + √1X4X8 + V2X6X8 + €. See Question 2, Tutorial 3 for the meaning of the variables in the above model. The following results are obtained: Estimate Std. Error t value Pr(>|t|) (Intercept) 1.302e+03 4.320e+02 3.015 0.00307 x4 x6 x8 x4:x8 x6:x8 -1.442e+02 2.056e+01 -7.013 1.02e-10 6.340e-01 6.099e+00 0.104 0.91737 -9.455e-02 5.802e-02 -1.630 0.10550 2.882e-02 2.589e-03 11.132 1.673e-03 7.215e-04 2.319 F) x4 1 3486722 3486722 17.9286 4.214e-05 x6 1 14595537 x8 x4:x8 x6:x8 1 132.4836 < 2.2e-16 1045693 194478 5.3769 0.02191 1 1198603043 1198603043 6163.1900 < 2.2e-16 1 25765100 25765100 1045693 Residuals 135 26254490 Estimated variance matrix (Intercept) x4 x6 x8 x4:x8 x6:x8 (Intercept) x4 x6 x8 x4:x8 x6:x8 0.18875694 1.866030e+05 -5.931735e+03 -2.322825e+03 -16.25142055 0.57188953 -5.931735e+03 4.228816e+02 3.160915e+01 0.61621781 -0.03608028 -0.00445013 -2.322825e+03…arrow_forward
- In some applications the distribution of a discrete RV, X resembles the Poisson distribution except that 0 is not a possible value of X. Consider such a RV with PMF where 1 > 0 is a parameter, and c is a constant. (a) Find the expression of c in terms of 1. (b) Find E(X). (Hint: You can use the fact that, if Y ~ Poisson(1), the E(Y) = 1.)arrow_forwardSuppose that X ~Bin(n,p). Show that E[(1 - p)] = (1-p²)".arrow_forwardI need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forward
- I need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forwardThis exercise is based on the following data on four bodybuilding supplements. (Figures shown correspond to a single serving.) Creatine(grams) L-Glutamine(grams) BCAAs(grams) Cost($) Xtend(SciVation) 0 2.5 7 1.00 Gainz(MP Hardcore) 2 3 6 1.10 Strongevity(Bill Phillips) 2.5 1 0 1.20 Muscle Physique(EAS) 2 2 0 1.00 Your personal trainer suggests that you supplement with at least 10 grams of creatine, 39 grams of L-glutamine, and 90 grams of BCAAs each week. You are thinking of combining Xtend and Gainz to provide you with the required nutrients. How many servings of each should you combine to obtain a week's supply that meets your trainer's specifications at the least cost? (If an answer does not exist, enter DNE.) servings of xtend servings of gainzarrow_forwardI need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forward
- Functions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage LearningHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin Harcourt


