Concept explainers
a.
Check whether a linear model is appropriate for the data using the
a.
Answer to Problem 48E
Output using MINITAB software is given below:
Yes, a simple linear model is appropriate for the data.
Explanation of Solution
Given info:
The data represents the values of the variables % total suspended solids removed
Justification:
Software Procedure:
Step by step procedure to obtain scatterplot using MINITAB software is given as,
- Choose Graph > Scatter plot.
- Choose Simple, and then click OK.
- Under Y variables, enter a column of % Total suspended solids removed.
- Under X variables, enter a column of Amount filtered.
- Click Ok.
Observation:
From the scatterplot it is clear that, as the values of amount filtered increases the values of % total suspended solids removed decreases linearly. Thus, there is a negative association between the variables amount filtered and % total suspended solids removed.
Appropriateness of regression linear model:
The conditions for a scatterplot that is well fitted for the data are,
- Straight Enough Condition: The relationship between y and x straight enough to proceed with a linear regression model.
- Outlier Condition: No outlier must be there which influences the fit of the least square line.
- Thickness Condition: The spread of the data around the generally straight relationship seem to be consistent for all values of x.
The scatterplot shows a fair enough linear relationship between the variables amount filtered and % total suspended solids removed. The spread of the data seem to roughly consistent.
Moreover, the scatterplot does not show any outliers.
Therefore, all the three conditions of appropriateness of simple linear model are satisfied.
Thus, a linear model is appropriate for the data.
b.
Find the regression line for the variables % total suspended solids removed
b.
Answer to Problem 48E
The regression line for the variables % total suspended solids removed
Explanation of Solution
Calculation:
Linear regression model:
A linear regression model is given as
A linear regression model is given as
In the given problem the % of total suspended solids remove is the response variable y and the amount filtered is the predictor variable x
Regression:
Software procedure:
Step by step procedure to obtain regression equation using MINITAB software is given as,
- Choose Stat > Regression > Fit Regression Line.
- In Response (Y), enter the column of Removal efficiency.
- In Predictor (X), enter the column of Inlet temperature.
- Click OK.
The output using MINITAB software is given as,
Thus, the regression line for the variables % total suspended solids removed
Interpretation:
The slope estimate implies a decrease in % total suspended solids removed by 22.0% for every 1,000 liters increase in amount filtered. It can also be said that, for every 1% increase in amount filtered the % total suspended solids removed decreases 22%.
c.
Find the proportion of observed variation in % total suspended solids removed that can be explained by amount filtered using the simple linear regression model.
c.
Answer to Problem 48E
The proportion of observed variation in % total suspended solids removed that can be explained by amount filtered using the simple linear regression model is
Explanation of Solution
Justification:
The coefficient of determination (
The general formula to obtain coefficient of variation is,
From the regression output obtained in part (b), the value of coefficient of determination is 0.701.
Thus, the coefficient of determination is
Interpretation:
From this coefficient of determination it can be said that, the amount filtered can explain only 70.1% variability in % total suspended solids removed. Then remaining variability of % total suspended solids removed is explained by other variables.
Thus, the percentage of variation in the observed values of %total suspended solids removed that is explained by the regression is 70.1%, which indicates that 70.1% of the variability in %total suspended solids removed is explained by variability in the amount filtered using the linear regression model.
d.
Test whether there is enough evidence to conclude that the predictor variable amount filtered is useful for predicting the value of the response variable %total suspended solids removed at
d.
Answer to Problem 48E
There is sufficient evidence to conclude that the predictor variable amount filtered is useful for predicting the value of the response variable %total suspended solids removed.
Explanation of Solution
Calculation:
From the MINITAB output obtained in part (b), the regression line for the variables %total suspended solids removed
The test hypotheses are given below:
Null hypothesis:
That is, there is no useful relationship between the variables %total suspended solids removed
Alternative hypothesis:
That is, there is useful relationship between the variables %total suspended solids removed
T-test statistic:
The test statistic is,
From the MINITAB output obtained in part (b), the test statistic is -4.33 and the P-value is 0.003.
Thus, the value of test statistic is -4.33 and P-value is 0.003.
Level of significance:
Here, level of significance is
Decision rule based on p-value:
If
If
Conclusion:
The P-value is 0.003 and
Here, P-value is less than the
That is
By the rejection rule, reject the null hypothesis.
Thus, there is sufficient evidence to conclude that the predictor variable amount filtered is useful for predicting the value of the response variable %total suspended solids removed.
e.
Test whether there is enough evidence to infer that the true average decrease in “%total suspended solids removed” associated with 10,000 liters increase in “amount filtered” is greater than or equal to 2 at
e.
Answer to Problem 48E
There is no sufficient evidence to infer that the true average decrease in “%total suspended solids removed” associated with 10,000 liters increase in “amount filtered” is greater than or equal to 2.
Explanation of Solution
Calculation:
Linear regression model:
A linear regression model is given as
A linear regression model is given as
From the MINITAB output in part (b), the slope coefficient of the regression equation is
Here,
Here, the claim is that, when the amount filtered is increased from 10,000 liters the true average decrease in %total suspended solids removed is greater than or equal to 2.
The claim states that, amount filtered is increased by 10,000 liters.
Decrease in the %total suspended solids removed for 1,000 liters increase in amount filtered:
The true average decrease in the %total suspended solids removed for 1,000 liters increase in amount filtered is,
That is, when the amount filtered is increased by 1,000 liters the true average decrease in %total suspended solids removed is greater than or equal to 0.2.
The test hypotheses are given below:
Null hypothesis:
That is, the true average decrease in %total suspended solids removed is greater than or equal to 0.2.
Alternative hypothesis:
That is, the true average decrease in %total suspended solids removed is less than 0.2.
Test statistic:
The test statistic is,
Degrees of freedom:
The sample size is
The degrees of freedom is,
Thus, the degree of freedom is 8.
Here, level of significance is
Critical value:
Software procedure:
Step by step procedure to obtain the critical value using the MINITAB software:
- Choose Graph > Probability Distribution Plot choose View Probability > OK.
- From Distribution, choose ‘t’ distribution and enter 8 as degrees of freedom.
- Click the Shaded Area tab.
- Choose Probability Value and Left Tail for the region of the curve to shade.
- Enter the Probability value as 0.05.
- Click OK.
Output using the MINITAB software is given below:
From the output, the critical value is –1.860.
Thus, the critical value is
From the MINITAB output obtained in part (b), the estimate of error standard deviation of slope coefficient is
Test statistic under null hypothesis:
Under the null hypothesis, the test statistic is obtained as follows:
Thus, the test statistic is -0.3931.
Decision criteria for the classical approach:
If
Conclusion:
Here, the test statistic is -0.3931 and critical value is –1.860.
The t statistic is less than the critical value.
That is,
Based on the decision rule, reject the null hypothesis.
Hence, the true average decrease in %total suspended solids removed is not greater than or equal to 0.2.
Therefore, there is no sufficient evidence to infer that the true average decrease in “%total suspended solids removed” associated with 10,000 liters increase in “amount filtered” is greater than or equal to 2.
f.
Find the 95% specified confidence interval for the true mean %total suspended solids removed when the amount filtered is 100,000 liters.
Compare the width of the confidence intervals for 100,000 liters and 200,000 liters amount filtered.
f.
Answer to Problem 48E
The 95% specified confidence interval for the true mean %total suspended solids removed when the amount filtered is 100,000 liters is
The confidence interval for 100,000 liters of amount filtered will be narrower than the interval for 200,000 liters of amount filtered.
Explanation of Solution
Calculation:
From the MINITAB output obtained in part (b), the regression line for the variables %total suspended solids removed
Here, the variable amount filtered
Hence, the value of 100,000 for amount filtered is
Expected %total suspended solids removed when the amount filtered is
The expected value of %total suspended solids removed with
Thus, the expected value of %total suspended solids removed with
Confidence interval:
The general formula for the
Where,
From the MINITAB output in part (a), the value of the standard error of the estimate is
The value of
From the give data, the sum of amount filtered is
The mean amount filtered is,
Thus, the mean amount filtered is
Covariance term
The value of
Thus, the covariance term
Critical value:
For 95% confidence level,
Degrees of freedom:
The sample size is
The degrees of freedom is,
From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 8 degrees of freedom is 2.306.
Thus, the critical value is
The 95% confidence interval is,
Thus, the 95% specified confidence interval for the true mean %total suspended solids removed when the amount filtered is 100,000 liters is
Interpretation:
There is 95% confident that, the true mean %total suspended solids removed when the amount filtered is 100,000 liters lies between 22.37244 and 38.82756.
Comparison:
For 100,000 amount filtered, the value of x is
The mean amount filtered is
Here, the observation
The general formula to obtain
For
For
In the two quantities, the only difference is the values
In general, the value of the quantity
Therefore, the value
The confidence interval will be wider for large value of
Here,
Thus, the confidence interval is wider for
g.
Find the 95% prediction interval for the single value of %total suspended solids removed when the amount filtered is 100,000 liters.
Compare the width of the prediction intervals for 100,000 liters and 200,000 liters amount filtered.
g.
Answer to Problem 48E
The 95% prediction interval for the single value of %total suspended solids removed when the amount filtered is 100,000 liters is
The prediction interval for 100,000 liters of amount filtered will be narrower than the interval for 200,000 liters of amount filtered.
Explanation of Solution
Calculation:
From the MINITAB output obtained in part (b), the regression line for the variables %total suspended solids removed
From part (c), the
Prediction interval for a single future value:
Prediction interval is used to predict a single value of the focus variable that is to be observed at some future time. In other words it can be said that the prediction interval gives a single future value rather than estimating the mean value of the variable.
The general formula for
where
From the MINITAB output in part (b), the value of the standard error of the estimate is
From part (c), the mean chlorine flow is
Critical value:
For 95% confidence level,
Degrees of freedom:
The sample size is
The degrees of freedom is,
From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 8 degrees of freedom is 2.306.
Thus, the critical value is
The 95% prediction interval is,
Thus, the 95% prediction interval for the single value of %total suspended solids removed when the amount filtered is 100,000 liters is
Interpretation:
For repeated samples, there is 95% confident that the single value of % total suspended solids removed when the amount filtered is 100,000 liters will lie between 4.950886 and 56.24911.
Comparison:
For 100,000 amount filtered, the value of x is
The mean amount filtered is
Here, the observation
The general formula to obtain
For
For
In the two quantities, the only difference is the values
In general, the value of the quantity
Therefore, the value
The prediction interval will be wider for large value of
Here,
Thus, the prediction interval is wider for
Want to see more full solutions like this?
Chapter 12 Solutions
Probability and Statistics for Engineering and the Sciences STAT 400 - University Of Maryland
- uppose automobile insurance companies gave annual premiums for top-rated companies in several states. The figure below shows box plots for the annual premium for urban customers in three states. Which state offers the lowest premium? Which state offers the highest premium?arrow_forwardWing Foot is a shoe franchise commonly found in shopping centers across the United States. Wing Foot knows that its stores will not show a profit unless they gross over $940,000 per year. Let A be the event that a new Wing Foot store grosses over $940,000 its first year. Let B be the event that a store grosses over $940,000 its second year. Wing Foot has an administrative policy of closing a new store if it does not show a profit in either of the first two years. Assume that the accounting office at Wing Foot provided the following information: 58% of all Wing Foot stores show a profit the first year; 72% of all Wing Foot store show a profit the second year (this includes stores that did not show a profit the first year); however, 86% of Wing Foot stores that showed a profit the first year also showed a profit the second year. Compute P(B|Ac). Round your answer to the nearest hundredth.arrow_forwardYou draw two cards from a standard deck of 52 cards, but before you draw the second card, you put the first one back and reshuffle the deck. If you get a3on the first card, find the probability of drawing a 3 for the second card.arrow_forward
- Do bonds reduce the overall risk of an investment portfolio? Let x be a random variable representing annual percent return for the Vanguard Total Stock Index (all Stocks). Let y be a random variable representing annual return for the Vanguard Balanced Index (60% stock and 40% bond). For the past several years, assume the following data. Compute the coefficient of variation for each fund. Round your answers to the nearest tenth. x: 14 0 37 21 35 23 24 -14 -14 -17 y: 8 -2 29 17 22 17 17 -2 -3 -8arrow_forwardWhat percentage of the general U.S. population have bachelor's degrees? Suppose that the Statistical Abstract of the United States, 120th Edition, gives the following percentage of bachelor’s degrees by state. For convenience, the data are sorted in increasing order. 17 18 18 18 19 20 20 20 21 21 21 21 21 22 22 22 22 22 23 23 24 24 24 24 24 25 25 25 25 26 26 26 26 26 26 27 27 27 28 28 28 29 29 31 31 32 32 34 35 38 Illinois has a bachelor's degree percentage rate of about 18%. Into what quartile does this rate fall?arrow_forwardWhat percentage of the general U.S. population have bachelor's degrees? Suppose that the Statistical Abstract of the United States, 120th Edition, gives the following percentage of bachelor’s degrees by state. For convenience, the data are sorted in increasing order. 17 18 18 18 19 20 20 20 21 21 21 21 21 22 22 22 22 22 23 23 24 24 24 24 24 25 25 25 25 26 26 26 26 26 26 27 27 27 28 28 28 29 29 31 31 32 32 34 35 38 Illinois has a bachelor's degree percentage rate of about 18%. Into what quartile does this rate fall?arrow_forward
- Find the range for the following sample data. x 23 17 11 30 27arrow_forwardDo bonds reduce the overall risk of an investment portfolio? Let x be a random variable representing annual percent return for the Vanguard Total Stock Index (all Stocks). Let y be a random variable representing annual return for the Vanguard Balanced Index (60% stock and 40% bond). For the past several years, assume the following data. Compute the sample mean for x and for y. Round your answer to the nearest tenth. x: 11 0 36 22 34 24 25 -11 -11 -22 y: 9 -3 28 14 23 16 14 -3 -4 -9arrow_forwardDo bonds reduce the overall risk of an investment portfolio? Let x be a random variable representing annual percent return for the Vanguard Total Stock Index (all Stocks). Let y be a random variable representing annual return for the Vanguard Balanced Index (60% stock and 40% bond). For the past several years, assume the following data. Compute the range for variable y. X 12 0 36 21 35 23 24 -12 -12 -21 Y 10 -2 26 15 22 18 15 -2 -3 -10arrow_forward
- Do bonds reduce the overall risk of an investment portfolio? Let x be a random variable representing annual percent return for the Vanguard Total Stock Index (all Stocks). Let y be a random variable representing annual return for the Vanguard Balanced Index (60% stock and 40% bond). For the past several years, assume the following data. Compute the range for variable y. X 12 0 36 21 35 23 24 -12 -12 -21 Y 10 -2 26 15 22 18 15 -2 -3 -10arrow_forwardDo bonds reduce the overall risk of an investment portfolio? Let x be a random variable representing annual percent return for the Vanguard Total Stock Index (all Stocks). Let y be a random variable representing annual return for the Vanguard Balanced Index (60% stock and 40% bond). For the past several years, assume the following data. Compute the range for variable x. X 15 0 37 23 33 25 26 -15 -15 -23 Y 6 -1 28 18 24 17 18 -1 -2 -6arrow_forward7.16. If the probability density of X is given by g kx³ for x>0 f(x) = (1+2x)6 0 10-01, elsewhere trolls inf ( 2X density of the random variable Y = where k is an appropriate constant, find the probability 1+2X distribution of Y, and thus determine the value of k. 7 Identify thearrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman