
(a)
To make a
(a)

Answer to Problem 28.14AYK
The strongest
Explanation of Solution
The experimenters measured the concentrations of lactic acid, acetic acid, and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese. The data are given in the table. The scatterplot with taste on the y axis is as follows:
The scatterplot shows that the fitted lines are almost parallel and that the R-square with taste is largest for hydrogen sulfide, so hydrogen sulfide has the strongest correlation with taste. Each correlation is the square root of the R-square shown in the scatterplot. The calculation is:
Variable | Excel formula | Correlation |
Acetic | =SQRT(0.302) | 0.549545 |
Lactic | =SQRT(0.3055) | 0.552721 |
H2S | =SQRT(0.5712) | 0.755778 |
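The square-root step above can also be checked outside Excel. The following is a minimal Python sketch that uses only the three R-square values quoted in the explanation. It assumes each fitted slope is positive, so the correlation is the positive root. The variable names are illustrative.

```python
import math

# R-square values read from the scatterplot trend lines, as quoted above
r_squared = {"Acetic": 0.302, "Lactic": 0.3055, "H2S": 0.5712}

# For a one-predictor fit, |r| is the square root of R-square.
# Assumption: every fitted slope is positive, so r takes the positive root.
correlations = {name: math.sqrt(r2) for name, r2 in r_squared.items()}

for name, r in correlations.items():
    print(f"{name}: r = {r:.6f}")

# The variable with the largest correlation has the strongest linear
# relationship with taste.
strongest = max(correlations, key=correlations.get)
print("Strongest relationship with taste:", strongest)
```

If a slope were negative, the same magnitude would apply with a minus sign, and the ranking by strength would not change.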
(b)
To use software to obtain the regression equation and run inference for a regression model that includes all three explanatory variables, and to interpret the software output, including the meaning of the value taken by
(b)

Answer to Problem 28.14AYK
The equation is: predicted taste = -32.8566 + 2.000654(Acetic) + 4.566348(H2S) + 13.67117(Lactic)
Explanation of Solution
Using Excel, run inference for a regression model that includes all three explanatory variables and interpret the software output. The result is:
Regression Statistics | |
Multiple R | 0.800438 |
R Square | 0.640701 |
Adjusted R Square | 0.599243 |
Standard Error | 10.29053 |
Observations | 30 |
ANOVA | |||||
Source | df | SS | MS | F | Significance F |
Regression | 3 | 4909.619 | 1636.54 | 15.45438 | 5.68E-06 |
Residual | 26 | 2753.268 | 105.8949 | ||
Total | 29 | 7662.887 |
Term | Coefficients | Standard Error | t Stat | P-value |
Intercept | -32.8566 | 20.2335 | -1.62387 | 0.116466 |
Acetic | 2.000654 | 4.346475 | 0.460294 | 0.649132 |
H2S | 4.566348 | 1.176917 | 3.879925 | 0.000639 |
Lactic | 13.67117 | 6.643259 | 2.057902 | 0.049755 |
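The equation, read from the Coefficients column, is: predicted taste = -32.8566 + 2.000654(Acetic) + 4.566348(H2S) + 13.67117(Lactic)
The R Square value is 0.640701, so the model explains about 64% of the variation in taste.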
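The summary values in the Excel output for model (b) are linked by standard identities, so they can be cross-checked from the ANOVA table alone. The sketch below is an illustrative check in Python, not part of the original solution. All numbers are copied from the output above, and the variable names are hypothetical.

```python
import math

# Values copied from the Excel ANOVA table for model (b)
n, k = 30, 3                          # observations, explanatory variables
ss_reg, ss_res = 4909.619, 2753.268   # regression and residual sums of squares
ss_total = ss_reg + ss_res            # matches the Total row, 7662.887

ms_reg = ss_reg / k                   # mean square for regression
ms_res = ss_res / (n - k - 1)         # mean square for residuals

r2 = ss_reg / ss_total                          # R Square
adj_r2 = 1 - (1 - r2) * (n - 1) / (n - k - 1)   # Adjusted R Square
f_stat = ms_reg / ms_res                        # overall F statistic
s = math.sqrt(ms_res)                           # regression standard error

# Each t statistic is the coefficient divided by its standard error
coefs = {
    "Acetic": (2.000654, 4.346475),
    "H2S": (4.566348, 1.176917),
    "Lactic": (13.67117, 6.643259),
}
t_stats = {name: b / se for name, (b, se) in coefs.items()}

print(f"R Square = {r2:.6f}, Adjusted R Square = {adj_r2:.6f}")
print(f"F = {f_stat:.5f}, Standard Error = {s:.5f}")
for name, t in t_stats.items():
    print(f"t({name}) = {t:.6f}")
```

The recomputed values agree with the Regression Statistics, ANOVA, and Coefficients tables to the precision shown, which is a quick way to catch transcription errors in the output.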
(c)
To identify which explanatory variable is not significant in the model in (b), create a new regression model that excludes this explanatory variable, interpret the software output, and compare it with the findings in (b).
(c)

Answer to Problem 28.14AYK
That explanatory variable is Acetic.
Explanation of Solution
In the output for part (b), the explanatory variable Acetic has a P-value of 0.649132, which is greater than any usual level of significance (for example 0.05), so it is not significant. Thus, we remove this variable and rerun the regression with the other two variables using Excel. The result is:
Regression Statistics | |
Multiple R | 0.798607 |
R Square | 0.637773 |
Adjusted R Square | 0.610941 |
Standard Error | 10.13922 |
Observations | 30 |
ANOVA | |||||
Source | df | SS | MS | F | Significance F |
Regression | 2 | 4887.183 | 2443.592 | 23.76946 | 1.11E-06 |
Residual | 27 | 2775.704 | 102.8038 | ||
Total | 29 | 7662.887 |
Term | Coefficients | Standard Error | t Stat | P-value |
Intercept | -24.4609 | 8.629104 | -2.8347 | 0.008581 |
H2S | 4.858662 | 0.976305 | 4.976581 | 3.24E-05 |
Lactic | 14.28672 | 6.411593 | 2.228263 | 0.034385 |
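In this model both explanatory variables are statistically significant (P-value 3.24E-05 for H2S and 0.034385 for Lactic), whereas in the model in (b) Acetic was not significant. The variation explained is approximately equal in the two models (R Square 0.637773 here versus 0.640701 in (b)).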
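One way to quantify how little is lost by dropping Acetic is a partial F statistic computed from the two residual sums of squares. The sketch below is an illustrative Python check, not part of the original solution. It uses only values from the two Excel outputs, and the variable names are hypothetical.

```python
# Residual sums of squares and residual degrees of freedom from the Excel outputs
sse_full, df_full = 2753.268, 26         # model (b): Acetic, H2S, Lactic
sse_reduced, df_reduced = 2775.704, 27   # model (c): H2S, Lactic

# Partial F statistic for the single dropped term (Acetic)
extra_ss = sse_reduced - sse_full
partial_f = (extra_ss / (df_reduced - df_full)) / (sse_full / df_full)

# When one term is dropped, the partial F equals the square of that
# term's t statistic in the full model.
t_acetic = 0.460294
print(f"partial F = {partial_f:.6f}, t squared = {t_acetic ** 2:.6f}")

# Change in R Square between the two models
r2_full, r2_reduced = 0.640701, 0.637773
r2_drop = r2_full - r2_reduced
print(f"R Square drops by {r2_drop:.6f} when Acetic is removed")
```

The partial F agrees with the squared t statistic for Acetic from part (b), so the two outputs are consistent with each other and tell the same story: removing Acetic costs very little explained variation.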
(d)
To identify which of the two remaining explanatory variables is less significant (has the larger P-value), create a new regression model that excludes it and keeps only the significant one, and explain how this last model compares with the model in (c).
(d)

Answer to Problem 28.14AYK
Of the two explanatory variables, Lactic is the less significant, because it has the larger P-value.
Explanation of Solution
In the output for part (c), the P-value for Lactic (0.034385) is larger than the P-value for hydrogen sulfide (3.24E-05). Thus, we remove the Lactic variable and rerun the regression with hydrogen sulfide as the only explanatory variable. The Excel output is:
Regression Statistics | |
Multiple R | 0.755752 |
R Square | 0.571162 |
Adjusted R Square | 0.555846 |
Standard Error | 10.83338 |
Observations | 30 |
ANOVA | |||||
Source | df | SS | MS | F | Significance F |
Regression | 1 | 4376.746 | 4376.746 | 37.29265 | 1.37E-06 |
Residual | 28 | 3286.141 | 117.3622 | ||
Total | 29 | 7662.887 |
Term | Coefficients | Standard Error | t Stat | P-value |
Intercept | -9.78684 | 5.95791 | -1.64266 | 0.111638 |
H2S | 5.776089 | 0.94585 | 6.10677 | 1.37E-06 |
Compared with the model in part (c), the coefficient of determination is lower in this model (R Square 0.571162 versus 0.637773), so it explains less of the variation in taste. The one remaining slope, for H2S, is statistically significant (P-value 1.37E-06).
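With a single explanatory variable, the output for model (d) has two built-in consistency checks: the overall F statistic equals the square of the slope's t statistic, and Multiple R is the square root of R Square. The sketch below is an illustrative Python check, not part of the original solution. It uses values copied from the output above, and the variable names are hypothetical.

```python
import math

# Values copied from the Excel output for model (d), H2S only
slope, se_slope = 5.776089, 0.94585
ms_reg, ms_res = 4376.746, 117.3622
r_square = 0.571162

t_stat = slope / se_slope     # t statistic for the H2S slope
f_stat = ms_reg / ms_res      # overall F statistic

# In simple linear regression, F equals t squared, which is why the
# Significance F and the slope's P-value are both 1.37E-06.
print(f"t = {t_stat:.5f}, t squared = {t_stat ** 2:.5f}, F = {f_stat:.5f}")

# Multiple R is the square root of R Square; here it is the size of the
# correlation between H2S and taste.
multiple_r = math.sqrt(r_square)
print(f"Multiple R = {multiple_r:.6f}")
```

The Multiple R obtained here is close to the 0.755778 found for H2S in part (a). The small difference comes from the rounded R-square (0.5712) used there.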
(e)
To explain which model best explains cheddar taste and check the conditions for inference for this model and conclude.
(e)

Answer to Problem 28.14AYK
Model (b) best explains cheddar taste and conditions are met.
Explanation of Solution
Comparing models (b), (c), and (d), the variation explained is larger in (b) than in (c) and (d) (R Square 0.640701, versus 0.637773 and 0.571162). Thus, the model in (b) best explains cheddar taste. The conditions for inference are checked as follows. Linearity: the scatterplot shows linear relationships. Normality and constant variance: these are supported by the data and by Excel's residual plot from the regression analysis. Independence: the pieces of cheese were randomly selected. Thus, the conditions are met.
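The comparison above uses R Square, which cannot decrease when an explanatory variable is added. As a supplementary check, the sketch below tabulates the three summary measures reported by Excel for each model and shows which model each measure favors. It is an illustrative Python sketch, not part of the original solution. The numbers are copied from the outputs in (b), (c), and (d), and the dictionary layout is hypothetical.

```python
# Summary measures copied from the three Excel outputs
models = {
    "(b) Acetic + H2S + Lactic": {"r2": 0.640701, "adj_r2": 0.599243, "s": 10.29053},
    "(c) H2S + Lactic": {"r2": 0.637773, "adj_r2": 0.610941, "s": 10.13922},
    "(d) H2S": {"r2": 0.571162, "adj_r2": 0.555846, "s": 10.83338},
}

for name, m in models.items():
    print(f"{name}: R Square={m['r2']:.4f}, "
          f"Adjusted R Square={m['adj_r2']:.4f}, Standard Error={m['s']:.4f}")

# Larger R Square and Adjusted R Square are better; smaller Standard Error is better
best_r2 = max(models, key=lambda name: models[name]["r2"])
best_adj_r2 = max(models, key=lambda name: models[name]["adj_r2"])
best_s = min(models, key=lambda name: models[name]["s"])

print("Largest R Square:", best_r2)
print("Largest Adjusted R Square:", best_adj_r2)
print("Smallest Standard Error:", best_s)
```

Adjusted R Square and the regression standard error penalize the extra term, so on those two measures model (c) comes out slightly ahead even though model (b) has the largest R Square. The difference between (b) and (c) is small either way.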
Chapter 28 Solutions
EBK PRACTICE OF STATISTICS IN THE LIFE





