(a)
To make a
(a)

Answer to Problem 28.14AYK
The strongest
Explanation of Solution
In the question, it is given that experimenters assessed the concentration of lactic acid, acetic acid and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese. The table is given which shows the data. The scatterplot with taste on the y axis is as follows:
As we can see in the scatterplot that all the lines are almost parallel and also that the R -square of the hydrogen sulfide is largest with taste so the correlation is largest for the hydrogen sulfide. The correlation is given in the scatterplot above by finding square root, the calculation is as:
Acetic | =SQRT(0.302) |
Lactic | =SQRT(0.3055) |
H2S | =SQRT(0.5712) |
And the result is as:
Acetic | 0.549545 |
Lactic | 0.552721 |
H2S | 0.755778 |
(b)
To use a software to obtain the regression equation and run inference for a regression model that includes all three explanatory variables and interpret the software output, including the meaning of the value taken by
(b)

Answer to Problem 28.14AYK
The equation is
Explanation of Solution
In the question, it is given that experimenters assessed the concentration of lactic acid, acetic acid and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese. The table is given which shows the data. Now, run inference for a regression model that includes all three explanatory variables and interpret the software output by using the Excel, the result will be as:
Regression Statistics | |
Multiple R | 0.800438 |
R Square | 0.640701 |
Adjusted R Square | 0.599243 |
Standard Error | 10.29053 |
Observations | 30 |
ANOVA | |||||
df | SS | MS | F | Significance F | |
Regression | 3 | 4909.619 | 1636.54 | 15.45438 | 5.68E-06 |
Residual | 26 | 2753.268 | 105.8949 | ||
Total | 29 | 7662.887 |
Coefficients | Standard Error | t Stat | P-value | |
Intercept | -32.8566 | 20.2335 | -1.62387 | 0.116466 |
Acetic | 2.000654 | 4.346475 | 0.460294 | 0.649132 |
H2S | 4.566348 | 1.176917 | 3.879925 | 0.000639 |
Lactic | 13.67117 | 6.643259 | 2.057902 | 0.049755 |
And the equation is as:
And
(c)
To explain which explanatory variable does it describe and create a new regression model that excludes this explanatory variable and interpret the software output and compare it with your findings in (b).
(c)

Answer to Problem 28.14AYK
That explanatory variable is Acetic.
Explanation of Solution
In the question, it is given that experimenters assessed the concentration of lactic acid, acetic acid and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese. The table is given which shows the data. In the above result in part (b), we can see that the explanatory variable Acetic has a P-value greater than the level of significance so it is not significant. Thus, we will remove this variable and run this test with the other two variables using Excel and the result will be as:
Regression Statistics | |
Multiple R | 0.798607 |
R Square | 0.637773 |
Adjusted R Square | 0.610941 |
Standard Error | 10.13922 |
Observations | 30 |
ANOVA | |||||
df | SS | MS | F | Significance F | |
Regression | 2 | 4887.183 | 2443.592 | 23.76946 | 1.11E-06 |
Residual | 27 | 2775.704 | 102.8038 | ||
Total | 29 | 7662.887 |
Coefficients | Standard Error | t Stat | P-value | |
Intercept | -24.4609 | 8.629104 | -2.8347 | 0.008581 |
H2S | 4.858662 | 0.976305 | 4.976581 | 3.24E-05 |
Lactic | 14.28672 | 6.411593 | 2.228263 | 0.034385 |
In this all the explanatory variables are statistically significant but in the above model in (b) all are not statistically significant but the variations explained are approximately equal.
(d)
To explain which explanatory variable of the two has the less significant or larger value and create a new regression model that excludes this explanatory variable and keeps only significant one and explain how does this last model compare with the model in (c).
(d)

Answer to Problem 28.14AYK
The explanatory variable of the two has the less significant or larger value is lactic.
Explanation of Solution
In the question, it is given that experimenters assessed the concentration of lactic acid, acetic acid and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese. The table is given which shows the data. In the above result in part (c), we can see that the P-value for the Lactic is larger than the hydrogen sulfide thus, we will remove the Lactic variable and then run the
Regression Statistics | |
Multiple R | 0.755752 |
R Square | 0.571162 |
Adjusted R Square | 0.555846 |
Standard Error | 10.83338 |
Observations | 30 |
ANOVA | |||||
df | SS | MS | F | Significance F | |
Regression | 1 | 4376.746 | 4376.746 | 37.29265 | 1.37E-06 |
Residual | 28 | 3286.141 | 117.3622 | ||
Total | 29 | 7662.887 |
Coefficients | Standard Error | t Stat | P-value | |
Intercept | -9.78684 | 5.95791 | -1.64266 | 0.111638 |
H2S | 5.776089 | 0.94585 | 6.10677 | 1.37E-06 |
In this as we compare it with the model in part (c), we can see that the coefficient of determination or the variations explained are less in this model then in part (c) and all the slopes are statistically significant.
(e)
To explain which model best explains cheddar taste and check the conditions for inference for this model and conclude.
(e)

Answer to Problem 28.14AYK
Model (b) best explains cheddar taste and conditions are met.
Explanation of Solution
In the question, it is given that experimenters assessed the concentration of lactic acid, acetic acid and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese. The table is given which shows the data. By looking at the model (b), (c) and (d), we can say that the variations explained is more in part (b) than in (c) and (d). Thus, the model in (b) best explains cheddar taste. The conditions for inferences are as: as we can see in the scatterplot, it shows the linearity and as we look at the data it shows the normality and constant variance by looking at the model regression analysis using Excel’s residual plot and the data is randomly selected so it shows independence. Thus, the conditions are met.
Want to see more full solutions like this?
Chapter 28 Solutions
PRACT STAT W/ ACCESS 6MO LOOSELEAF
- Please help me answer the following questions from this problem.arrow_forwardPlease help me find the sample variance for this question.arrow_forwardCrumbs Cookies was interested in seeing if there was an association between cookie flavor and whether or not there was frosting. Given are the results of the last week's orders. Frosting No Frosting Total Sugar Cookie 50 Red Velvet 66 136 Chocolate Chip 58 Total 220 400 Which category has the greatest joint frequency? Chocolate chip cookies with frosting Sugar cookies with no frosting Chocolate chip cookies Cookies with frostingarrow_forward
- The table given shows the length, in feet, of dolphins at an aquarium. 7 15 10 18 18 15 9 22 Are there any outliers in the data? There is an outlier at 22 feet. There is an outlier at 7 feet. There are outliers at 7 and 22 feet. There are no outliers.arrow_forwardStart by summarizing the key events in a clear and persuasive manner on the article Endrikat, J., Guenther, T. W., & Titus, R. (2020). Consequences of Strategic Performance Measurement Systems: A Meta-Analytic Review. Journal of Management Accounting Research?arrow_forwardThe table below was compiled for a middle school from the 2003 English/Language Arts PACT exam. Grade 6 7 8 Below Basic 60 62 76 Basic 87 134 140 Proficient 87 102 100 Advanced 42 24 21 Partition the likelihood ratio test statistic into 6 independent 1 df components. What conclusions can you draw from these components?arrow_forward
- What is the value of the maximum likelihood estimate, θ, of θ based on these data? Justify your answer. What does the value of θ suggest about the value of θ for this biased die compared with the value of θ associated with a fair, unbiased, die?arrow_forwardShow that L′(θ) = Cθ394(1 −2θ)604(395 −2000θ).arrow_forwarda) Let X and Y be independent random variables both with the same mean µ=0. Define a new random variable W = aX +bY, where a and b are constants. (i) Obtain an expression for E(W).arrow_forward
- The table below shows the estimated effects for a logistic regression model with squamous cell esophageal cancer (Y = 1, yes; Y = 0, no) as the response. Smoking status (S) equals 1 for at least one pack per day and 0 otherwise, alcohol consumption (A) equals the average number of alcohoic drinks consumed per day, and race (R) equals 1 for blacks and 0 for whites. Variable Effect (β) P-value Intercept -7.00 <0.01 Alcohol use 0.10 0.03 Smoking 1.20 <0.01 Race 0.30 0.02 Race × smoking 0.20 0.04 Write-out the prediction equation (i.e., the logistic regression model) when R = 0 and again when R = 1. Find the fitted Y S conditional odds ratio in each case. Next, write-out the logistic regression model when S = 0 and again when S = 1. Find the fitted Y R conditional odds ratio in each case.arrow_forwardThe chi-squared goodness-of-fit test can be used to test if data comes from a specific continuous distribution by binning the data to make it categorical. Using the OpenIntro Statistics county_complete dataset, test the hypothesis that the persons_per_household 2019 values come from a normal distribution with mean and standard deviation equal to that variable's mean and standard deviation. Use signficance level a = 0.01. In your solution you should 1. Formulate the hypotheses 2. Fill in this table Range (-⁰⁰, 2.34] (2.34, 2.81] (2.81, 3.27] (3.27,00) Observed 802 Expected 854.2 The first row has been filled in. That should give you a hint for how to calculate the expected frequencies. Remember that the expected frequencies are calculated under the assumption that the null hypothesis is true. FYI, the bounderies for each range were obtained using JASP's drag-and-drop cut function with 8 levels. Then some of the groups were merged. 3. Check any conditions required by the chi-squared…arrow_forwardSuppose that you want to estimate the mean monthly gross income of all households in your local community. You decide to estimate this population parameter by calling 150 randomly selected residents and asking each individual to report the household’s monthly income. Assume that you use the local phone directory as the frame in selecting the households to be included in your sample. What are some possible sources of error that might arise in your effort to estimate the population mean?arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





