(a)
To make a
(a)
Answer to Problem 28.14AYK
The strongest
Explanation of Solution
The question states that experimenters measured the concentrations of lactic acid, acetic acid, and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese; the data are given in a table. The scatterplot with taste on the y-axis is as follows:
In the scatterplot, the three fitted lines are almost parallel, and the R-square for hydrogen sulfide with taste is the largest, so hydrogen sulfide has the strongest correlation with taste. Each correlation is the square root of the corresponding R-square shown in the scatterplot, calculated as:
Acetic | =SQRT(0.302) |
Lactic | =SQRT(0.3055) |
H2S | =SQRT(0.5712) |
The results are:
Acetic | 0.549545 |
Lactic | 0.552721 |
H2S | 0.755778 |
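The square-root step above can be reproduced outside Excel. The following is a minimal Python sketch, assuming the R-square values are the ones read from the scatterplot and that each correlation is positive (as the reported values imply); the variable names are illustrative only.

```python
import math

# R-square values as reported in the solution's scatterplot.
r_squared = {"Acetic": 0.302, "Lactic": 0.3055, "H2S": 0.5712}

# Assumption: every trendline slopes upward, so the positive root is used.
correlations = {name: math.sqrt(r2) for name, r2 in r_squared.items()}

# The variable with the largest correlation is the strongest predictor of taste.
strongest = max(correlations, key=correlations.get)

for name, r in correlations.items():
    print(f"{name}: r = {r:.6f}")
print("Strongest correlation with taste:", strongest)
```

Taking the square root loses the sign of the correlation, so in general the sign has to be read from the slope of the fitted line.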
(b)
To use software to obtain the regression equation and run inference for a regression model that includes all three explanatory variables, and to interpret the software output, including the meaning of the value taken by
(b)
Answer to Problem 28.14AYK
The equation is: predicted taste = -32.8566 + 2.000654(Acetic) + 4.566348(H2S) + 13.67117(Lactic)
Explanation of Solution
The question states that experimenters measured the concentrations of lactic acid, acetic acid, and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese; the data are given in a table. Fitting a regression model in Excel with all three explanatory variables gives the following output:
Regression Statistics | |
Multiple R | 0.800438 |
R Square | 0.640701 |
Adjusted R Square | 0.599243 |
Standard Error | 10.29053 |
Observations | 30 |
ANOVA
Source | df | SS | MS | F | Significance F |
Regression | 3 | 4909.619 | 1636.54 | 15.45438 | 5.68E-06 |
Residual | 26 | 2753.268 | 105.8949 | | |
Total | 29 | 7662.887 | | | |
Term | Coefficients | Standard Error | t Stat | P-value |
Intercept | -32.8566 | 20.2335 | -1.62387 | 0.116466 |
Acetic | 2.000654 | 4.346475 | 0.460294 | 0.649132 |
H2S | 4.566348 | 1.176917 | 3.879925 | 0.000639 |
Lactic | 13.67117 | 6.643259 | 2.057902 | 0.049755 |
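The summary statistics in this output follow from the ANOVA sums of squares. Here is a hedged Python sketch that re-derives R-square, the F statistic, the standard error, and one t statistic from the table values; `anova_summary` is a hypothetical helper name, not part of Excel or any library.

```python
import math

def anova_summary(ss_reg, ss_res, n, k):
    """Re-derive regression summary values from ANOVA sums of squares.

    n is the number of observations, k the number of explanatory variables.
    """
    ss_tot = ss_reg + ss_res
    df_res = n - k - 1                 # residual degrees of freedom
    ms_reg = ss_reg / k                # mean square for regression
    ms_res = ss_res / df_res           # mean square for residuals
    return {
        "r_square": ss_reg / ss_tot,   # proportion of variation explained
        "f_stat": ms_reg / ms_res,     # overall F statistic
        "std_error": math.sqrt(ms_res) # regression standard error s
    }

# Values copied from the three-variable Excel output.
full = anova_summary(ss_reg=4909.619, ss_res=2753.268, n=30, k=3)

# Each t statistic is the coefficient divided by its standard error.
t_h2s = 4.566348 / 1.176917

print(full)
print(f"t for H2S = {t_h2s:.4f}")
```

The results agree with the R Square, F, Standard Error, and t Stat entries above up to rounding.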
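Reading the coefficients from the output, the equation is: predicted taste = -32.8566 + 2.000654(Acetic) + 4.566348(H2S) + 13.67117(Lactic)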
And
(c)
To identify which explanatory variable is not statistically significant in the model from (b), fit a new regression model that excludes this explanatory variable, interpret the software output, and compare it with the findings in (b).
(c)
Answer to Problem 28.14AYK
That explanatory variable is Acetic.
Explanation of Solution
The question states that experimenters measured the concentrations of lactic acid, acetic acid, and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese; the data are given in a table. In the output from part (b), the explanatory variable Acetic has a P-value of 0.649, far above any conventional significance level, so it is not significant. We therefore remove this variable and rerun the regression in Excel with the other two variables. The output is:
Regression Statistics | |
Multiple R | 0.798607 |
R Square | 0.637773 |
Adjusted R Square | 0.610941 |
Standard Error | 10.13922 |
Observations | 30 |
ANOVA
Source | df | SS | MS | F | Significance F |
Regression | 2 | 4887.183 | 2443.592 | 23.76946 | 1.11E-06 |
Residual | 27 | 2775.704 | 102.8038 | | |
Total | 29 | 7662.887 | | | |
Term | Coefficients | Standard Error | t Stat | P-value |
Intercept | -24.4609 | 8.629104 | -2.8347 | 0.008581 |
H2S | 4.858662 | 0.976305 | 4.976581 | 3.24E-05 |
Lactic | 14.28672 | 6.411593 | 2.228263 | 0.034385 |
In this model, both explanatory variables are statistically significant, whereas in the model in (b) not all of them were. The proportion of variation explained is nearly the same in the two models (R-square of 0.6378 here versus 0.6407 in (b)).
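A partial F statistic is one way to see how little is lost by dropping Acetic. The sketch below, in Python, uses only the residual sums of squares from the two Excel outputs; `partial_f` is a hypothetical helper name. For a single dropped variable, this statistic should equal the square of that variable's t statistic in the larger model.

```python
def partial_f(sse_reduced, sse_full, df_dropped, mse_full):
    """Partial F statistic for dropping variables from a regression model."""
    return (sse_reduced - sse_full) / df_dropped / mse_full

# Residual SS from the two-variable model (c) and the three-variable model (b),
# and the residual mean square of model (b), all copied from the Excel output.
f_acetic = partial_f(sse_reduced=2775.704, sse_full=2753.268,
                     df_dropped=1, mse_full=105.8949)

# t statistic for Acetic in model (b).
t_acetic = 0.460294

print(f"Partial F for Acetic = {f_acetic:.4f}")
print(f"t squared for Acetic = {t_acetic ** 2:.4f}")
```

Both values come out near 0.21, which matches the large P-value reported for Acetic in part (b).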
(d)
To identify which of the two remaining explanatory variables is less significant (has the larger P-value), fit a new regression model that excludes it and keeps only the more significant one, and explain how this last model compares with the model in (c).
(d)
Answer to Problem 28.14AYK
Of the two explanatory variables, Lactic is the less significant: it has the larger P-value.
Explanation of Solution
The question states that experimenters measured the concentrations of lactic acid, acetic acid, and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese; the data are given in a table. In the output from part (c), the P-value for Lactic (0.034385) is larger than the P-value for hydrogen sulfide (3.24E-05). We therefore remove the Lactic variable and rerun the regression with hydrogen sulfide as the only explanatory variable. The output is:
Regression Statistics | |
Multiple R | 0.755752 |
R Square | 0.571162 |
Adjusted R Square | 0.555846 |
Standard Error | 10.83338 |
Observations | 30 |
ANOVA
Source | df | SS | MS | F | Significance F |
Regression | 1 | 4376.746 | 4376.746 | 37.29265 | 1.37E-06 |
Residual | 28 | 3286.141 | 117.3622 | | |
Total | 29 | 7662.887 | | | |
Term | Coefficients | Standard Error | t Stat | P-value |
Intercept | -9.78684 | 5.95791 | -1.64266 | 0.111638 |
H2S | 5.776089 | 0.94585 | 6.10677 | 1.37E-06 |
Compared with the model in part (c), the coefficient of determination (the proportion of variation explained) is lower in this model (R-square of 0.5712 versus 0.6378), and the single slope is statistically significant.
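With only one explanatory variable, the Excel output has some built-in consistency checks: Multiple R is the absolute value of the simple correlation, and the overall F statistic equals the square of the slope's t statistic. A short Python sketch of these checks, using values copied from the outputs above, might look like this:

```python
import math

# Values from the one-variable (H2S only) Excel output.
r_square_d = 0.571162
t_h2s = 6.10677

# Multiple R is the square root of R-square.
multiple_r = math.sqrt(r_square_d)

# In simple regression, F equals t squared for the single slope.
f_from_t = t_h2s ** 2

# Drop in R-square relative to the two-variable model in part (c).
r_square_c = 0.637773
r_square_drop = r_square_c - r_square_d

print(f"Multiple R = {multiple_r:.6f}")
print(f"F from t^2 = {f_from_t:.4f}")
print(f"R-square lost by dropping Lactic = {r_square_drop:.6f}")
```

The computed values agree with the Multiple R of 0.755752 and the F of 37.29265 in the output, and the R-square drop of about 0.067 quantifies what is lost by removing Lactic.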
(e)
To determine which model best explains cheddar taste, check the conditions for inference for this model, and conclude.
(e)
Answer to Problem 28.14AYK
Model (b) best explains cheddar taste and conditions are met.
Explanation of Solution
The question states that experimenters measured the concentrations of lactic acid, acetic acid, and hydrogen sulfide in thirty randomly chosen pieces of cheddar cheese; the data are given in a table. Comparing the models in (b), (c), and (d), the proportion of variation explained is highest in (b) (R-square of 0.6407, versus 0.6378 in (c) and 0.5712 in (d)). On that basis, the model in (b) best explains cheddar taste. Note that R-square cannot decrease when a variable is added; by adjusted R-square (0.6109 for (c) versus 0.5992 for (b)) and standard error (10.14 versus 10.29), model (c) performs slightly better, so the choice between (b) and (c) is close. The conditions for inference are checked as follows: the scatterplot shows roughly linear relationships; the residual plots from Excel's regression analysis are used to judge normality and constant variance of the residuals; and the pieces of cheese were randomly selected, which supports independence. Thus, the conditions are met.
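Because the three models use different numbers of explanatory variables, adjusted R-square is a common companion to raw R-square when comparing them. The following Python sketch recomputes it from the R-square values in the Excel outputs; `adjusted_r_square` is a hypothetical helper name, and the model labels simply follow parts (b), (c), and (d).

```python
def adjusted_r_square(r_square, n, k):
    """Adjusted R-square for n observations and k explanatory variables."""
    return 1 - (1 - r_square) * (n - 1) / (n - k - 1)

# (R-square, number of explanatory variables) from each Excel output.
models = {"(b)": (0.640701, 3), "(c)": (0.637773, 2), "(d)": (0.571162, 1)}

adjusted = {label: adjusted_r_square(r2, n=30, k=k)
            for label, (r2, k) in models.items()}

# Ranking by raw R-square versus ranking by adjusted R-square.
best_by_r2 = max(models, key=lambda label: models[label][0])
best_by_adj = max(adjusted, key=adjusted.get)

for label, value in adjusted.items():
    print(f"Model {label}: adjusted R-square = {value:.6f}")
print("Highest R-square:", best_by_r2)
print("Highest adjusted R-square:", best_by_adj)
```

The recomputed values match the Adjusted R Square entries in the outputs. Raw R-square ranks model (b) first, while adjusted R-square ranks model (c) first, so the two criteria do not agree here.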
Chapter 28 Solutions
Practice of Statistics in the Life Sciences