Introduction to Statistics and Data Analysis
5th Edition
ISBN: 9781305115347
Author: Roxy Peck; Chris Olsen; Jay L. Devore
Publisher: Brooks Cole
Question
Chapter 14.3, Problem 45E
a.
To determine
Test whether the chosen model is useful or not at 0.05 level of significance.
b.
To determine
Check whether the interaction term is important or not.
c.
To determine
Explain whether the model utility test indicates a useful model.
Students have asked these similar questions
A researcher records age in years (x) and systolic blood pressure (y) for volunteers. A regression analysis was performed, and a portion of the computer output is as follows:

ŷ = 4.3 + 14.9x

Coefficients
              Estimate   Std. Error   Test statistic   P-value
(Intercept)   4.3        2.9          1.48             0.08
x             14.9       5.1          2.92             0.01

Specify the null and the alternative hypotheses that you would use in order to test whether a negative linear relationship exists between x and y.

○ H₀: β₁ = 0, Hₐ: β₁ > 0
○ H₀: β₁ = 0, Hₐ: β₁ < 0
○ H₀: β₁ = 0, Hₐ: β₁ ≠ 0
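The "Test statistic" column in output of this kind is normally the estimate divided by its standard error. The sketch below checks that reading against the numbers printed in the output above; the function name `t_statistic` is illustrative and not part of the original problem.

```python
def t_statistic(estimate, std_error):
    """Ratio of a coefficient estimate to its standard error."""
    return estimate / std_error

# Values copied from the output above
t_intercept = t_statistic(4.3, 2.9)
t_slope = t_statistic(14.9, 5.1)

print(round(t_intercept, 2))  # matches the printed 1.48
print(round(t_slope, 2))      # matches the printed 2.92
```

The sign of the slope's test statistic matters for a one-sided test: here it is positive, which is worth keeping in mind when the question asks about a negative relationship.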
A researcher records age in years (x) and systolic blood pressure (y) for volunteers. A regression analysis was performed, and a portion of the computer output is as follows:

ŷ = 4.5 + 14.4x

Coefficients
              Estimate   Std. Error   Test statistic   P-value
(Intercept)   4.5        2.9          1.55             0.07
x             14.4       4.7          3.06             0

Specify the null and the alternative hypotheses that you would use in order to test whether a linear relationship exists between x and y.

○ H₀: β₁ = 0, Hₐ: β₁ > 0
○ H₀: β₁ = 0, Hₐ: β₁ < 0
○ H₀: β₁ = 0, Hₐ: β₁ ≠ 0
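The answer choices in these two questions differ only in the direction of the alternative hypothesis. As a rough sketch, the mapping from the wording of the research question to the hypothesis pair can be written out as follows; the function name and the direction labels ("positive", "negative", "any") are assumptions made for illustration.

```python
def slope_hypotheses(direction):
    """Return (null, alternative) for a test on the slope beta1.

    direction is an illustrative label:
      "positive" -> testing for a positive linear relationship
      "negative" -> testing for a negative linear relationship
      "any"      -> testing for any linear relationship
    """
    alternatives = {
        "positive": "beta1 > 0",
        "negative": "beta1 < 0",
        "any": "beta1 != 0",
    }
    if direction not in alternatives:
        raise ValueError("direction must be 'positive', 'negative', or 'any'")
    # The null hypothesis is the same in all three cases
    return "H0: beta1 = 0", "Ha: " + alternatives[direction]

print(slope_hypotheses("negative"))
print(slope_hypotheses("any"))
```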
Retail price data for n = 60 hard disk drives were recently reported in a computer magazine. Three variables were recorded for each hard disk drive:
y = Retail PRICE (measured in dollars)
x₁ = Microprocessor SPEED (measured in megahertz)
(Values in sample range from 10 to 40)
x₂ = CHIP size (measured in computer processing units)
(Values in sample range from 286 to 486)
A first-order regression model was fit to the data. Part of the printout follows:

Parameter Estimates
                 PARAMETER      STANDARD        T FOR H0:
VARIABLE    DF   ESTIMATE       ERROR           PARAMETER = 0   PROB > |T|
INTERCEPT   1    -373.526392    1258.1243396    -0.297          0.7676
SPEED       1    104.838940     22.36298195     4.688           0.0001
CHIP        1    3.571850       3.89422935      0.917           0.3629

Identify and interpret the estimate of β₂.
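In a first-order model, a coefficient estimate is the change in the predicted response for a one-unit increase in that variable with the other variable held fixed. The sketch below illustrates this with the printed estimates, assuming β₂ is the coefficient of CHIP (x₂); the function name and the SPEED and CHIP values plugged in are arbitrary choices for illustration.

```python
# Estimates and standard errors copied from the printout above
COEF = {"intercept": -373.526392, "speed": 104.838940, "chip": 3.571850}
STD_ERR = {"intercept": 1258.1243396, "speed": 22.36298195, "chip": 3.89422935}

def predicted_price(speed, chip):
    """Predicted retail price from the fitted first-order model."""
    return COEF["intercept"] + COEF["speed"] * speed + COEF["chip"] * chip

# Raise CHIP by one unit while SPEED stays fixed (values chosen arbitrarily)
delta = predicted_price(25, 387) - predicted_price(25, 386)
print(round(delta, 6))  # equals the CHIP estimate

# The "T FOR H0" column is estimate / standard error
t_chip = COEF["chip"] / STD_ERR["chip"]
t_speed = COEF["speed"] / STD_ERR["speed"]
print(round(t_chip, 3), round(t_speed, 3))
```

The difference does not depend on which SPEED value is held fixed, which is what "holding the other variable constant" means in a model without an interaction term.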
Similar questions
- Olympic Pole Vault. The graph in Figure 7 indicates that in recent years the winning Olympic men’s pole vault height has fallen below the value predicted by the regression line in Example 2. This might have occurred because when the pole vault was a new event there was much room for improvement in vaulters’ performances, whereas now even the best training can produce only incremental advances. Let’s see whether concentrating on more recent results gives a better predictor of future records. (a) Use the data in Table 2 (page 176) to complete the table of winning pole vault heights shown in the margin. (Note that we are using x = 0 to correspond to the year 1972, where this restricted data set begins.) (b) Find the regression line for the data in part (a). (c) Plot the data and the regression line on the same axes. Does the regression line seem to provide a good model for the data? (d) What does the regression line predict as the winning pole vault height for the 2012 Olympics? Compare this predicted value to the actual 2012 winning height of 5.97 m, as described on page 177. Has this new regression line provided a better prediction than the line in Example 2?
- The flow rate in a device used for air quality measurement depends on the pressure drop x (inches of water) across the device's filter. Suppose that for x values between 5 and 20, these two variables are related according to the simple linear regression model with true regression line y = -0.11 + 0.097x. (a.1) What is the true average flow rate for a pressure drop of 10 in.? (a.2) A drop of 15 in.? (b) What is the true average change in flow rate associated with a 1 inch increase in pressure drop? (c) What is the average change in flow rate when pressure drop decreases by 5 in.?
- A. Do these data provide sufficient evidence that there is a positive linear relationship between the two variables? B. What does R² imply? C. Using the regression model, predict the blood pressure level associated with a sound pressure of 7.5 decibels.
- The authors of the paper "Predicting Yolk Height, Yolk Width, Albumen Length, Eggshell Weight, Egg Shape Index, Eggshell Thickness, Egg Surface Area of Japanese Quails Using Various Egg Traits as Regressors" used a multiple regression model with two independent variables where y = quail egg weight (g), x₁ = egg width (mm), and x₂ = egg length (mm). The regression function suggested in the paper is -21.658 + 0.828x₁ + 0.373x₂. (a) What is the mean egg weight for quail eggs that have a width of 20 mm and a length of 48 mm? (Enter your answer to three decimal places.) (b) Interpret the value of β₁.
  ○ When width is fixed, the mean increase in weight associated with a 1-mm increase in length is 0.373 g.
  ○ When length is fixed, the mean increase in weight associated with a 1-mm increase in width is 0.373 g.
  ○ When length is fixed, the mean increase in weight associated with a 1-mm increase in width is 0.828 g.
  ○ When width is fixed, the mean increase in weight associated with a 1-mm increase…
- The accompanying Minitab regression output is based on data that appeared in the article "Application of Design of Experiments for Modeling Surface Roughness in Ultrasonic Vibration Turning." The response variable is surface roughness (μm), and the independent variables are vibration amplitude (μm), depth of cut (mm), feed rate (mm/rev), and cutting speed (m/min), respectively.

  The regression equation is
  Ra = -0.972 - 0.0312a + 0.557d + 18.3f + 0.00282v

  Predictor   Coef       SE Coef    T       P
  Constant    -0.9723    0.3923     -2.48   0.015
  a           -0.03117   0.01864    -1.67   0.099
  d           0.5568     0.3185     1.75    0.084
  f           18.2602    0.7536     24.23   0.000
  v           0.002822   0.003977   0.71    0.480

  S = 0.822059   R-Sq = 88.6%   R-Sq(adj) = 88.0%

  Source           DF   SS       MS       F        P
  Regression       4    401.02   100.25   148.35   0.000
  Residual Error   76   51.36    0.68
  Total            80   452.38

  (a) How many observations were there in the data set? (b) Interpret the coefficient of multiple determination.
  ○ 8.0% of the observed variation in feed rate can be explained by the model relationship with vibration…
- The quality of the orange juice produced by a certain manufacturer is constantly monitored. Data collected on the sweetness index of an orange juice sample and amount of water-soluble pectin for 24 production runs at a juice manufacturing plant are shown in the accompanying table. Suppose a manufacturer wants to use simple linear regression to predict the sweetness (y) from the amount of pectin (x). Find and interpret the coefficient of determination, r², and the coefficient of correlation, r. Find and interpret the coefficient of determination, r². Select the correct choice below and fill in the answer box within your choice. (Round to three decimal places as needed.)
  A. The coefficient of determination, r², is ___. Sample variations in the amount of water-soluble pectin explain 100r²% of the sample variation in the sweetness index using the least squares line.
  B. The coefficient of determination, r², is…
- In an experiment, the independent variable is the percentage of hydrocarbons present in the main condenser of the distillation unit, and the dependent variable is the purity of oxygen produced in a chemical distillation process. A simple linear regression and correlation analysis is performed on a sample of 9 observations. Results are as shown below: SSxx = 113.7356, SSyy = 0.5156, a = -4.2869, b = 0.0648
- We have data on Lung Capacity of persons and we wish to build a multiple linear regression model that predicts Lung Capacity based on the predictors Age and Smoking Status. Age is a numeric variable whereas Smoke is a categorical variable (0 if non-smoker, 1 if smoker). Here is the partial result from STATISTICA.

  N = 725     b*          Std.Err. of b*   b           Std.Err. of b
  Intercept                                1.085725    0.182989
  Age         0.835543    0.021631         0.555396    0.014378
  Smoke       -0.075120   0.021631         -0.648588   0.186761

  Which of the following statements is absolutely false?
  A. The expected lung capacity of a smoker is expected to be 0.648588 lower than that of a non-smoker.
  B. The predictor variables Age and Smoker both contribute significantly to the model.
  C. For every one year that a person gets older, the lung capacity is expected to increase by 0.555396 units, holding smoker status constant.
  D. For every one unit increase in smoker status, lung capacity is expected to decrease by 0.648588 units, holding age constant.
- A particular article used a multiple regression model with the following four independent variables.
  y = error percentage for subjects reading a four-digit liquid crystal display
  x₁ = level of backlight (from 0 to 122 cd/m)
  x₂ = character subtense (from .025 to 1.34)
  x₃ = viewing angle (from 0 to 60)
  x₄ = level of ambient light (from 20 to 1500 lx)
  The model equation suggested in the article is given below. (a) Assume that this is the correct equation. What is the mean value of y when x₁ = 30, x₂ = 0.6, x₃ = 50 and x₄ = 150? (b) What mean error percentage is associated with a backlight level of 40, character subtense of 0.6, viewing angle of 20, and ambient light level of 30?
- The relationship between yield of maize, date of planting, and planting density was investigated in an article. Let the variables be defined as follows.
  y = percent maize yield
  x = planting date (days after April 20)
  z = planting density (plants/ha)
  The following regression model with both quadratic terms, where x₁ = x, x₂ = z, x₃ = x² and x₄ = z², provides a good description of the relationship between y and the independent variables.
  y = α + β₁x₁ + β₂x₂ + β₃x₃ + β₄x₄ + e
  (a) If α = 21.07, β₁ = 0.653, β₂ = 0.0022, β₃ = -0.0207, and β₄ = 0.00002, what is the population regression function? (b) Use the regression function in Part (a) to determine the mean yield for a plot planted on May 7 with a density of 41,182 plants/ha. (Give the exact answer.) (c) Would the mean yield be higher for a planting date of May 7 or May 23 (for the same density)? You may need to use the appropriate table in Appendix A to answer this question.
- A researcher interested in explaining the level of foreign reserves for the country of Barbados estimated the following multiple regression model using yearly data spanning the period 2001 to 2016:
  FR = α + βOIL + γEXP + δFDI
  where FR = yearly foreign reserves ($000's), OIL = annual oil prices, EXP = yearly total exports ($000's) and FDI = annual foreign direct investment ($000's). The sample of data was processed using MINITAB and the following is an extract of the output obtained:

  Predictor   Coef      StDev     t-ratio   p-value
  Constant    5491.38   2508.81   2.1888    0.0491
  OIL         85.39     18.46     4.626     0.0006
  EXP         -377.08   112.19              0.0057
  FDI         -396.99   160.66    -2.471

  s = 2.45   R-sq = 96.3%   R-sq(adj) = 95.3%

  Analysis of Variance
  Source       DF   SS        MS       F
  Regression   3    1991.31   663.77   ??
  Error        12   77.43     6.45
  Total        15

  a) What are the dependent and independent variables? b) Fully write out the regression equation. c) Fill in the missing values '*', '**', '?' and '??'.
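Several of the questions above hinge on the same arithmetic behind Minitab-style regression output. The sketch below, using illustrative function names and only numbers printed in the surface roughness and foreign reserves outputs, shows the usual relationships: t-ratio = coefficient / standard error, F = mean square regression / mean square error, R² = regression sum of squares / total sum of squares, and number of observations = total degrees of freedom + 1. It is a check on the printed figures under those standard definitions, not a full solution.

```python
def t_ratio(coef, std_err):
    """t-ratio for a single coefficient."""
    return coef / std_err

def f_statistic(ms_regression, ms_error):
    """Model utility F statistic from the two mean squares."""
    return ms_regression / ms_error

def r_squared(ss_regression, ss_total):
    """Coefficient of multiple determination."""
    return ss_regression / ss_total

# Surface roughness output: Total DF = 80, SS and DF as printed
n_obs = 80 + 1
r2_rough = r_squared(401.02, 452.38)
f_rough = f_statistic(401.02 / 4, 51.36 / 76)  # unrounded mean squares

# Foreign reserves output: EXP row and ANOVA mean squares as printed
t_exp = t_ratio(-377.08, 112.19)
f_fr = f_statistic(663.77, 6.45)

print(n_obs, round(100 * r2_rough, 1), round(f_rough, 2))
print(round(t_exp, 3), round(f_fr, 2))
```

Computing F from the unrounded mean squares reproduces the printed 148.35 for the surface roughness model; using the rounded 100.25 and 0.68 would give a slightly different value. A p-value for a t-ratio additionally requires the t distribution with the error degrees of freedom, which this sketch does not attempt.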
Recommended textbooks for you
College Algebra
Algebra
ISBN:9781305115545
Author:James Stewart, Lothar Redlin, Saleem Watson
Publisher:Cengage Learning
Algebra & Trigonometry with Analytic Geometry
Algebra
ISBN:9781133382119
Author:Swokowski
Publisher:Cengage
Functions and Change: A Modeling Approach to Coll...
Algebra
ISBN:9781337111348
Author:Bruce Crauder, Benny Evans, Alan Noell
Publisher:Cengage Learning
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
Correlation Vs Regression: Difference Between them with definition & Comparison Chart; Author: Key Differences; https://www.youtube.com/watch?v=Ou2QGSJVd0U; License: Standard YouTube License, CC-BY
Correlation and Regression: Concepts with Illustrative examples; Author: LEARN & APPLY : Lean and Six Sigma; https://www.youtube.com/watch?v=xTpHD5WLuoA; License: Standard YouTube License, CC-BY