Concept explainers
a.
Explain how one would assess the fit based on the
a.
Explanation of Solution
Calculation:
Answer will vary.
Here the data set B is taken, in which the midterm exam (X) and final exam score (Y) is given.
Hypotheses:
Null hypothesis:
That is, the slope is zero.
Alternative hypothesis:
That is, the slope not equal to zero.
Regression:
Suppose
Where,
The total sum of squares is denoted as,
The regression sum of squares is denoted as,
The error sum of squares is denoted as,
From the regression the fitted line is denoted as,
The 95% confidence interval for the slope,
Where,
Software Procedure:
Step-by-step software procedure to find R-squared using EXCEL is as follows:
- • Open an EXCEL file.
- • In column A and B, enter the Midterm Exam Score and Final Exam Score data.
- • Click on data > click on Data analysis.
- • Choose Regression > click OK.
- • Select Input Y range as the column of Final Exam Score.
- • Select Input X range as the column of Midterm Exam Score.
- • Select the output range.
- • Click OK.
- Output using EXCEL is given below:
Thus, the R-squared value is 0.429.
The coefficient of determination (
The
b.
Interpret the p-value for the F statistic.
b.
Explanation of Solution
Calculation:
For the F-test of the slope the p-value is 0.000.
Decision rule:
If
If
It is assumed that the level of significance is 0.05.
Conclusion:
Here the p-value is less than the level of significance.
That is,
Hence, by the decision rule, reject the null hypothesis.
Therefore, it can be concluded that there is not sufficient evidence to support that the slope is zero.
Hence, the linear model provides significant fit.
c.
Check whether the model’s fit is good enough to be of practical value.
c.
Explanation of Solution
Calculation:
Now, a hypothesis test is needed to check the whether the model provides good fit or not.
Decision rule:
If
If
Critical value:
From the output it is observed that, the
The degrees of freedom is,
Thus, the degrees of freedom is56.
For two tailed test, the critical value for t-test will be,
It is assumed that the level of significance,
Procedure for critical-value using EXCEL:
Software Procedure:
Step-by-step software procedure to obtain critical-value using EXCEL software is as follows:
- • Open an EXCEL file.
- • In cell A1, enter the formula “=F.INV.RT(0.05,1,56)”
- Output using EXCEL software is given below:
Hence, the critical value will be 4.013.
From the output in part (a), the F-statistic value is 42.22.
The level of significance is 0.05.
Conclusion:
Here the F-statistics is greater than the critical value.
That is,
Hence, by the decision rule, reject the null hypothesis.
Therefore, it can be concluded that there is not sufficient evidence to support that the slope is zero.
Hence, linear model provides significant fit.
The coefficient of determination (
Thus, using
Want to see more full solutions like this?
Chapter 12 Solutions
Loose-leaf For Applied Statistics In Business And Economics
- A technician services mailing machines at companies in the Phoenix area. Depending on the type of malfunction, the service call can take 1, 2, 3, or 4 hours. The different types of malfunctions occur at about the same frequency. Develop a probability distribution for the duration of a service call. Which of the following probability distribution graphs accurately represents the data set? Consider the required conditions for a discrete probability function, shown below.Does this probability distribution satisfy equation (5.1)?Does this probability distribution satisfy equation (5.2)? What is the probability a service call will take three hours? A service call has just come in, but the type of malfunction is unknown. It is 3:00 P.M. and service technicians usually get off at 5:00 P.M. What is the probability the service technician will have to work overtime to fix the machine today?arrow_forwardWest Virginia has one of the highest divorce rates in the nation, with an annual rate of approximately 5 divorces per 1000 people (Centers for Disease Control and Prevention website, January 12, 2012). The Marital Counseling Center, Inc. (MCC) thinks that the high divorce rate in the state may require them to hire additional staff. Working with a consultant, the management of MCC has developed the following probability distribution for x = the number of new clients for marriage counseling for the next year. Excel File: data05-19.xls 10 20 f(x) .05 .10 11 30 40 50 60 .10 .20 .35 .20 a. Is this probability distribution valid? Yes Explain. greater than or equal to 0 f(x) Σf(x) equal to 1 b. What is the probability MCC will obtain more than 30 new clients (to 2 decimals)? c. What is the probability MCC will obtain fewer than 20 new clients (to 2 decimals)? d. Compute the expected value and variance of x. Expected value Variance clients per year squared clients per yeararrow_forwardReconsider the patient satisfaction data in Table 1. Fit a multiple regression model using both patient age and severity as the regressors. (a) Test for significance of regression. (b) Test for the individual contribution of the two regressors. Are both regressor variables needed in the model? (c) Has adding severity to the model improved the quality of the model fit? Explain your answer.arrow_forward
- The output voltage of a power supply is assumed to be normally distributed. Sixteen observations taken at random on voltage are as follows: 10.35, 9.30, 10.00, 9.96, 11.65, 12.00, 11.25, 9.58, 11.54, 9.95, 10.28, 8.37, 10.44, 9.25, 9.38, and 10.85. (a) Test the hypothesis that the mean voltage equals 12 V against a two-sided alternative using a = 0.05. (b) Construct a 95% two-sided confidence interval on μ. (c) Test the hypothesis that σ² = 11 using α = 0.05. (d) Construct a 95% two-sided confidence interval on σ. (e) Construct a 95% upper confidence interval on σ. (f) Does the assumption of normality seem reasonable for the output voltage?arrow_forwardAnalyze the residuals from the regression model on the patient satisfaction data from Exercise 3. Comment on the adequacy of the regression model.arrow_forwardConsider the hypotheses: Hop=po H₁ppo where 2 is known. Derive a general expression for determining the sample size for detecting a true mean of 1μo with probability 1-ẞ if the type I error is a.arrow_forward
- Suppose we wish to test the hypotheses: Họ : | = 15 H₁: 15 where we know that o² = 9.0. If the true mean is really 20, what sample size must be used to ensure that the probability of type II error is no greater than 0.10? Assume that a = 0.05.arrow_forwardTable 1 contains the data from a patient satisfaction survey for a group of 25 randomly selected patients at a hospital. In addition to satisfaction, data were collected on patient age and an index that measured the severity of illness. (a) Fit a linear regression model relating satisfaction to patient age. (b) Test for significance of regression. (c) What portion of the total variability is accounted for by the regressor variable age? Table 1: Patient Satisfaction Data Severity Observation Age (21) (x2) Satisfaction (y) 1 55 50 2 46 24 3 30 46 4 35 48 5 59 58 6 61 60 7 74 65 8 38 42 9 27 42 10 51 50 11 53 38 12 41 30 13 37 31 88 14 24 34 15 42 30 16 50 48 17 58 61 18 60 71 19 62 62 20 68 38 21 70 41 22 79 66 23 63 31 24 39 42 25 49 40 BE225222222222222222 68 77 96 80 43 44 26 88 75 57 56 88 102 88 70 43 46 56 59 26 83 75arrow_forward14 A survey is conducted to determine whether would prefer to work at home, if given the 20 office employees of a certain company chance. The overall results are shown in the first bar graph, and the results broken down by gender are presented in the second. a. Interpret the results of each graph. b. Discuss the added value in including gen- der in the second bar graph. (The second bar graph in this problem is called a side by side bar graph and is often used to show results broken down by two or more variables.) c. Compare the side by side bar graph with the two pie charts that you made for Question 6. Which of the two methods is best for comparing two groups, in your opinion? A Would you prefer to work at home? (n=20) 60 50 40 Percent 20 30 20 30 10 0 No Yes Prefer to work at home? (10 males, 10 females) 80 Percent 60 00 40 40 20- No Yes No Yes Female Malearrow_forward
- Frequency 12 Suppose that a random sample of 270 gradu- ating seniors are asked what their immediate priorities are, including whether buying a house is a priority. The results are shown in the following bar graph. a. The bar graph is misleading; explain why. b. Make a new bar graph that more fairly presents the results. Is Buying a House a Priority? 300 250 200 150 100 50 0 Yes No Undecidedarrow_forwardFrequency 11 A polling organization wants to find out what voters think of Issue X. It chooses a random sample of voters and asks them for their opinions of Issue X: yes, no, or no opinion. I organize the results in the following bar graph. a. Make a frequency table of these results (including the total number). brocb. Evaluate the bar graph as to whether it biz s b fairly represents the results. of beau no STORE TO OW! vd wob spind 550 540 500 vd 480 420 360 300 250 240 Yes No Undecided Opinion on Issue Xarrow_forwardPercent 13 A car dealer specializing in minivan sales saibe conducts a survey to find out more about who its customers are. One of the variables at the company measures is gender; the results of this part of the survey are shown in the following bar graph. pow a. Interpret these results. b. Explain whether you think the bar graph is a fair and accurate representation of this data. 70 Gender of Customers 60 50 40 30 20 10 0 Males Femalesarrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman