Concept explainers
a.
Explain how one would assess the fit based on the
a.
Explanation of Solution
Calculation:
Answer will vary.
Here the data set B is taken, in which the midterm exam (X) and final exam score (Y) is given.
Hypotheses:
Null hypothesis:
That is, the slope is zero.
Alternative hypothesis:
That is, the slope not equal to zero.
Regression:
Suppose
Where,
The total sum of squares is denoted as,
The regression sum of squares is denoted as,
The error sum of squares is denoted as,
From the regression the fitted line is denoted as,
The 95% confidence interval for the slope,
Where,
Software Procedure:
Step-by-step software procedure to find R-squared using EXCEL is as follows:
- • Open an EXCEL file.
- • In column A and B, enter the Midterm Exam Score and Final Exam Score data.
- • Click on data > click on Data analysis.
- • Choose Regression > click OK.
- • Select Input Y range as the column of Final Exam Score.
- • Select Input X range as the column of Midterm Exam Score.
- • Select the output range.
- • Click OK.
- Output using EXCEL is given below:
Thus, the R-squared value is 0.429.
The coefficient of determination (
The
b.
Interpret the p-value for the F statistic.
b.
Explanation of Solution
Calculation:
For the F-test of the slope the p-value is 0.000.
Decision rule:
If
If
It is assumed that the level of significance is 0.05.
Conclusion:
Here the p-value is less than the level of significance.
That is,
Hence, by the decision rule, reject the null hypothesis.
Therefore, it can be concluded that there is not sufficient evidence to support that the slope is zero.
Hence, the linear model provides significant fit.
c.
Check whether the model’s fit is good enough to be of practical value.
c.
Explanation of Solution
Calculation:
Now, a hypothesis test is needed to check the whether the model provides good fit or not.
Decision rule:
If
If
Critical value:
From the output it is observed that, the
The degrees of freedom is,
Thus, the degrees of freedom is56.
For two tailed test, the critical value for t-test will be,
It is assumed that the level of significance,
Procedure for critical-value using EXCEL:
Software Procedure:
Step-by-step software procedure to obtain critical-value using EXCEL software is as follows:
- • Open an EXCEL file.
- • In cell A1, enter the formula “=F.INV.RT(0.05,1,56)”
- Output using EXCEL software is given below:
Hence, the critical value will be 4.013.
From the output in part (a), the F-statistic value is 42.22.
The level of significance is 0.05.
Conclusion:
Here the F-statistics is greater than the critical value.
That is,
Hence, by the decision rule, reject the null hypothesis.
Therefore, it can be concluded that there is not sufficient evidence to support that the slope is zero.
Hence, linear model provides significant fit.
The coefficient of determination (
Thus, using
Want to see more full solutions like this?
Chapter 12 Solutions
Applied Statistics in Business and Economics
- Introduce yourself and describe a time when you used data in a personal or professional decision. This could be anything from analyzing sales data on the job to making an informed purchasing decision about a home or car. Describe to Susan how to take a sample of the student population that would not represent the population well. Describe to Susan how to take a sample of the student population that would represent the population well. Finally, describe the relationship of a sample to a population and classify your two samples as random, systematic, cluster, stratified, or convenience.arrow_forward1.2.17. (!) Let G,, be the graph whose vertices are the permutations of (1,..., n}, with two permutations a₁, ..., a,, and b₁, ..., b, adjacent if they differ by interchanging a pair of adjacent entries (G3 shown below). Prove that G,, is connected. 132 123 213 312 321 231arrow_forwardYou are planning an experiment to determine the effect of the brand of gasoline and the weight of a car on gas mileage measured in miles per gallon. You will use a single test car, adding weights so that its total weight is 3000, 3500, or 4000 pounds. The car will drive on a test track at each weight using each of Amoco, Marathon, and Speedway gasoline. Which is the best way to organize the study? Start with 3000 pounds and Amoco and run the car on the test track. Then do 3500 and 4000 pounds. Change to Marathon and go through the three weights in order. Then change to Speedway and do the three weights in order once more. Start with 3000 pounds and Amoco and run the car on the test track. Then change to Marathon and then to Speedway without changing the weight. Then add weights to get 3500 pounds and go through the three gasolines in the same order.Then change to 4000 pounds and do the three gasolines in order again. Choose a gasoline at random, and run the car with this gasoline at…arrow_forward
- AP1.2 A child is 40 inches tall, which places her at the 90th percentile of all children of similar age. The heights for children of this age form an approximately Normal distribution with a mean of 38 inches. Based on this information, what is the standard deviation of the heights of all children of this age? 0.20 inches (c) 0.65 inches (e) 1.56 inches 0.31 inches (d) 1.21 inchesarrow_forwardAP1.1 You look at real estate ads for houses in Sarasota, Florida. Many houses range from $200,000 to $400,000 in price. The few houses on the water, however, have prices up to $15 million. Which of the following statements best describes the distribution of home prices in Sarasota? The distribution is most likely skewed to the left, and the mean is greater than the median. The distribution is most likely skewed to the left, and the mean is less than the median. The distribution is roughly symmetric with a few high outliers, and the mean is approximately equal to the median. The distribution is most likely skewed to the right, and the mean is greater than the median. The distribution is most likely skewed to the right, and the mean is less than the median.arrow_forwardDuring busy political seasons, many opinion polls are conducted. In apresidential race, how do you think the participants in polls are generally selected?Discuss any issues regarding simple random, stratified, systematic, cluster, andconvenience sampling in these polls. What about other types of polls, besides political?arrow_forward
- Please could you explain why 0.5 was added to each upper limpit of the intervals.Thanksarrow_forward28. (a) Under what conditions do we say that two random variables X and Y are independent? (b) Demonstrate that if X and Y are independent, then it follows that E(XY) = E(X)E(Y); (e) Show by a counter example that the converse of (ii) is not necessarily true.arrow_forward1. Let X and Y be random variables and suppose that A = F. Prove that Z XI(A)+YI(A) is a random variable.arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman