![STATISTICAL TECHNIQUES-ACCESS ONLY](https://www.bartleby.com/isbn_cover_images/9780077639648/9780077639648_largeCoverImage.gif)
Concept explainers
Refer to the Baseball 2012 data, which reports information on the 2012 Major League Baseball season. Let attendance be the dependent variable and total team salary, in millions of dollars, be the independent variable. Determine the regression equation and answer the following questions.
- a. Draw a
scatter diagram . From the diagram, does there seem to be a direct relationship between the two variables? - b. What is the expected attendance for a team with a salary of $80.0 million?
- c. If the owners pay an additional $30 million, how many more people could they expect to attend?
- d. At the .05 significance level, can we conclude that the slope of the regression line is positive? Conduct the appropriate test of hypothesis.
- e. What percentage of the variation in attendance is accounted for by salary?
- f. Determine the
correlation between attendance and team batting average and between attendance and team ERA. Which is stronger? Conduct an appropriate test of hypothesis for each set of variables.
a.
![Check Mark](/static/check-mark.png)
Construct a scatter diagram.
Check whether there is a direct or indirect relationship between the two variables.
Answer to Problem 63DE
The scatter diagram of the data is as follows:
There seems to be a direct relationship between the team’s salary and attendance.
Explanation of Solution
Step-by-step procedure to obtain the scatterplot using the MegaStat software:
- In an EXCEL sheet enter the data values of x and y.
- Go to Add-Ins > MegaStat > Correlation/Regression > Scatterplot.
- Enter horizontal axis as $A$1:$A$81 and vertical axis as $B$1:$B$81.
- Click on OK.
The scatterplot of the data indicates an increasing trend. Therefore, the relationship between the two variables is direct.
b.
![Check Mark](/static/check-mark.png)
Find the expected attendance for a team with a salary of $80.0 million.
Answer to Problem 63DE
The expected attendance for a team with a salary of $80.0 million is 2.25295.
Explanation of Solution
Step-by-step procedure to obtain the ‘regression equation’ using the MegaStat software:
- In an EXCEL sheet enter the data values of x and y.
- Go to Add-Ins > MegaStat > Correlation/Regression > Regression Analysis.
- Select input range as ‘Sheet1!$B$2:$B$81’ under Y/Dependent variable.
- Select input range ‘Sheet1!$A$2:$A$81’ under X/Independent variables.
- Select ‘Type in predictor values’.
- Enter 80.0 as ‘predictor values’ and 95% as ‘confidence level’.
- Click on OK.
Output obtained using the MegaStat software is given below:
From the above output, the regression equation is as follows:
The expected attendance for a team with a salary of $80.0 million is 2.25295.
c.
![Check Mark](/static/check-mark.png)
Find the number of people expected to attend if the owners pay an additional amount of $30 million.
Answer to Problem 63DE
The mean number of people expected to attend if the owners pay an additional amount of $30 million is 0.4000 million.
Explanation of Solution
From Part (b), the regression equation is as follows:
The expected number of people who attend is as follows:
The expected number of people who attend if the owners pay an additional amount of $30million is the difference between the expected attendance with a salary of 110 and the expected attendance with a salary of 80 million.
That is,
Thus, the mean number of people expected to attend when the owners pay an additional amount of $30 million is 0.4000 million.
d.
![Check Mark](/static/check-mark.png)
Check whether the slope of the regression line is positive or not.
Answer to Problem 63DE
There is sufficient evidence to conclude that the slope of the regression line is positive at the 5% level of significance.
Explanation of Solution
Define
Null hypothesis:
That is, the slope of the regression line is less than or equal to zero.
Alternate hypothesis:
That is, the slope of the regression line is greater than zero.
Consider the level of significance as 0.05.
From Part (b), the standard error of
The test statistic is calculated as follows:
Thus, the test statistic value is 6.38.
Here, the sample size is
Critical value:
Step-by-step software procedure to obtain the critical value
· Open an EXCEL file.
· In cell A1, enter the formula “=T.INV(0.95,28)”.
Output obtained using the EXCEL is given as follows:
From the EXCEL output, the critical value is 1.701.
Decision rule based on critical value:
Reject the null hypothesis if
Conclusion:
The t-calculated value is 6.38 and the critical value is 1.701.
That is,
Thus, the null hypothesis is rejected.
Hence, there is sufficient evidence to conclude that the slope of the regression line is positive at the 5% level of significance.
e.
![Check Mark](/static/check-mark.png)
Find the coefficient of determination and interpret it.
Answer to Problem 63DE
There is 59.30% of variation in attendance, which can be explained by the salary.
Explanation of Solution
From Part (b), the coefficient of determination is 0.593.
Thus, there is 59.30% variation in attendance, which can be explained by the salary.
f.
![Check Mark](/static/check-mark.png)
Find the correlation between attendance and team batting average.
Find the correlation between attendance and team ERA.
Conduct a hypothesis test for the variables’ attendance and team’s batting average.
Conduct a hypothesis test for attendance and team ERA.
Answer to Problem 63DE
The correlation between attendance and team batting average is 0.630.
The correlation between attendance and team ERA is –0.055.
There is a positive association between attendance and team’s batting average at the 0.05 significance level.
There is no association between attendance and team’s ERA at the 0.05 significance level.
Explanation of Solution
Correlation between attendance and team’s batting average is as follows:
Step-by-step procedure to obtain the correlation between attendance and team’s batting average using Excel Mega stat software:
- Choose MegaStat > Correlation/Regression > Correlation Matrix.
- In Input ranges, Select the data values.
- Click OK.
The output obtained using Mega stat is as follows:
From the above output, the correlation between attendance and team’s batting average is 0.630.
Correlation between attendance and team’s ERA is as follows:
Follow the above procedure to obtain the correlation between attendance and team’s ERA as follows:
The output obtained using Mega stat is as follows:
From the above output, the correlation between attendance and team’s ERA is –0.055.
The correlation between attendance and team’s batting average is positively correlated, and the correlation between attendance and team’s ERA is negatively associated.
Therefore, it is noticed that the correlation between attendance and team’s batting average is stronger than the correlation between attendance and team’s ERA.
Hypothesis test for attendance and team’s batting average:
The null and alternative hypotheses are stated below:
Null hypothesis:
That is, the correlation between attendance and team’s batting average is less than or equal to zero.
Alternative hypothesis:
That is, the correlation between “attendance and team batting average” is greater than zero.
Here, the sample size is 30 and the correlation coefficient between “attendance and team batting average is 0.630.
The test statistic is as follows:
Thus, the test statistic value is 4.29.
The degrees of freedom is as follows:
The level of significance is 0.05. Therefore,
Critical value:
Step-by-step software procedure to obtain the critical value using the EXCEL software:
- Open an EXCEL file.
- In cell A1, enter the formula “=T.INV (0.95, 28)”.
Output obtained using EXCEL is given as follows:
From the above output, the critical value is 1.701.
Decision rule:
Reject the null hypothesis H0, if
Conclusion:
The value of test statistic is 4.29 and the critical value is 1.701.
Here,
By the rejection rule, reject the null hypothesis.
Thus, it can be concluded that there is a positive association between attendance and team’s batting average.
Hypothesis test for attendance and team’s ERA:
The hypotheses are given below:
Null hypothesis:
That is, the correlation between attendance and team’s ERA is greater than or equal to zero.
Alternative hypothesis:
That is, the correlation between “attendance and team ERA” is less than zero.
Here, the sample size is 30 and the correlation between attendance and team’s ERA is –0.055.
The test statistic is as follows:
Thus, the t-test statistic value is –0.29.
Critical value:
Step-by-step software procedure to obtain the critical value using the EXCEL software:
- Open an EXCEL file.
- In cell A1, enter the formula “=T.INV (0.95, 28)”.
Output obtained using EXCEL is given as follows:
From the above output, the critical vale is –1.7011.
Conclusion:
The value of test statistic is –0.29 and the critical value is –1.7011.
Here,
By the rejection rule, fail to reject the null hypothesis.
Thus, it can be concluded that there is no association between “attendance and team ERA”.
Want to see more full solutions like this?
Chapter 13 Solutions
STATISTICAL TECHNIQUES-ACCESS ONLY
- Suppose a random sample of 459 married couples found that 307 had two or more personality preferences in common. In another random sample of 471 married couples, it was found that only 31 had no preferences in common. Let p1 be the population proportion of all married couples who have two or more personality preferences in common. Let p2 be the population proportion of all married couples who have no personality preferences in common. Find a95% confidence interval for . Round your answer to three decimal places.arrow_forwardA history teacher interviewed a random sample of 80 students about their preferences in learning activities outside of school and whether they are considering watching a historical movie at the cinema. 69 answered that they would like to go to the cinema. Let p represent the proportion of students who want to watch a historical movie. Determine the maximal margin of error. Use α = 0.05. Round your answer to three decimal places. arrow_forwardA random sample of medical files is used to estimate the proportion p of all people who have blood type B. If you have no preliminary estimate for p, how many medical files should you include in a random sample in order to be 99% sure that the point estimate will be within a distance of 0.07 from p? Round your answer to the next higher whole number.arrow_forward
- A clinical study is designed to assess the average length of hospital stay of patients who underwent surgery. A preliminary study of a random sample of 70 surgery patients’ records showed that the standard deviation of the lengths of stay of all surgery patients is 7.5 days. How large should a sample to estimate the desired mean to within 1 day at 95% confidence? Round your answer to the whole number.arrow_forwardA clinical study is designed to assess the average length of hospital stay of patients who underwent surgery. A preliminary study of a random sample of 70 surgery patients’ records showed that the standard deviation of the lengths of stay of all surgery patients is 7.5 days. How large should a sample to estimate the desired mean to within 1 day at 95% confidence? Round your answer to the whole number.arrow_forwardIn the experiment a sample of subjects is drawn of people who have an elbow surgery. Each of the people included in the sample was interviewed about their health status and measurements were taken before and after surgery. Are the measurements before and after the operation independent or dependent samples?arrow_forward
- iid 1. The CLT provides an approximate sampling distribution for the arithmetic average Ỹ of a random sample Y₁, . . ., Yn f(y). The parameters of the approximate sampling distribution depend on the mean and variance of the underlying random variables (i.e., the population mean and variance). The approximation can be written to emphasize this, using the expec- tation and variance of one of the random variables in the sample instead of the parameters μ, 02: YNEY, · (1 (EY,, varyi n For the following population distributions f, write the approximate distribution of the sample mean. (a) Exponential with rate ẞ: f(y) = ß exp{−ßy} 1 (b) Chi-square with degrees of freedom: f(y) = ( 4 ) 2 y = exp { — ½/ } г( (c) Poisson with rate λ: P(Y = y) = exp(-\} > y! y²arrow_forward2. Let Y₁,……., Y be a random sample with common mean μ and common variance σ². Use the CLT to write an expression approximating the CDF P(Ỹ ≤ x) in terms of µ, σ² and n, and the standard normal CDF Fz(·).arrow_forwardmatharrow_forward
- Compute the median of the following data. 32, 41, 36, 42, 29, 30, 40, 22, 25, 37arrow_forwardTask Description: Read the following case study and answer the questions that follow. Ella is a 9-year-old third-grade student in an inclusive classroom. She has been diagnosed with Emotional and Behavioural Disorder (EBD). She has been struggling academically and socially due to challenges related to self-regulation, impulsivity, and emotional outbursts. Ella's behaviour includes frequent tantrums, defiance toward authority figures, and difficulty forming positive relationships with peers. Despite her challenges, Ella shows an interest in art and creative activities and demonstrates strong verbal skills when calm. Describe 2 strategies that could be implemented that could help Ella regulate her emotions in class (4 marks) Explain 2 strategies that could improve Ella’s social skills (4 marks) Identify 2 accommodations that could be implemented to support Ella academic progress and provide a rationale for your recommendation.(6 marks) Provide a detailed explanation of 2 ways…arrow_forwardQuestion 2: When John started his first job, his first end-of-year salary was $82,500. In the following years, he received salary raises as shown in the following table. Fill the Table: Fill the following table showing his end-of-year salary for each year. I have already provided the end-of-year salaries for the first three years. Calculate the end-of-year salaries for the remaining years using Excel. (If you Excel answer for the top 3 cells is not the same as the one in the following table, your formula / approach is incorrect) (2 points) Geometric Mean of Salary Raises: Calculate the geometric mean of the salary raises using the percentage figures provided in the second column named “% Raise”. (The geometric mean for this calculation should be nearly identical to the arithmetic mean. If your answer deviates significantly from the mean, it's likely incorrect. 2 points) Starting salary % Raise Raise Salary after raise 75000 10% 7500 82500 82500 4% 3300…arrow_forward
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtFunctions and Change: A Modeling Approach to Coll...AlgebraISBN:9781337111348Author:Bruce Crauder, Benny Evans, Alan NoellPublisher:Cengage Learning
- Holt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL
![Text book image](https://www.bartleby.com/isbn_cover_images/9780079039897/9780079039897_smallCoverImage.jpg)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781680331141/9781680331141_smallCoverImage.jpg)
![Text book image](https://www.bartleby.com/isbn_cover_images/9781337111348/9781337111348_smallCoverImage.gif)
![Text book image](https://www.bartleby.com/isbn_cover_images/9780547587776/9780547587776_smallCoverImage.jpg)