What is the relationship between the attendance at a major league ball game and the total number of runs scored? Attendance figures (in thousands) and the runs scored for 8 randomly selected games are shown below. Attendance 24 44 35 60 56 42 27 31 Runs 5 9 6 12 16 13 8 12 Find the correlation coefficient: r=r= Round to 2 decimal places. The null and alternative hypotheses for correlation are: H0:H0: ? μ ρ r == 0 H1:H1: ? ρ r μ ≠≠ 0 The p-value is: (Round to four decimal places) r2r2 = (Round to two decimal places) (Round to two decimal places) Interpret r2r2 : There is a 51% chance that the regression line will be a good predictor for the runs scored based on the attendance of the game. Given any fixed attendance, 51% of all of those games will have the predicted number of runs scored. There is a large variation in the runs scored in baseball games, but if you only look at games with a fixed attendance, this variation on average is reduced by 51%. 51% of all games will have the average number of runs scored. The equation of the linear regression line is: ˆyy^ = + xx (Please show your answers to two decimal places) Use the model to predict the runs scored at a game that has an attendance of 26,000 people. Runs scored = (Please round your answer to the nearest whole number.)
Continuous Probability Distributions
Probability distributions are of two types, which are continuous probability distributions and discrete probability distributions. A continuous probability distribution contains an infinite number of values. For example, if time is infinite: you could count from 0 to a trillion seconds, billion seconds, so on indefinitely. A discrete probability distribution consists of only a countable set of possible values.
Normal Distribution
Suppose we had to design a bathroom weighing scale, how would we decide what should be the range of the weighing machine? Would we take the highest recorded human weight in history and use that as the upper limit for our weighing scale? This may not be a great idea as the sensitivity of the scale would get reduced if the range is too large. At the same time, if we keep the upper limit too low, it may not be usable for a large percentage of the population!
What is the relationship between the attendance at a major league ball game and the total number of runs scored? Attendance figures (in thousands) and the runs scored for 8 randomly selected games are shown below.
Attendance | 24 | 44 | 35 | 60 | 56 | 42 | 27 | 31 |
---|---|---|---|---|---|---|---|---|
Runs | 5 | 9 | 6 | 12 | 16 | 13 | 8 | 12 |
- Find the
correlation coefficient : r=r= Round to 2 decimal places. - The null and alternative hypotheses for correlation are:
H0:H0: ? μ ρ r == 0
H1:H1: ? ρ r μ ≠≠ 0
The p-value is: (Round to four decimal places)
- r2r2 = (Round to two decimal places) (Round to two decimal places)
- Interpret r2r2 :
- There is a 51% chance that the regression line will be a good predictor for the runs scored based on the attendance of the game.
- Given any fixed attendance, 51% of all of those games will have the predicted number of runs scored.
- There is a large variation in the runs scored in baseball games, but if you only look at games with a fixed attendance, this variation on average is reduced by 51%.
- 51% of all games will have the average number of runs scored.
- The equation of the linear regression line is:
ˆyy^ = + xx (Please show your answers to two decimal places) - Use the model to predict the runs scored at a game that has an attendance of 26,000 people.
Runs scored = (Please round your answer to the nearest whole number.)
Step by step
Solved in 4 steps