Concept explainers
(a)
>The least squares regression line for the given data set.
(a)
>Answer to Problem 24E
Explanation of Solution
Given information:
The following table represents the ages of the last
U.S. presidents and their wives on the first day of their presidencies:
Concepts Used:
The equation for least-square regression line:
Where
The
Where,
The standard deviations are given by:
Calculation:
The mean of
The mean of
The data can be represented in tabular form as:
Hence, the standard deviation is given by:
And,
Consider,
Putting the values in the formula,
Putting the values to obtain
Putting the values to obtain
Hence, the least-square regression line is given by:
Therefore, the least squares regression line for the given data set is
(b)
>The coefficient of determination.
(b)
>Answer to Problem 24E
Explanation of Solution
Given information:
Same as part
Calculation:
From part
The coefficient of determination is given by:
Where
Putting the values to obtain Coefficient of Determination,
Therefore, the Coefficient of Determination is
(c)
>A
(c)
>Answer to Problem 24E
Explanation of Solution
Given information:
Same as part
Calculation:
Consider age of the president as
The points representing the data would be given by:
Plotting the points to make a scatter plot:
(d)
>The outlier points.
(d)
>Answer to Problem 24E
Explanation of Solution
Given information:
Same as part
Calculation:
From the given table, it can be seen that among all the
Therefore, the outlier point would be
(e)
>The least squares regression line for the given data set by excluding the outlier.
(e)
>Answer to Problem 24E
Explanation of Solution
Given information:
Same as part
Concepts used:
The equation for least-square regression line:
Where
The correlation coefficient of a data is given by:
Where,
The standard deviations are given by:
Calculation:
From part
Excluding the outlier,
The mean of
The mean of
The data can be represented in tabular form as:
Hence, the standard deviation is given by:
And,
Consider,
Putting the values in the formula,
Putting the values to obtain
Putting the values to obtain
Hence, the least-square regression line is given by:
Therefore, the least squares regression line for the given data set by removing the outlier is
(f)
>Whether the outlier is influential.
(f)
>Answer to Problem 24E
The outlier is influential.
Explanation of Solution
Given information:
Same as part
Calculation:
From part
From part
From the equations, it can be seen that removing the outlier creates a great difference in the equation of the least square regression line.
Therefore, the outlier is influential.
(g)
>The coefficient of determination for the data set with the outlier removed.
(g)
>Answer to Problem 24E
Coefficient of Determination is
The proportion of variation is more without the outlier.
Explanation of Solution
Given information:
Same as part
Calculation:
From part
The coefficient of determination is given by:
Where
Plugging the values to obtain Coefficient of Determination,
Therefore, the Coefficient of Determination is
Here the coefficient of determination increased without the outlier.
Hence, the proportion of variance explained is more without the outlier.
Want to see more full solutions like this?
Chapter 4 Solutions
Elementary Statistics (Text Only)
- AP1.1 You look at real estate ads for houses in Sarasota, Florida. Many houses range from $200,000 to $400,000 in price. The few houses on the water, however, have prices up to $15 million. Which of the following statements best describes the distribution of home prices in Sarasota? The distribution is most likely skewed to the left, and the mean is greater than the median. The distribution is most likely skewed to the left, and the mean is less than the median. The distribution is roughly symmetric with a few high outliers, and the mean is approximately equal to the median. The distribution is most likely skewed to the right, and the mean is greater than the median. The distribution is most likely skewed to the right, and the mean is less than the median.arrow_forwardDuring busy political seasons, many opinion polls are conducted. In apresidential race, how do you think the participants in polls are generally selected?Discuss any issues regarding simple random, stratified, systematic, cluster, andconvenience sampling in these polls. What about other types of polls, besides political?arrow_forwardPlease could you explain why 0.5 was added to each upper limpit of the intervals.Thanksarrow_forward
- 28. (a) Under what conditions do we say that two random variables X and Y are independent? (b) Demonstrate that if X and Y are independent, then it follows that E(XY) = E(X)E(Y); (e) Show by a counter example that the converse of (ii) is not necessarily true.arrow_forward1. Let X and Y be random variables and suppose that A = F. Prove that Z XI(A)+YI(A) is a random variable.arrow_forward30. (a) What is meant by the term "product measur"? ANDarrow_forward
- Big Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL
- Linear Algebra: A Modern IntroductionAlgebraISBN:9781285463247Author:David PoolePublisher:Cengage Learning