Commute times: Every morning, Tania leaves for work a few minutes after 7:00 A.M. For eight days, she keeps track of the time she leaves (the number of minutes after 7:00) and the number of minutes it takes her to get to work. Following are the results. Construct a scatterplot of the length of commute ( y ) versus the time leaving ( x ). Compute the least-squares regression line for predicting the length of commute from the time leaving. Compute the coefficient of determination. Which point is an outlier? Remove the outlier and compute the least-squares regression line for predicting the length of commute from the time leaving. Is the outlier influential? Explain. Compute the coefficient of determination for the data set with the outlier removed. Is the relationship stronger. weaker; or about equally strong without the outlier?

Question

Want to see more full solutions like this?

Answer 1

Textbook Question

Chapter 4, Problem 12RE

Commute times: Every morning, Tania leaves for work a few minutes after 7:00 A.M. For eight days, she keeps track of the time she leaves (the number of minutes after 7:00) and the number of minutes it takes her to get to work. Following are the results.

Chapter 4, Problem 12RE, Commute times: Every morning, Tania leaves for work a few minutes after 7:00 A.M. For eight days,

Construct a scatterplot of the length of commute (y) versus the time leaving (x).
Compute the least-squares regression line for predicting the length of commute from the time leaving.
Compute the coefficient of determination.
Which point is an outlier?
Remove the outlier and compute the least-squares regression line for predicting the length of commute from the time leaving.
Is the outlier influential? Explain.
Compute the coefficient of determination for the data set with the outlier removed. Is the relationship stronger. weaker; or about equally strong without the outlier?

a.

Expert Solution

To determine

To Graph:a scatter plot using the length of commute time y versus time leaving x

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

Graph:The scatter plot shows the number of minutes after 7.00 A.M on the x -axis and the number of minutes taken to office on the y -axis.

Elementary Statistics ( 3rd International Edition ) Isbn:9781260092561, Chapter 4, Problem 12RE , additional homework tip 1

Interpretation:Each of the data in the table contributes an ordered pair of the form (number of minutes after 7.00 A.M, number of minutes taken to office).So the ordered pairs to be plotted are

(13,27),(14,20),(16,23),(30,45),(20,20),(12,21),(9,20),(17,28),(16,27),(10,23),(16,30) .

We use a scatter plot for this example because to understand the relationship between the two variables as ordered pairs it is useful. The points tend to cluster around the straight line. Therefore, we conclude that the variable on the x -axis and variable on the y -axis have a linear relationship.

Now consider the three points (9,20),(14,20),(20,20) . These show that the number of minutes after 7.00 A.M on the x -axis does not change with the number of minutes taken to an office on the y -axis. There are few other points in the table to reflect the same idea.

Therefore, we conclude that the variable on the x -axis and variable on the y -axis have a linear relationship, but it is difficult to say whether it is negative or positive.

Because for positive linear relationship the large value of data associates with large values of data in the plot, while for negative linear relationship the large value of data associates with small values of data in the pot. And in this case, it is difficult to say that because the large values of data associate with both small and large and small values of data associate with both small and large at the same time.

Therefore it is good to measure how strong the linear relationship is, to know this we can calculate the correlation coefficient.

b.

Expert Solution

To determine

To Calculate: the least square regression line

Answer to Problem 12RE

When two variables have a linear relationship, the points on a scatter plot tend to cluster around a straight line called the least square regression line. It is simplified to be y∧=26+4×10−31x .

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

Formulas Used:

Sample mean: x¯=∑i=1nxin

Sample variance: s2=∑i=1n ( x i − x ¯ )2n−1

Correlation Coefficient: r=1(n−1).∑i=1n(xi−x¯)sx.∑i=1n(yi−y¯)sy

The least-square regression line:

y∧=b0+b1xb1=rsysxb0=y¯−b1x¯

Calculation: Using the below table for calculation.

xyx¯y¯ (x− x ¯ )2 (y− y ¯ )2x−x¯y−y¯132715.7272727325.818187.4380171.396694−2.7272727271.1818181821420 2.98347133.85124−1.727272727−5.818181821623 0.074387.9421490.272727273−2.818181823045 203.7107367.942114.2727272719.181818182020 18.256233.851244.272727273−5.818181821221 13.8925623.21488−3.727272727−4.81818182920 45.256233.85124−6.727272727−5.818181821728 1.6198354.7603311.2727272732.1818181821627 0.074381.3966940.2727272731.1818181821023 32.801657.942149−5.727272727−2.818181821630 0.0743817.48760.2727272734.181818182

The sample means and the sample variances can be calculated as shown.

x¯= ∑ i=1 n x i nx¯=13+14+16+30+20+12+9+17+16+10+1611x¯=17311x¯=15.72727

s2= ∑ i=1 n ( x i − x ¯ ) 2 n−1sx2= (13−15.72727)2+..............+ (16−15.72727)210sx2=326.181810sx2=32.61818sx=5.71123279161

y¯= ∑ i=1 n y i ny¯=27+20+23+45+20+21+20+28+27+23+3011y¯=28411y¯=25.81818

s2= ∑ i=1 n ( y i − y ¯ ) 2 n−1sy2= (27−25.81818)2+..............+ (30−25.81818)210sy2=533.636410sy2=53.36364sy=7.30504209433

Now, one can use these to calculate the correlation coefficient as shown.

r=1(n−1). ∑ i=1 n ( x i − x ¯ )sx. ∑ i=1 n ( y i − y ¯ )syr=1(11−1).7.1054274× 10 −157.30504209433.1.77636× 10 −145.71123279161r=110(0.972674395×10−15)(0.311029171×10−14)r=0.030253011×10−29r=3.0253011×10−31

Finally, to calculate the least square regression line as shown, theser can be used.

y∧=b0+b1x

Where b0 (intercept) and b1 (slope) can be calculated as shown.

b1=rsysx=(3.0253011×10−31)7.305042094335.71123279161=3.869559×10−31=4.0×10−31b0=y¯−b1x¯=25.81818−(3.869559×10−31)(15.72727273)=25.81818−0=26

c.

Expert Solution

To determine

To Calculate: the coefficient of determination

Answer to Problem 12RE

The coefficient of determination is r2=

9.1524467×10−62

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

Formulas Used:

Sample mean: x¯=∑i=1nxin

Sample variance: s2=∑i=1n ( x i − x ¯ )2n−1

Correlation Coefficient: r=1(n−1).∑i=1n(xi−x¯)sx.∑i=1n(yi−y¯)sy

Calculation: the correlation coefficient can be calculated as shown by using the formula.

r=1(n−1). ∑ i=1 n ( x i − x ¯ )sx. ∑ i=1 n ( y i − y ¯ )syr=1(11−1).7.1054274× 10 −157.30504209433.1.77636× 10 −145.71123279161r=110(0.972674395×10−15)(0.311029171×10−14)r=0.030253011×10−29r=3.0253011×10−31

The correlation coefficient r measures the strength of a linear relationship. Positive values of r

indicates a positive linear association. Here the value of the correlation coefficient close to zero.

Therefore, one can conclude that the positive linear relationship is weak.

Also,

To calculate the coefficient of determination we need to square the correlation coefficient.

Therefore, the coefficient of determination is r2=

9.1524467×10−62

d.

Expert Solution

To determine

To Find: the outlier point

Explanation of Solution

Given information: Tania leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

Graph: The scatter plot shows the number of minutes after 7.00 A.M on the x -axis and the number of minutes taken to an office on the y -axis.

Elementary Statistics ( 3rd International Edition ) Isbn:9781260092561, Chapter 4, Problem 12RE , additional homework tip 2

Interpretation: An outlier is a value that considerably larger or considerably smaller than most of the values in a data set. It may be resulting from an error in the process of sampling.

So in the given data set, an outlier point can be detected in the ordered pair (30,45) .

Because it is much larger than the other ordered pairs.

e.

Expert Solution

To determine

To Calculate: the least square regression line without the outlier point.

Answer to Problem 12RE

The least square regression line without theoutlier point is y∧=24+10−30x

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

Formulas Used:

Sample mean: x¯=∑i=1nxin

Sample variance: s2=∑i=1n ( x i − x ¯ )2n−1

Correlation Coefficient: r=1(n−1).∑i=1n(xi−x¯)sx.∑i=1n(yi−y¯)sy

The least square regression line:

y∧=b0+b1xb1=rsysxb0=y¯−b1x¯

Calculation: the sample means and the sample variances can be calculated as shown without the outlier point.

x¯= ∑ i=1 n x i nx¯=13+14+1+20+12+9+17+16+10+1610x¯=14310x¯=14.3

y¯= ∑ i=1 n y i ny¯=27+20+23+20+21+20+28+27+23+3010y¯=23910y¯=23.9

s2= ∑ i=1 n ( x i − x ¯ ) 2 n−1sx2= (13−14.3)2+..............+ (16−14.3)29sx2=102.19sx2=11.3444444sx=3.36815148115

s2= ∑ i=1 n ( y i − y ¯ ) 2 n−1sy2= (27−23.9)2+..............+ (30−23.9)29sy2=128.99sy2=14.3222222sy=3.78447

Now, one can use these to calculate the correlation coefficient without the outlier as shown.

r=1(n−1). ∑ i=1 n ( x i − x ¯ )sx. ∑ i=1 n ( y i − y ¯ )syr=1(10−1).−7.1054× 10 −153.36815148115.1.42109× 10 −143.78447119159r=19(−2.1096×10−15)(3.75504×10−15)r=8.80179146×10−31

Finally, one can use these to calculate the least square regression line as shown.

y∧=b0+b1x

Where b0 (intercept) and b1 (slope) can be calculated as shown.

b1=rsysx=(8.80179146×10−31)3.784471193.36815148115=9.889735182×10−31b0=y¯−b1x¯=23.9−(9.889735182×10−31)(14.3)=23.9−141.4232131×10−31=23.9−0=23.9

f.

Expert Solution

To determine

To Show: the outlier is influential

Answer to Problem 12RE

No, the outlier is not that much influential.

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

The least-square regression line with the outlier is y∧=26+4×10−31x .

The least-square regression line without the outlier is y∧=24+10−30x .

The two least square regression lines are so much close and have no huge difference.

Therefore we can conclude that the outlier is not that much influential. Here, the outlier cannot be a result of an error. It is just random data measured along with the other data in the sampling process.

g.

Expert Solution

To determine

To Find: the coefficient of determination without the outlier and discuss its strength.

Answer to Problem 12RE

Coefficient of determination is r2=

77.4715329053

×10−62.

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30

Formula Used:Sample mean: x¯=∑i=1nxin

Sample variance: s2=∑i=1n ( x i − x ¯ )2n−1

Correlation Coefficient: r=1(n−1).∑i=1n(xi−x¯)sx.∑i=1n(yi−y¯)sy

Calculation: to calculate the correlation coefficient without the outlier as shown.

r=1(n−1). ∑ i=1 n ( x i − x ¯ )sx. ∑ i=1 n ( y i − y ¯ )syr=1(10−1).−7.1054× 10 −153.36815148115.1.42109× 10 −143.78447119159r=19(−2.1096×10−15)(3.75504×10−15)r=8.80179146×10−31

The correlation coefficient r measures the strength of a linear relationship. Positive values of r

indicates a positive linear association. It is r=8.8×10−31 without the outlier.

The value is so close to zero .because of the ten to the power is minus thirty-one.

Therefore, one can conclude that the positive linear relationship is very weak without the outlier.

Also,

One can calculate the coefficient of determination by squaring the correlation coefficient.

Therefore, the coefficient of determination is r2=77.4715329053×10−62.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Students have asked these similar questions

Customers experiencing technical difficulty with their Internet cable service may call an 800 number for technical support. It takes the technician between 30 seconds and 11 minutes to resolve the problem. The distribution of this support time follows the uniform distribution. Required: a. What are the values for a and b in minutes? Note: Do not round your intermediate calculations. Round your answers to 1 decimal place. b-1. What is the mean time to resolve the problem? b-2. What is the standard deviation of the time? c. What percent of the problems take more than 5 minutes to resolve? d. Suppose we wish to find the middle 50% of the problem-solving times. What are the end points of these two times?

Exercise 6-6 (Algo) (LO6-3) The director of admissions at Kinzua University in Nova Scotia estimated the distribution of student admissions for the fall semester on the basis of past experience. Admissions Probability 1,100 0.5 1,400 0.4 1,300 0.1 Click here for the Excel Data File Required: What is the expected number of admissions for the fall semester? Compute the variance and the standard deviation of the number of admissions. Note: Round your standard deviation to 2 decimal places.

1. Find the mean of the x-values (x-bar) and the mean of the y-values (y-bar) and write/label each here: 2. Label the second row in the table using proper notation; then, complete the table. In the fifth and sixth columns, show the 'products' of what you're multiplying, as well as the answers. X y x minus x-bar y minus y-bar (x minus x-bar)(y minus y-bar) (x minus x-bar)^2 xy 16 20 34 4-2 5 2 3. Write the sums that represents Sxx and Sxy in the table, at the bottom of their respective columns. 4. Find the slope of the Regression line: bi = (simplify your answer) 5. Find the y-intercept of the Regression line, and then write the equation of the Regression line. Show your work. Then, BOX your final answer. Express your line as "y-hat equals...

Answer 2

Textbook Question

Chapter 4, Problem 12RE

Commute times: Every morning, Tania leaves for work a few minutes after 7:00 A.M. For eight days, she keeps track of the time she leaves (the number of minutes after 7:00) and the number of minutes it takes her to get to work. Following are the results.

Chapter 4, Problem 12RE, Commute times: Every morning, Tania leaves for work a few minutes after 7:00 A.M. For eight days,

Construct a scatterplot of the length of commute (y) versus the time leaving (x).
Compute the least-squares regression line for predicting the length of commute from the time leaving.
Compute the coefficient of determination.
Which point is an outlier?
Remove the outlier and compute the least-squares regression line for predicting the length of commute from the time leaving.
Is the outlier influential? Explain.
Compute the coefficient of determination for the data set with the outlier removed. Is the relationship stronger. weaker; or about equally strong without the outlier?

a.

Expert Solution

To determine

To Graph:a scatter plot using the length of commute time y versus time leaving x

Explanation of Solution

Given information: T leaves her home every day few minutes after 7.00 A.M. The time she leaves the home (The number of minutes after 7.00 A.M) and the number of minutes taken to office has been recorded as below.

x	13	14	16	30	20	12	9	17	16	10	16
y	27	20	23	45	20	21	20	28	27	23	30