The correlation for all four data sets. To find: The least-squares regression line for all four data sets. To find: The predicted value for x = 10 using least-squares regression line for all four data sets.

Question

Want to see more full solutions like this?

Answer 1

Question

Compare Compare Correlation is the measure of the degree of the relationship between the variables. In regression, the relationship between the variables to predict one by another is assessed. In correlation there is no cause and effect between the variables, whereas in regression there is a cause and effect between the variables. By using correlation, we cannot predict anything, but regression is a predictive tool. Coefficients are symmetrical in correlation but in regression they are asymmetrical. Correlation is independent of change in origin and scale, but regression is not independent of change in scale.

Chapter 5, Problem 5.42E

(a)

To determine

To find: The correlation for all four data sets.

To find: The least-squares regression line for all four data sets.

To find: The predicted value for x=10 using least-squares regression line for all four data sets.

(a)

Expert Solution

Answer to Problem 5.42E

The correlation for the data set A is 0.816.

The correlation for the data set B is 0.816.

The correlation for the data set C is 0.816.

The correlation for the data set D is 0.8176.

The least-squares regression line for the data set A is y^=3.00+0.500x.

The least-squares regression line for the data set B is y^=3.00+0.500x.

The least-squares regression line for the data set C is y^=3.00+0.500x.

The least-squares regression line for the data set D is y^=3.00+0.500x.

The predicted value for x=10 using least-squares regression line for the data set A is 8.001.

The predicted value for x=10 using least-squares regression line for the data set B is 8.001.

The predicted value for x=10 using least-squares regression line for the data set C is 8.000.

The predicted value for x=10 using least-squares regression line for the data set D is 8.001.

Explanation of Solution

Given info:

The four data sets are used to exploring the correlation and regression.

Calculation:

Correlation for Data set A:

Software procedure:

Step-by-step procedure to find the correlation between the x and y for data set A by using the MINITAB software:

Select Stat >Basic Statistics > Correlation.
In Variables, select x and y.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 1

From the MINITAB output, the correlation between the x and y for data set A is 0.816.

Correlation for Data set B:

Software procedure:

Step-by-step procedure to find the correlation between the x and y for data set B by using the MINITAB software:

Select Stat >Basic Statistics > Correlation.
In Variables, select x and y.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 2

From the MINITAB output, the correlation between the x and y for data set B is 0.816.

Correlation for Data set C:

Software procedure:

Step-by-step procedure to find the correlation between the x and y for data set C by using the MINITAB software:

Select Stat >Basic Statistics > Correlation.
In Variables, select x and y.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 3

From the MINITAB output, the correlation between the x and y for data set C is 0.816.

Correlation for Data set D:

Software procedure:

Step-by-step procedure to find the correlation between the x and y for data set D by using the MINITAB software:

Select Stat >Basic Statistics > Correlation.
In Variables, select x and y.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 4

From the MINITAB output, the correlation between the x and y for data set D is 0.817.

Equation of the least-squares line for Data set A:

Software procedure:

Step-by-step procedure to find the equation of the least-squares line by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 5

From the MINITAB output, the least-squares line for predicting y from x for data set A is y^=3.00+0.500x.

Equation of the least-squares line for Data set B:

Software procedure:

Step-by-step procedure to find the equation of the least-squares line by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 6

From the MINITAB output, the least-squares line for predicting y from x for data set B is y^=3.00+0.500x.

Equation of the least-squares line for Data set C:

Software procedure:

Step-by-step procedure to find the equation of the least-squares line by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 7

From the MINITAB output, the least-squares line for predicting y from x for data set C is y^=3.00+0.500x.

Equation of the least-squares line for Data set D:

Software procedure:

Step-by-step procedure to find the equation of the least-squares line by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 8

From the MINITAB output, the least-squares line for predicting y from x for data set D is y^=3.00+0.500x.

Predicted value for x=10 for Data set A:

Software procedure:

Step-by-step procedure to find the predicted value for x=10 using least-squares regression line for the data set A by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
In option, enter 10 under prediction.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 9

From the MINITAB output, the predicted value for x=10 using least-squares regression line for the data set A is 8.001.

Predicted value for x=10 for Data set B:

Software procedure:

Step-by-step procedure to find the predicted value for x=10 using least-squares regression line for the data set B by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
In option, enter 10 under prediction.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 10

From the MINITAB output, the predicted value for x=10 using least-squares regression line for the data set B is 8.001.

Predicted value for x=10 for Data set C:

Software procedure:

Step-by-step procedure to find the predicted value for x=10 using least-squares regression line for the data set C by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
In option, enter 10 under prediction.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 11

From the MINITAB output, the predicted value for x=10 using least-squares regression line for the data set C is 8.000.

Predicted value for x=10 for Data set D:

Software procedure:

Step-by-step procedure to find the predicted value for x=10 using least-squares regression line for the data set D by using the MINITAB software:

Choose Stat > Regression > Regression.
In Responses, enter the column of y.
In Predictors, enter the column of x.
In option, enter 10 under prediction.
Click OK.

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 12

From the MINITAB output, the predicted value for x=10 using least-squares regression line for the data set D is 8.001.

From the results, it can be observed that the correlation for all four data sets, the least-squares regression line and the predicted value for x=10 all four data sets are similar.

(b)

To determine

To construct: The scatterplot for each of the data sets with regression line.

(b)

Expert Solution

Answer to Problem 5.42E

Scatterplot for Data set A:

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 13

Scatterplot for Data set B:

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 14

Scatterplot for Data set C:

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 15

Scatterplot for Data set D:

Output using the MINITAB software is given below:

BASIC PRACTICE OF STATISTICS(REISSUE)>C, Chapter 5, Problem 5.42E , additional homework tip 16

Explanation of Solution

Calculation:

Scatterplot:

Software procedure:

Step-by-step procedure to construct scatterplot for x and y for all four data sets by using the MINITAB software:

Choose Graph > Scatter plot.
Choose With Regression, and then click OK.
Under Y variables, enter a column of y.
Under X variables, enter a column of x.
Click OK.

Observation:

The scatterplot shows that the predicted values are passed through the regression line of the model. Moreover, there is outlier that appears in the x and y directions for the data set A, B, and C. Also, the scatterplot for the data set D shows that the most of the points are plotted around 8.

(c)

To determine

To identify: Which of the four cases would you be willing to use the regression line to describe the dependence of y on x.

(c)

Expert Solution

Answer to Problem 5.42E

The data set A would use the regression line to describe the dependence of y on x.

Explanation of Solution

From the scatterplots for all data sets, it can be observed that the points for data set A are scattered around the straight line when compared to the other data sets. Hence, the data set A would use the regression line to describe the dependence of y on x.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Students have asked these similar questions

The managing director of a consulting group has the accompanying monthly data on total overhead costs and professional labor hours to bill to clients. Complete parts a through c Overhead Costs Billable Hours345000 3000385000 4000410000 5000462000 6000530000 7000545000 8000

Using the accompanying Home Market Value data and associated regression line, Market ValueMarket Valueequals=$28,416plus+$37.066×Square Feet, compute the errors associated with each observation using the formula e Subscript ieiequals=Upper Y Subscript iYiminus−ModifyingAbove Upper Y with caret Subscript iYi and construct a frequency distribution and histogram. Square Feet Market Value1813 911001916 1043001842 934001814 909001836 1020002030 1085001731 877001852 960001793 893001665 884001852 1009001619 967001690 876002370 1139002373 1131001666 875002122 1161001619 946001729 863001667 871001522 833001484 798001589 814001600 871001484 825001483 787001522 877001703 942001485 820001468 881001519 882001518 885001483 765001522 844001668 909001587 810001782 912001483 812001519 1007001522 872001684 966001581 86200

a. Find the value of A.b. Find pX(x) and py(y).c. Find pX|y(x|y) and py|X(y|x)d. Are x and y independent? Why or why not?

Answer 2