Statistical Techniques in Business and Economics
Statistical Techniques in Business and Economics
16th Edition
ISBN: 9780077639723
Author: Lind
Publisher: Mcgraw-Hill Course Content Delivery
bartleby

Concept explainers

bartleby

Videos

Textbook Question
Book Icon
Chapter 14, Problem 33DE

Refer to the Real Estate data, which report information on homes sold in Goodyear, Arizona. Use the selling price of the home as the dependent variable and determine the regression equation with number of bedrooms, size of the house, center of the city, and number of bathrooms as independent variables.

  1. a. Use a statistical software package to determine the multiple regression equation. Discuss each of the variables. For example, are you surprised that the regression coefficient for distance from the center of the city is negative? How much does a garage or a swimming pool add to the selling price of a home?
  2. b. Determine the value of the Intercept.
  3. c. Develop a correlation matrix. Which independent variables have strong or weak correlations with the dependent variable? Do you see any problems with multicollinearity?
  4. d. Conduct the global test on the set of independent variables. Interpret.
  5. e. Conduct a test of hypothesis on each of the independent variables. Would you consider deleting any of the variables? If so, which ones?
  6. f. Rerun the analysis until only significant regression coefficients remain in the analysis. Identify these variables.
  7. g. Develop a histogram or a stem-and-leaf display of the residuals from the final regression equation developed in part (f). Is it reasonable to conclude that the normality assumption has been met?
  8. h. Plot the residuals against the fitted values from the final regression equation developed in part (f). Plot the residuals on the vertical axis and the fitted values on the horizontal axis.

a.

Expert Solution
Check Mark
To determine

Find the multiple regression equation.

Explain each of the variables.

Answer to Problem 33DE

The multiple regression equation is as follows:

y^=38.7+(7.12)x1+(0.0380)x2(0.929)x3+(23.31)x4+(18.32)x5+(35.81)x6_

Explanation of Solution

Let y is dependent variable, x1x2, x3, x4, x5 and x6 are independent variables.

Where, y is the selling price of a house, x1 is the number of bedrooms, x2 is size of the house, x3 is the distance from the center, x4 is the number of bathrooms, x5 is a house with pool, and x6 is the garage attached.

Step-by-step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Price.
  • Under Continuous predictors, enter the columns of Bedrooms, Size, Center of the city and Bathrooms, pools and Garage.
  • Click OK.

Output using MINITAB software is given below:

Statistical Techniques in Business and Economics, Chapter 14, Problem 33DE , additional homework tip  1

From the above output, the multiple regression equation is as follows:

y^=38.7+(7.12)x1+(0.0380)x2(0.929)x3+(23.31)x4+(18.32)x5+(35.81)x6_

The slope coefficients of bedrooms, size, and center of the city, bedrooms, pool and garage are 7.12, 0.0380, 0.929, 23.31, 18.32, and 35.18, respectively.

For each additional bedroom, the selling price of house will increase by $7,120 on average when all of the other variables are constant.

For each additional square foot of the size in the house, the price of the house will increase by $38.

For each additional mile away from the center of the city, the selling price of home decreases by $929.

For each additional bathroom, then the selling price of home will increase by $2,331 on average.

A house with a pool increases $1,832 to the selling price of the home and a house with garage will increase $3,581 to the selling price of a house.

b.

Expert Solution
Check Mark
To determine

Find the value of intercept.

Answer to Problem 33DE

The value of intercept is $38.7.

Explanation of Solution

From Part (a) output, the value of intercept is $38.7.

The intercept value indicates the price for the home at the center of the city when all other factors are zero.

c.

Expert Solution
Check Mark
To determine

Make the correlation matrix.

Find the independent variables those that have strong or weak correlations with the dependent variables.

Explain whether there are any problems with multicollinearity.

Explanation of Solution

Multicollinearity:

In a multiple regression model, when there is high correlation between two or more independent variables, then multicollinearity occurs.

Due to this multicollinearity the standard errors will be high and there will be no exact estimate of the partial regression coefficient. Moreover, there will be difficulty to measure the relative significance of independent variables.

Step-by-step procedure to obtain the correlation matrix using MINITAB software:

  • Choose Stat > Basic Statistics > Correlation.
  • Select the columns of Price, Size, Bedrooms, Bathrooms, Pool, and Garage under Variables tab.
  • Click OK.

Output obtained using Minitab is as follows:

Statistical Techniques in Business and Economics, Chapter 14, Problem 33DE , additional homework tip  2

From the obtained output, there is a moderate correlation between the price and garage.

The variables price and distance from the center are negatively correlated. Thus, there is no strong correlation among the variables. The independent variables pool and size have weak correlation with the dependent variable.

It can be said that there is no high correlation among the independent variables. Thus, there is no multicollinearity among the independent variables.

d.

Expert Solution
Check Mark
To determine

Conduct a global test on the set of the independent variables and interpret.

Explanation of Solution

The null and alternative hypotheses are stated below:

Null hypothesis:

H0:β1=β2=β3=β4=β5=β6=0

That is, all the regression coefficients are equal to zero.

Alternative hypothesis:

H1: At least one of the regression coefficients is not equal to zero.

From Part (a), the F test statistic value is 18.54 and the p-value is 0.000.

Decision rule:

  • If p-valueα, then reject the null hypothesis.
  • Otherwise, failed to reject the null hypothesis.

Conclusion:

Consider the significance level as 0.05.

Here, the p-value(=0.000)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that all the regression coefficients are not equal to zero.

e.

Expert Solution
Check Mark
To determine

Perform individual tests of each independent variable.

Explanation of Solution

For independent variablex1:

Consider that β1 is the population regression coefficient of independent variable x1.

State the hypotheses:

Null hypothesis:

H0:β1=0.

That is, there is no significant relationship between y and x1.

Alternative hypothesis:

H1:β10.

That is, there is significant relationship between y and x1.

In the case of individual regression coefficient test, the t test statistic is defined as follows:

t=bisbi, where bi and sbi are the ith regression coefficient and the standard deviation of the ith regression coefficient.

From Part (a), the t statistic value corresponding to x1 is 2.79.

Consider the level of significance is α=0.05.

Decision rule:

  • If p-valueα, then reject the null hypothesis.
  • Otherwise, failed to reject the null hypothesis.

Conclusion:

Here, the p-value corresponding to the “Bedrooms”(x1) is 0.006.

Hence, p-value(=0.006)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x1.

For independent variablex2:

Consider that β2 is the population regression coefficient of independent variable x2.

State the hypotheses:

Null hypothesis:

H0:β2=0.

That is, there is no significant relationship between y and x2.

Alternative hypothesis:

H1:β20.

That is, there is significant relationship between y and x2.

From output in Part (a), the value of t-test statistic corresponding to x2 is 2.59.

Conclusion:

Here, the p-value corresponding to the “Size”(x2) is 0.011.

The p-value is less than the level of significance.

That is, the p-value(=0.011)<α(=0.05).

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is a significant relationship between y and x2.

For independent variablex3:

Consider that β3 is the population regression coefficient of independent variable x3.

State the hypotheses:

Null hypothesis:

H0:β3=0.

That is, there is no significant relationship between y and x3.

Alternative hypothesis:

H1:β30.

That is, there is significant relationship between y and x3.

From the output in Part (a), the value of t test statistic corresponding to x3 is –6.28.

Conclusion:

Here, the p-value corresponding to the “Center of the city”(x3) is 0.205.

The p-value is greater than the level of significance.

That is, p-value(=0.205)>α(=0.05)

Therefore, fail to reject the null hypothesis.

Hence, it can be concluded that there is no significant relationship between y and x3.

For independent variablex4:

Consider that β4 is the population regression coefficient of independent variable x4.

State the hypotheses:

Null hypothesis:

H0:β4=0.

That is, there is no significant relationship between y and x4.

Alternative hypothesis:

H1:β40.

That is, there is significant relationship between y and x4.

From the output in Part (a), the value of t test statistic corresponding to x4 is 2.58.

Conclusion:

Here, the p-value corresponding to the “bathrooms”(x4) is 0.011.

The p-value is less than the level of significance.

That is, p-value(=0.011)<α(=0.05)

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is a significant relationship between y and x4.

For independent variablex5:

Consider that β5 is the population regression coefficient of independent variable x5.

State the hypotheses:

Null hypothesis:

H0:β5=0.

That is, there is no significant relationship between y and x5.

Alternative hypothesis:

H1:β50.

That is, there is significant relationship between y and x5.

From the output in Part (a), the value of t test statistic corresponding to x5 is 2.62.

Conclusion:

Here, the p-value corresponding to the “pool”(x5) is 0.01.

The p-value is less than the level of significance.

That is, p-value(=0.01)<α(=0.05)

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is a significant relationship between y and x5.

For independent variablex6:

Consider that β6 is the population regression coefficient of independent variable x6.

State the hypotheses:

Null hypothesis:

H0:β6=0.

That is, there is no significant relationship between y and x6.

Alternative hypothesis:

H1:β60.

That is, there is significant relationship between y and x6.

From the output in Part (a), the value of t test statistic corresponding to x6 is 4.69.

Conclusion:

Here, the p-value corresponding to the “Garage”(x6) is 0.000.

The p-value is less than the level of significance.

That is, p-value(=0.000)<α(=0.05)

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is a significant relationship between y and x6.

It is noticed that the p-value for the “center of the city” is greater than the level of significance. Thus, it is not a significant variable and it can be dropped from the model.

f.

Expert Solution
Check Mark
To determine

Perform the regression analysis until only significant regression coefficients remain in the analysis.

Explanation of Solution

Step-by-step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Price.
  • Under Continuous predictors, enter the columns of Bedrooms, Size, Bathrooms, Pools, and Garage.
  • Click OK.

Output using MINITAB software is given below:

Statistical Techniques in Business and Economics, Chapter 14, Problem 33DE , additional homework tip  3

From the above output, the p-values for each of the independent variables are less than 0.05.

Thus, all the regression coefficients are significant for the dependent variable price. Even though the center of the city variable is dropped from the model, the coefficient of determination value is 0.5238 and it does not increase.

Therefore, the independent variables bedrooms, size, bathrooms, pool, and garage have significant effect on the price of house.

g.

Expert Solution
Check Mark
To determine

Provide a histogram for the regression developed in Part (f).

Explain whether the residual follows normal distribution.

Explanation of Solution

Step-by-step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Price.
  • Under Continuous predictors, enter the columns of Screen, Sharp, Samsung, and Sony.
  • Choose Graphs.
  • Under Residual plot select Histogram of residuals.
  • Click OK.

The output obtained using MINITAB is as follows:

Statistical Techniques in Business and Economics, Chapter 14, Problem 33DE , additional homework tip  4

Assumption of normality from histogram:

  • The majority of the observation in the middle and centered on the mean of 0.
  • There are lower frequencies on the tails of the distributions.

According to the given histogram, most of the observations are centered on the mean of 0 and there are fewer frequencies on the tails of the distributions. Thus, it can be considered as roughly symmetric.

Hence, the residuals follow a normal distribution.

h.

Expert Solution
Check Mark
To determine

Plot the residual plot for the regression developed in Part (f).

Explanation of Solution

Follow the same procedure mentioned in Part (g), one can get the following output:

Statistical Techniques in Business and Economics, Chapter 14, Problem 33DE , additional homework tip  5

Assumption for residual analysis for the regression model:

  • The plot of the residuals vs. the observed values of the predictor variable should fall roughly in a horizontal band and symmetric about the x-axis.
  • For a normal probability plot, residuals should be roughly linear.
  • There should not be any observable pattern.

According to the given residual plot, the points are roughly scattered and moreover, there is no particular pattern in the residual plot. A complete haphazard and random nature has been observed.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
Spring is a peak time for selling houses. The file SpringHouses contains the selling price, number of bathrooms, square footage, and number of bedrooms of 26 homes sold in Ft. Thomas, Kentucky, in spring 2018 (realtor.com website) Click on the datafile logo to reference the data. DATA file a. The Excel output for the estimated regression equation that can be used to predict the selling price given the number of bathrooms, square footage, and number of bedrooms in the house: SUMMARY OUTPUT Multiple R R Square Adjusted R Square Standard Error Observations ANOVA Regression statistics Regression Residual Total Multiple R R Square ANOVA Adjusted R Square Standard Error Observations 0.7429 0.5519 0.4907 61948.6931 Regression statistics Regression Residual Total df Intercept Sq Ft Beds Lower 95% 0.9353 -145129.5298 Intercept Baths 0.9528 -49383.5243 0.0180 Sq Ft Beds 0.0326 Does the estimated regression equation provide a good fit to the data? Explain. Hint: If R is greater than 45%, the…
Please answer ASAP. Thank you.
The datasetBody.xlsgives the percent of weight made up of body fat for 100 men as well as other variables such as Age, Weight (lb), Height (in), and circumference (cm) measurements for the Neck, Chest, Abdomen, Ankle, Biceps, and Wrist. We are interested in predicting body fat based on abdomen circumference. Find the equation of the regression line relating to body fat and abdomen circumference. Make a scatter-plot with a regression line. What body fat percent does the line predict for a person with an abdomen circumference of 110 cm? One of the men in the study had an abdomen circumference of 92.4 cm and a body fat of 22.5 percent. Find the residual that corresponds to this observation. Bodyfat Abdomen 32.3 115.6 22.5 92.4 22 86 12.3 85.2 20.5 95.6 22.6 100 28.7 103.1 21.3 89.6 29.9 110.3 21.3 100.5 29.9 100.5 20.4 98.9 16.9 90.3 14.7 83.3 10.8 73.7 26.7 94.9 11.3 86.7 18.1 87.5 8.8 82.8 11.8 83.3 11 83.6 14.9 87 31.9 108.5 17.3…

Chapter 14 Solutions

Statistical Techniques in Business and Economics

Ch. 14 - The following regression output was obtained from...Ch. 14 - A study by the American Realtors Association...Ch. 14 - The manager of High Point Sofa and Chair, a large...Ch. 14 - Prob. 10ECh. 14 - Prob. 11ECh. 14 - A real estate developer wishes to study the...Ch. 14 - Prob. 13CECh. 14 - Prob. 14CECh. 14 - Prob. 15CECh. 14 - Prob. 16CECh. 14 - The district manager of Jasons, a large discount...Ch. 14 - Suppose that the sales manager of a large...Ch. 14 - The administrator of a new paralegal program at...Ch. 14 - Prob. 20CECh. 14 - Prob. 21CECh. 14 - A regional planner is studying the demographics of...Ch. 14 - Great Plains Distributors, Inc. sells roofing and...Ch. 14 - Prob. 24CECh. 14 - Prob. 25CECh. 14 - Prob. 26CECh. 14 - An investment advisor is studying the relationship...Ch. 14 - Prob. 28CECh. 14 - Prob. 29CECh. 14 - The director of special events for Sun City...Ch. 14 - Prob. 31CECh. 14 - Prob. 32CECh. 14 - Refer to the Real Estate data, which report...Ch. 14 - Prob. 34DECh. 14 - Refer to the Buena School District bus data....Ch. 14 - Prob. 1PCh. 14 - Quick-print firms in a large downtown business...Ch. 14 - The following ANOVA output is given. a. Compute...Ch. 14 - Prob. 1CCh. 14 - Prob. 2CCh. 14 - Prob. 3CCh. 14 - In a scatter diagram, the dependent variable is...Ch. 14 - What level of measurement is required to compute...Ch. 14 - If there is no correlation between two variables,...Ch. 14 - Which of the following values indicates the...Ch. 14 - Under what conditions will the coefficient of...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Prob. 1.9PTCh. 14 - In a multiple regression equation, what is the...Ch. 14 - Prob. 1.11PTCh. 14 - Prob. 1.12PTCh. 14 - For a dummy variable, such as gender, how many...Ch. 14 - What is the term given to a table that shows all...Ch. 14 - If there is a linear relationship between the...Ch. 14 - Given the following regression analysis output: a....Ch. 14 - Given the following regression analysis output. a....
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
SEE MORE QUESTIONS
Recommended textbooks for you
Text book image
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Text book image
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
Text book image
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Text book image
College Algebra
Algebra
ISBN:9781938168383
Author:Jay Abramson
Publisher:OpenStax
Text book image
Functions and Change: A Modeling Approach to Coll...
Algebra
ISBN:9781337111348
Author:Bruce Crauder, Benny Evans, Alan Noell
Publisher:Cengage Learning
Correlation Vs Regression: Difference Between them with definition & Comparison Chart; Author: Key Differences;https://www.youtube.com/watch?v=Ou2QGSJVd0U;License: Standard YouTube License, CC-BY
Correlation and Regression: Concepts with Illustrative examples; Author: LEARN & APPLY : Lean and Six Sigma;https://www.youtube.com/watch?v=xTpHD5WLuoA;License: Standard YouTube License, CC-BY