Statistical Techniques in Business and Economics
Statistical Techniques in Business and Economics
16th Edition
ISBN: 9780077639723
Author: Lind
Publisher: Mcgraw-Hill Course Content Delivery
bartleby

Videos

Textbook Question
Book Icon
Chapter 14, Problem 23CE

Great Plains Distributors, Inc. sells roofing and siding products to home improvement retailers, such as Lowe’s and Home Depot, and commercial contractors. The owner is interested in studying the effects of several variables on the sales volume of fiber-cement siding products.

The company has 26 marketing districts across the United States. In each district, it collected information on the following variables: sales volume (in thousands of dollars), advertising dollars (in thousands), number of active accounts, number of competing brands, and a rating of market potential.

Chapter 14, Problem 23CE, Great Plains Distributors, Inc. sells roofing and siding products to home improvement retailers,

Conduct a multiple regression analysis to find the best predictors of sales.

  1. a. Draw a scatter diagram comparing sales volume with each of the independent variables. Comment on the results.
  2. b. Develop a correlation matrix. Do you see any problems? Does it appear there are any redundant independent variables?
  3. c. Develop a regression equation. Conduct the global test. Can we conclude that some of the independent variables are useful in explaining the variation in the dependent variable?
  4. d. Conduct a test of each of the independent variables. Are there any that should be dropped?
  5. e. Refine the regression equation so the remaining variables are all significant.
  6. f. Develop a histogram of the residuals and a normal probability plot. Are there any problems?
  7. g. Determine the variance inflation factor for each of the independent variables. Are there any problems?

a.

Expert Solution
Check Mark
To determine

Make a scatter diagram that compares sales with each of the independent variables.

Explain the results.

Answer to Problem 23CE

The scatter diagram is obtained below,

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  1

Explanation of Solution

Step by step procedure to obtain the correlation matrix using MINITAB software is given below:

  • Choose Graph > Matrix plot.
  • Under Each Y versus each X, select Simple.
  • Under Y variable select the column of Sales.
  • Under X variables select the columns of Ad Dollars, Number of accounts, Number of Competitors, and Potential.
  • Click OK.

The MINITAB output is obtained.

According to the output, as the number of competitors increases the sales decreases. In other hand, as the number of accounts and the rating of market potential increases, the sales also increase. However, there is no linear relationship between the Ad Dollars with the sales.

b.

Expert Solution
Check Mark
To determine

Make the correlation matrix.

Explain whether there is any problem due to redundant independent variables.

Answer to Problem 23CE

The correlation matrix is obtained as,

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  2

Explanation of Solution

Multiple linear regression model:

A multiple linear regression model is given as y^=a+b1x1+b2x2+b3x3+...+bkxk where y is the response or dependent variable, and x1,x2,...,xk are the k quantitative independent variables where k is a positive integer.

Here, a is the intercept term of the regression model, that is, the value of predicted value of y when X’s are 0 and bi’s are the slopes, that is, the amount of change of the predicted value of y for one unit increase in xi when all other independent variables are constant.

In the given problem the predicted dependent variable y is the sales. The years of advertising dollars, the number of accounts, the number of competitors and the rating of market potential, are defined as x1,x2,x3andx4, respectively.

Step by step procedure to obtain the correlation matrix using MINITAB software is given below:

  • Choose Stat > Basic Statistics > Correlation.
  • Select the columns of Sales, Ad Dollars, Number of accounts, Number of Competitors, and Potential under Variables tab.
  • Click OK.

The MINITAB output is obtained.

Multicollinearity:

In a multiple regression model, when there is high correlation between two or more independent variables, then multicollinearity occurs.

There is moderate correlation between the independent variables “Potential” and “Number of Accounts”.

Thus, there is no chance of occurrence of multicollinearity in the regression model.

c.

Expert Solution
Check Mark
To determine

Find the regression equation and perform a global test.

Answer to Problem 23CE

The regression equation is y^=178.3+1.81x1+3.318x221.185x3+0.325x4_.

Some of the independent variables are useful in explaining the variation in the dependent variable at 0.05 significance level.

Explanation of Solution

Calculation:

Step by step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Sales.
  • Under Continuous predictors, enter the columns of Ad Dollars, Number of accounts, Number of Competitors, and Potential.
  • Click OK.

Output using MINITAB software is given below:

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  3

Thus, the regression equation is y^=178.3+1.81x1+3.318x221.185x3+0.325x4_.

Consider that y is dependent variable and xi's are the independent variables where βi's are the corresponding population regression coefficient for all i=1,2,3,4.

State the hypotheses:

Null hypothesis:

H0:β1=β2=β3=β4=0.

That is, the model is not significant.

Alternative hypothesis:

H1:At least one βi is not equal to 0.

That is, the model is significant.

In case of global test the F test statistic is defined as,

F=SSRkSSEnk1, where SSR, SSE, n and k are the regression sum of square, error sum of square, sample size and the number of independent variables.

According to the output the value of F statistic is 479.1 with numerator degrees of freedom 4 and denominator degrees of freedom 21.

Consider that, the level of significance is α=0.05.

Decision rule:

  • If p-valueα, then reject the null hypothesis.
  • Otherwise failed to reject the null hypothesis.

Conclusion:

Here, p-value corresponding to the global test is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that some of the independent variables are useful in explaining the variation in the dependent variable at 0.05 significance level.

d.

Expert Solution
Check Mark
To determine

Perform individual tests of each independent variables.

Explain whether any variable should be dropped.

Answer to Problem 23CE

There is significant relationship between y and x2andx3, whereas there is no significant relationship between y and x1andx4.

It is better to drop these two variables x1andx4 and perform the regression analysis using x2andx3 as independent variables.

Explanation of Solution

Calculation:

For independent variable x1:

Consider that β1 is the population regression coefficient of independent variable x1.

State the hypotheses:

Null hypothesis:

H0:β1=0.

That is, there is no significant relationship between y and x1.

Alternative hypothesis:

H1:β10.

That is, there is significant relationship between y and x1.

In case of individual regression coefficient test the t test statistic is defined as,

t=bisbi, where bi and sbi are the ith regression coefficient and the standard deviation of the ith regression coefficient.

According to the output in Part (c) the t statistic value corresponding to x1 is 1.67 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Ad Dollars”(x1) is 0.109.

Hence, p-value(=0.109)>α(=0.05).

That is, the p-value is greater than the level of significance.

Therefore, fail to reject the null hypothesis.

Hence, it can be concluded that there is no significant relationship between y and x1.

For independent variable x2:

Consider that β2 is the population regression coefficient of independent variable x2.

State the hypotheses:

Null hypothesis:

H0:β2=0.

That is, there is no significant relationship between y and x2.

Alternative hypothesis:

H1:β20.

That is, there is significant relationship between y and x2.

According to the output in Part (c) the value of t test statistic corresponding to x2 is 20.37 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of accounts”(x2) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x2.

For independent variable x3:

Consider that β3 is the population regression coefficient of independent variable x3.

State the hypotheses:

Null hypothesis:

H0:β3=0.

That is, there is no significant relationship between y and x3.

Alternative hypothesis:

H1:β30.

That is, there is significant relationship between y and x3.

According to the output in Part (c) the value of t test statistic corresponding to x3 is –26.89 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of competitors”(x3) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x3.

For independent variable x4:

Consider that β4 is the population regression coefficient of independent variable x4.

State the hypotheses:

Null hypothesis:

H0:β4=0.

That is, there is no significant relationship between y and x4.

Alternative hypothesis:

H1:β40.

That is, there is significant relationship between y and x4.

According to the output in Part (c) the value of t test statistic corresponding to x4 is 0.69 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Potential”(x4) is 0.495.

Hence, p-value(=0.495)>α(=0.05).

That is, the p-value is greater than the level of significance.

Therefore, fail to reject the null hypothesis.

Hence, it can be concluded that there is no significant relationship between y and x4.

Hence, it can be said that as there is no significant relationship between “Sales” and “Ad Dollars” and between “Sales” and “Potential”, thus it is better to omit these two variables and perform the regression analysis using “Number of accounts” and “Number of competitors” as independent variables.

e.

Expert Solution
Check Mark
To determine

Refine the regression equation so the remaining variables are all significant.

Answer to Problem 23CE

The refined regression equation is y^=186.7+3.408x121.193x2.

Explanation of Solution

Calculation:

In this part the dependent variable is sales (y) and the independent variables are the number of accounts (x1) and the number of competitors (x2).

Step by step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Sales.
  • Under Continuous predictors, enter the columns of Number of accounts, and Number of Competitors.
  • Choose Graphs.
  • Under Residual plot select Histogram of residuals, Normal probability plot of residuals.
  • Click OK.
  • Click OK.

Output using MINITAB software is given below:

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  4

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  5

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  6

Hence, the regression equation is y^=186.7+3.408x121.193x2.

For independent variable x1:

Consider that β1 is the population regression coefficient of independent variable x1.

State the hypotheses:

Null hypothesis:

H0:β1=0.

That is, there is no significant relationship between y and x1.

Alternative hypothesis:

H1:β10.

That is, there is significant relationship between y and x1.

According to the output the t statistic value corresponding to x1 is 23.37 with 23 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of Accounts”(x1) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x1.

For independent variable x2:

Consider that β2 is the population regression coefficient of independent variable x2.

State the hypotheses:

Null hypothesis:

H0:β2=0.

That is, there is no significant relationship between y and x2.

Alternative hypothesis:

H1:β20.

That is, there is significant relationship between y and x2.

According to the output in the value of t test statistic corresponding to x2 is –26.4 with 23 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of competitors”(x2) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x2.

Hence, it can be said that as there is significant relationship between “Sales” and “Number of accounts” and “Number of competitors”.

f.

Expert Solution
Check Mark
To determine

Provide a histogram and normal probability plot for residuals.

Also explain whether there are any problems.

Explanation of Solution

Calculation:

Histogram:

From Part (e), the histogram and normal probability plot is obtained as,

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  7

Statistical Techniques in Business and Economics, Chapter 14, Problem 23CE , additional homework tip  8

Assumption of normality from histogram:

  • The majority of the observation in the middle and centered on the mean of 0.
  • There are lower frequencies on the tails of the distributions.

According to the given histogram, the most of the observations are centered on the mean of 0 and there are less frequencies on the tails of the distributions. In addition, it can be considered as perfect symmetric.

According to the normal probability plot, all of the residuals are linear.

Hence, there are no problems in normality assumptions.

g.

Expert Solution
Check Mark
To determine

Find the variation inflection factor for each of the independent variables.

Explain whether there is any problem.

Explanation of Solution

Variation inflation factor(VIF):

The variation inflation factor is defined as,

VIF=11Ri2 , where Ri2 is the coefficient of determination of ith independent variable.

It is used to detect multicollinearity in the regression. If VIF<10, then three is no presence of multicollinearity.

From Part (e), the variation inflection factor corresponding to both variables are 1.12, which is less than 10.

Hence, there is no presence of multicollinearity.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
Please could you explain why 0.5 was added to each upper limpit of the intervals.Thanks
28. (a) Under what conditions do we say that two random variables X and Y are independent? (b) Demonstrate that if X and Y are independent, then it follows that E(XY) = E(X)E(Y); (e) Show by a counter example that the converse of (ii) is not necessarily true.
1. Let X and Y be random variables and suppose that A = F. Prove that Z XI(A)+YI(A) is a random variable.

Chapter 14 Solutions

Statistical Techniques in Business and Economics

Ch. 14 - The following regression output was obtained from...Ch. 14 - A study by the American Realtors Association...Ch. 14 - The manager of High Point Sofa and Chair, a large...Ch. 14 - Prob. 10ECh. 14 - Prob. 11ECh. 14 - A real estate developer wishes to study the...Ch. 14 - Prob. 13CECh. 14 - Prob. 14CECh. 14 - Prob. 15CECh. 14 - Prob. 16CECh. 14 - The district manager of Jasons, a large discount...Ch. 14 - Suppose that the sales manager of a large...Ch. 14 - The administrator of a new paralegal program at...Ch. 14 - Prob. 20CECh. 14 - Prob. 21CECh. 14 - A regional planner is studying the demographics of...Ch. 14 - Great Plains Distributors, Inc. sells roofing and...Ch. 14 - Prob. 24CECh. 14 - Prob. 25CECh. 14 - Prob. 26CECh. 14 - An investment advisor is studying the relationship...Ch. 14 - Prob. 28CECh. 14 - Prob. 29CECh. 14 - The director of special events for Sun City...Ch. 14 - Prob. 31CECh. 14 - Prob. 32CECh. 14 - Refer to the Real Estate data, which report...Ch. 14 - Prob. 34DECh. 14 - Refer to the Buena School District bus data....Ch. 14 - Prob. 1PCh. 14 - Quick-print firms in a large downtown business...Ch. 14 - The following ANOVA output is given. a. Compute...Ch. 14 - Prob. 1CCh. 14 - Prob. 2CCh. 14 - Prob. 3CCh. 14 - In a scatter diagram, the dependent variable is...Ch. 14 - What level of measurement is required to compute...Ch. 14 - If there is no correlation between two variables,...Ch. 14 - Which of the following values indicates the...Ch. 14 - Under what conditions will the coefficient of...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Prob. 1.9PTCh. 14 - In a multiple regression equation, what is the...Ch. 14 - Prob. 1.11PTCh. 14 - Prob. 1.12PTCh. 14 - For a dummy variable, such as gender, how many...Ch. 14 - What is the term given to a table that shows all...Ch. 14 - If there is a linear relationship between the...Ch. 14 - Given the following regression analysis output: a....Ch. 14 - Given the following regression analysis output. a....
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Recommended textbooks for you
Text book image
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Text book image
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
Text book image
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Text book image
Functions and Change: A Modeling Approach to Coll...
Algebra
ISBN:9781337111348
Author:Bruce Crauder, Benny Evans, Alan Noell
Publisher:Cengage Learning
Text book image
College Algebra
Algebra
ISBN:9781337282291
Author:Ron Larson
Publisher:Cengage Learning
F- Test or F- statistic (F- Test of Equality of Variance); Author: Prof. Arvind Kumar Sing;https://www.youtube.com/watch?v=PdUt7InTyc8;License: Standard Youtube License
Statistics 101: F-ratio Test for Two Equal Variances; Author: Brandon Foltz;https://www.youtube.com/watch?v=UWQO4gX7-lE;License: Standard YouTube License, CC-BY
Hypothesis Testing and Confidence Intervals (FRM Part 1 – Book 2 – Chapter 5); Author: Analystprep;https://www.youtube.com/watch?v=vth3yZIUlGQ;License: Standard YouTube License, CC-BY
Understanding the Levene's Test for Equality of Variances in SPSS; Author: Dr. Todd Grande;https://www.youtube.com/watch?v=udJr8V2P8Xo;License: Standard Youtube License