Refer to the Baseball 2018 data given below, which report information on the 30 Major League Baseball teams for the 2018 season. Let the number of games won be the dependent variable and the following variables be independent variables: team batting average, team earned run average (ERA), number of home runs and whether the team plays in the American or National league (American League is 1 and National League is 0). a. Develop a correlation matrix. (i) Which independent variables have strong or weak correlations with the dependent variable. (ii) Do you see any problems with multicollinearity? Explain your answer. b. Use Excel to determine the multiple regression equation. (i) Write out the regression equation and determine its practical application (i.e., interpret the equation). (ii) Report and interpret the R-square. c. Conduct a global test on the set of independent variables. Interpret. d. Conduct a test of hypothesis on each of the independent variables. Would you consider deleting any of the variables? (i) If so, which ones? (ii) If so, what is your new equation? e. Develop a histogram of the residuals from the final regression equation developed in part (d-ii). Is it reasonable to conclude that the normality assumption has been met? Why or Why not? f. Plot the residuals against the fitted values from the final regression equation developed in part (d-ii). Plot the residuals on the vertical axis and the fitted values on the horizontal axis. What regression assumption is supported? Why is it supported? Team League ($ mil) HR BA Wins ERA Opened mil $ bil Arizona Diamondbacks National 143.32 176 0.235 82 3.72 1998 2.242695 1.21 Atlanta Braves National 130.6 175 0.257 90 3.75 2017 2.555781 1.625 Baltimore Orioles American 127.63 188 0.239 47 5.18 1992 1.564192 1.2 Boston Red Sox American 227.4 208 0.268 108 3.75 1912 2.895575 2.8 Chicago Cubs National 194.26 167 0.258 95 3.65 1914 3.181089 2.9 Chicago White Sox American 71.84 182 0.241 62 4.84 1991 1.608817 1.5 Cincinnati Reds National 100.31 172 0.254 67 4.63 2003 1.629356 1.01 Cleveland Indians American 142.8 216 0.259 91 3.77 1994 1.926701 1.045 Colorado Rockies National 143.97 210 0.256 91 4.33 1995 3.01588 1.1 Detroit Tigers American 130.96 135 0.241 64 4.58 2000 1.85697 1.225 Houston Astros American 163.52 205 0.255 103 3.11 2000 2.980549 1.65 Kansas City Royals American 129.94 155 0.245 58 4.94 1973 1.665107 1.015 Los Angeles Angels American 173.78 214 0.242 80 4.15 1966 3.020216 1.8 Los Angeles Dodgers National 199.58 235 0.25 92 3.38 1962 3.8575 3 Miami Marlins National 91.82 128 0.237 63 4.76 2012 0.811104 1 Milwaukee Brewers National 108.98 218 0.252 96 3.73 2001 2.850875 1.03 Minnesota Twins American 115.51 166 0.25 78 4.5 2010 1.959197 1.15 New York Mets National 150.19 170 0.234 77 4.07 2009 2.224995 2.1 New York Yankees American 179.6 267 0.249 100 3.78 2009 3.482855 4 Oakland Athletics American 80.32 227 0.252 97 3.81 1966 1.573616 1.02 Philadelphia Phillies National 104.3 186 0.234 80 4.14 2004 2.158124 1.7 Pittsburgh Pirates National 91.03 157 0.254 82 4 2001 1.465316 1.26 San Diego Padres National 101.34 162 0.235 66 4.4 2004 2.168536 1.27 San Francisco Giants American 205.67 176 0.254 89 4.13 2000 2.299489 2.85 Seattle Mariners National 160.99 133 0.239 73 3.95 1999 3.156185 1.45 St. Louis Cardinals National 163.78 205 0.249 88 3.85 2006 3.403587 1.9 Tampa Bay Rays American 68.81 150 0.258 90 3.74 1990 1.154973 0.9 Texas Rangers American 140.63 194 0.24 67 4.92 1994 2.107107 1.6 Toronto Blue Jays American 150.95 217 0.244 73 4.85 1989 2.325281 1.35 Washington Nationals National 181.38 191 0.254 82 4.04 2008 2.529604 1.675
Refer to the Baseball 2018 data given below, which report information on the 30 Major League Baseball teams for the 2018 season. Let the number of games won be the dependent variable and the following variables be independent variables: team batting average, team earned run average (ERA), number of home runs and whether the team plays in the American or National league (American League is 1 and National League is 0). a. Develop a correlation matrix. (i) Which independent variables have strong or weak correlations with the dependent variable. (ii) Do you see any problems with multicollinearity? Explain your answer. b. Use Excel to determine the multiple regression equation. (i) Write out the regression equation and determine its practical application (i.e., interpret the equation). (ii) Report and interpret the R-square. c. Conduct a global test on the set of independent variables. Interpret. d. Conduct a test of hypothesis on each of the independent variables. Would you consider deleting any of the variables? (i) If so, which ones? (ii) If so, what is your new equation? e. Develop a histogram of the residuals from the final regression equation developed in part (d-ii). Is it reasonable to conclude that the normality assumption has been met? Why or Why not? f. Plot the residuals against the fitted values from the final regression equation developed in part (d-ii). Plot the residuals on the vertical axis and the fitted values on the horizontal axis. What regression assumption is supported? Why is it supported? Team League ($ mil) HR BA Wins ERA Opened mil $ bil Arizona Diamondbacks National 143.32 176 0.235 82 3.72 1998 2.242695 1.21 Atlanta Braves National 130.6 175 0.257 90 3.75 2017 2.555781 1.625 Baltimore Orioles American 127.63 188 0.239 47 5.18 1992 1.564192 1.2 Boston Red Sox American 227.4 208 0.268 108 3.75 1912 2.895575 2.8 Chicago Cubs National 194.26 167 0.258 95 3.65 1914 3.181089 2.9 Chicago White Sox American 71.84 182 0.241 62 4.84 1991 1.608817 1.5 Cincinnati Reds National 100.31 172 0.254 67 4.63 2003 1.629356 1.01 Cleveland Indians American 142.8 216 0.259 91 3.77 1994 1.926701 1.045 Colorado Rockies National 143.97 210 0.256 91 4.33 1995 3.01588 1.1 Detroit Tigers American 130.96 135 0.241 64 4.58 2000 1.85697 1.225 Houston Astros American 163.52 205 0.255 103 3.11 2000 2.980549 1.65 Kansas City Royals American 129.94 155 0.245 58 4.94 1973 1.665107 1.015 Los Angeles Angels American 173.78 214 0.242 80 4.15 1966 3.020216 1.8 Los Angeles Dodgers National 199.58 235 0.25 92 3.38 1962 3.8575 3 Miami Marlins National 91.82 128 0.237 63 4.76 2012 0.811104 1 Milwaukee Brewers National 108.98 218 0.252 96 3.73 2001 2.850875 1.03 Minnesota Twins American 115.51 166 0.25 78 4.5 2010 1.959197 1.15 New York Mets National 150.19 170 0.234 77 4.07 2009 2.224995 2.1 New York Yankees American 179.6 267 0.249 100 3.78 2009 3.482855 4 Oakland Athletics American 80.32 227 0.252 97 3.81 1966 1.573616 1.02 Philadelphia Phillies National 104.3 186 0.234 80 4.14 2004 2.158124 1.7 Pittsburgh Pirates National 91.03 157 0.254 82 4 2001 1.465316 1.26 San Diego Padres National 101.34 162 0.235 66 4.4 2004 2.168536 1.27 San Francisco Giants American 205.67 176 0.254 89 4.13 2000 2.299489 2.85 Seattle Mariners National 160.99 133 0.239 73 3.95 1999 3.156185 1.45 St. Louis Cardinals National 163.78 205 0.249 88 3.85 2006 3.403587 1.9 Tampa Bay Rays American 68.81 150 0.258 90 3.74 1990 1.154973 0.9 Texas Rangers American 140.63 194 0.24 67 4.92 1994 2.107107 1.6 Toronto Blue Jays American 150.95 217 0.244 73 4.85 1989 2.325281 1.35 Washington Nationals National 181.38 191 0.254 82 4.04 2008 2.529604 1.675
MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
Related questions
Question
Refer to the Baseball 2018 data given below, which report information on the 30 Major League Baseball teams for the 2018 season. Let the number of games won be the dependent variable and the following variables be independent variables: team batting average, team earned run average (ERA), number of home runs and whether the team plays in the American or National league (American League is 1 and National League is 0).
a. Develop a correlation matrix.
(i) Which independent variables have strong or weak correlations with the dependent variable.
(ii) Do you see any problems with multicollinearity? Explain your answer.
b. Use Excel to determine the multiple regression equation.
(i) Write out the regression equation and determine its practical application (i.e., interpret the equation).
(ii) Report and interpret the R-square.
c. Conduct a global test on the set of independent variables. Interpret.
d. Conduct a test of hypothesis on each of the independent variables. Would you consider deleting any of the variables?
(i) If so, which ones?
(ii) If so, what is your new equation?
e. Develop a histogram of the residuals from the final regression equation developed in part (d-ii). Is it reasonable to conclude that the normality assumption has been met? Why or Why not?
f. Plot the residuals against the fitted values from the final regression equation developed in part (d-ii). Plot the residuals on the vertical axis and the fitted values on the horizontal axis. What regression assumption is supported? Why is it supported?
Team | League | ($ mil) | HR | BA | Wins | ERA | Opened | mil | $ bil |
Arizona Diamondbacks | National | 143.32 | 176 | 0.235 | 82 | 3.72 | 1998 | 2.242695 | 1.21 |
Atlanta Braves | National | 130.6 | 175 | 0.257 | 90 | 3.75 | 2017 | 2.555781 | 1.625 |
Baltimore Orioles | American | 127.63 | 188 | 0.239 | 47 | 5.18 | 1992 | 1.564192 | 1.2 |
Boston Red Sox | American | 227.4 | 208 | 0.268 | 108 | 3.75 | 1912 | 2.895575 | 2.8 |
Chicago Cubs | National | 194.26 | 167 | 0.258 | 95 | 3.65 | 1914 | 3.181089 | 2.9 |
Chicago White Sox | American | 71.84 | 182 | 0.241 | 62 | 4.84 | 1991 | 1.608817 | 1.5 |
Cincinnati Reds | National | 100.31 | 172 | 0.254 | 67 | 4.63 | 2003 | 1.629356 | 1.01 |
Cleveland Indians | American | 142.8 | 216 | 0.259 | 91 | 3.77 | 1994 | 1.926701 | 1.045 |
Colorado Rockies | National | 143.97 | 210 | 0.256 | 91 | 4.33 | 1995 | 3.01588 | 1.1 |
Detroit Tigers | American | 130.96 | 135 | 0.241 | 64 | 4.58 | 2000 | 1.85697 | 1.225 |
Houston Astros | American | 163.52 | 205 | 0.255 | 103 | 3.11 | 2000 | 2.980549 | 1.65 |
Kansas City Royals | American | 129.94 | 155 | 0.245 | 58 | 4.94 | 1973 | 1.665107 | 1.015 |
Los Angeles Angels | American | 173.78 | 214 | 0.242 | 80 | 4.15 | 1966 | 3.020216 | 1.8 |
Los Angeles Dodgers | National | 199.58 | 235 | 0.25 | 92 | 3.38 | 1962 | 3.8575 | 3 |
Miami Marlins | National | 91.82 | 128 | 0.237 | 63 | 4.76 | 2012 | 0.811104 | 1 |
Milwaukee Brewers | National | 108.98 | 218 | 0.252 | 96 | 3.73 | 2001 | 2.850875 | 1.03 |
Minnesota Twins | American | 115.51 | 166 | 0.25 | 78 | 4.5 | 2010 | 1.959197 | 1.15 |
New York Mets | National | 150.19 | 170 | 0.234 | 77 | 4.07 | 2009 | 2.224995 | 2.1 |
New York Yankees | American | 179.6 | 267 | 0.249 | 100 | 3.78 | 2009 | 3.482855 | 4 |
Oakland Athletics | American | 80.32 | 227 | 0.252 | 97 | 3.81 | 1966 | 1.573616 | 1.02 |
Philadelphia Phillies | National | 104.3 | 186 | 0.234 | 80 | 4.14 | 2004 | 2.158124 | 1.7 |
Pittsburgh Pirates | National | 91.03 | 157 | 0.254 | 82 | 4 | 2001 | 1.465316 | 1.26 |
San Diego Padres | National | 101.34 | 162 | 0.235 | 66 | 4.4 | 2004 | 2.168536 | 1.27 |
San Francisco Giants | American | 205.67 | 176 | 0.254 | 89 | 4.13 | 2000 | 2.299489 | 2.85 |
Seattle Mariners | National | 160.99 | 133 | 0.239 | 73 | 3.95 | 1999 | 3.156185 | 1.45 |
St. Louis Cardinals | National | 163.78 | 205 | 0.249 | 88 | 3.85 | 2006 | 3.403587 | 1.9 |
Tampa Bay Rays | American | 68.81 | 150 | 0.258 | 90 | 3.74 | 1990 | 1.154973 | 0.9 |
Texas Rangers | American | 140.63 | 194 | 0.24 | 67 | 4.92 | 1994 | 2.107107 | 1.6 |
Toronto Blue Jays | American | 150.95 | 217 | 0.244 | 73 | 4.85 | 1989 | 2.325281 | 1.35 |
Washington Nationals | National | 181.38 | 191 | 0.254 | 82 | 4.04 | 2008 | 2.529604 | 1.675 |
Expert Solution
This question has been solved!
Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.
This is a popular solution!
Trending now
This is a popular solution!
Step by step
Solved in 3 steps with 4 images
Knowledge Booster
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.Recommended textbooks for you
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman