MAT 243 Project Three Summary Report Template (1) (Repaired)
docx
keyboard_arrow_up
School
Southern New Hampshire University *
*We aren’t endorsed by this school
Course
243
Subject
Mathematics
Date
Feb 20, 2024
Type
docx
Pages
6
Uploaded by AmbassadorFieldNightingale28
MAT 243 Project Three Summary Report
Rachel Favella
rachel.favella@snhu.edu
Southern New Hampshire University
April 17, 2022
1. Introduction
During this time as a hired data analyst, I have had the opportunity and access to accrue large sets of historical data and have been able to utilize this data to analyze performance patterns. The head coach and management over the team that has hired me as analyst, it is
my duty to create such regression models that will determine a total number of wins for that specific regular season. These findings will be used by the performance of key metrics. These key datasets will be utilized to make key decisions. A analytical report will be prepared with the statistics of the skill levels for each team and will be provided to the management for review, as I will give a complete overview of my findings in the report to explain how these statistics are generated and how I came to these findings. Since management is not clear on this report, I will interpret the report and my conclusions. 2. Data Preparation
There are some important variables that are used in this project. Identify and explain these variables. See the introductory section and Step 1 of the Python script to address the following:
What does the variable avg_pts_differential
represent? How would you explain it to someone who does not understand the data?
What does the variable avg_elo_n
represent? How would you explain it to someone who does not understand the data?
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report. 3. Scatterplot and Correlation for the Total Number of Wins and Average Relative Skill
You constructed a scatterplot of the total number of wins and the average relative skill to study their correlation. You also calculated the Pearson correlation coefficient along with its P-value. See Step 2 in the Python script to address the following items:
In general, how are data visualization techniques used to study relationship trends between two variables?
How is the correlation coefficient used to get the strength and direction of the association between two variables?
In this activity, you generated a scatterplot of the total number of wins and the average relative skill. Include a screenshot of this plot in your report.
What do the scatterplot and the Pearson correlation coefficient tell you about the association between total number of wins and average relative skill?
Is the correlation coefficient statistically significant based on the P-value? Use a 1% level
of significance.
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report. 4. Simple Linear Regression: Predicting the Total Number of Wins using Average Relative Skill
You created a simple linear regression model for the total number of wins in a regular season using the average relative skill as the predictor variable.
See Step 3 in the Python script to address the following items:
In general, how is a simple linear regression model used to predict the response variable using the predictor variable?
What is the equation for your model?
What are the results of the overall F-test? Summarize all important steps of this hypothesis test. This includes:
a.
Null Hypothesis (statistical notation and its description in words)
b.
Alternative Hypothesis (statistical notation and its description in words)
c.
Level of Significance d.
Report the test statistic and the P-value in a formatted table as shown below:
Table 1: Hypothesis Test for the Overall F-Test
Statistic
Value
Test Statistic
X.XX
*Round off to 2 decimal places.
P-value
X.XXXX
*Round off to 4 decimal places.
e.
Conclusion of the hypothesis test and its interpretation based on the P-value
Based on the results of the overall F-test, can average relative skill predict the total number of wins in the regular season?
What is the predicted total number of wins in a regular season for a team that has an average relative skill of 1550? Round your answer down to the nearest integer.
What is the predicted number of wins in a regular season for a team that has an average relative skill of 1450? Round your answer down to the nearest integer.
Answer the questions in a paragraph response. Remove all questions and this note (but not the table) before submitting! Do not include Python code in your report. 5. Scatterplot and Correlation for the Total Number of Wins and Average Points Scored
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
You constructed a scatterplot of total number of wins and average points scored. You also calculated the Pearson correlation coefficient along with its P-value. See Step 4 in the Python script to answer the following questions:
In this activity, you generated a scatterplot of the total number of wins and average points
scored. Include a screenshot of this plot in your report.
What do the scatterplot and the Pearson correlation coefficient tell you about the association between total number of wins and average points scored?
Is the correlation coefficient statistically significant based on the P-value? Use a 1% level
of significance.
Answer the questions in a paragraph response. Remove all questions and this note before submitting! Do not include Python code in your report. 6. Multiple Regression: Predicting the Total Number of Wins using Average Points Scored and Average Relative Skill
You created a multiple regression model with the total number of wins as the response variable, with average points scored and average relative skill as predictor variables.
See Step 5 in the Python script to answer the following questions:
In general, how is a multiple linear regression model used to predict the response variable
using predictor variables?
What is the equation for your model?
What are the results of the overall F-test? Summarize all important steps of this hypothesis test. This includes:
a.
Null Hypothesis (statistical notation and its description in words)
b.
Alternative Hypothesis (statistical notation and its description in words)
c.
Level of Significance d.
Report the test statistic and the P-value in a formatted table as shown below:
Table 2: Hypothesis Test for the Overall F-Test Statistic
Value
Test Statistic
X.XX
*Round off to 2 decimal places.
P-value
X.XXXX
*Round off to 4 decimal places.
e.
Conclusion of the hypothesis test and its interpretation based on the P-value
Based on the results of the overall F-test, is at least one of the predictors statistically significant in predicting the total number of wins in the season?
What are the results of individual t-tests for the parameters of each predictor variable?
Is each of the predictor variables statistically significant based on its P-value? Use a 1% level of significance.
Report and interpret the coefficient of determination.
What is the predicted total number of wins in a regular season for a team that is averaging
75 points per game with a relative skill level of 1350?
What is the predicted total number of wins in a regular season for a team that is averaging
100 points per game with an average relative skill level of 1600?
Answer the questions in a paragraph response. Remove all questions and this note (but not the table) before submitting! Do not include Python code in your report.
7. Multiple Regression: Predicting the Total Number of Wins using Average Points Scored,
Average Relative Skill, Average Points Differential, and Average Relative Skill Differential
You created a multiple regression model with the total number of wins as the response variable, with average points scored, average relative skill, average points differential, and average relative skill differential as predictor variables.
See Step 6 in the Python script to answer the following questions:
In general, how is a multiple linear regression model used to predict the response variable
using predictor variables?
What is the equation for your model?
What are the results of the overall F-test? Summarize all important steps of this hypothesis test. This includes:
a.
Null Hypothesis (statistical notation and its description in words)
b.
Alternative Hypothesis (statistical notation and its description in words)
c.
Level of Significance d.
Report the test statistic and the P-value in a formatted table as shown below:
Table 3: Hypothesis Test for Overall F-Test
Statistic
Value
Test Statistic
X.XX
*Round off to 2 decimal places.
P-value
X.XXXX
*Round off to 4 decimal places.
e.
Conclusion of the hypothesis test and its interpretation based on the P-value
Based on the results of the overall F-test, is at least one of the predictors statistically significant in predicting the number of wins in the season?
What are the results of individual t-tests for the parameters of each predictor variable? Is each of the predictor variables statistically significant based on its P-value? Use a 1% level of significance.
Report and interpret the coefficient of determination.
What is the predicted total number of wins in a regular season for a team that is averaging
75 points per game with a relative skill level of 1350, average point differential of -5 and average relative skill differential of -30?
What is the predicted total number of wins in a regular season for a team that is averaging
100 points per game with a relative skill level of 1600, average point differential of +5 and average relative skill differential of +95?
Answer the questions in a paragraph response. Remove all questions and this note (but not the table) before submitting! Do not include Python code in your report. 8. Conclusion
Describe the results of the statistical analyses clearly, using proper descriptions of statistical terms and concepts. Fully describe what these results mean for your scenario.
Briefly summarize your findings in plain language.
What is the practical importance of the analyses that were performed?
9. Citations
You were not
required to use external resources for this report. If you did not use any resources, you should remove this entire section. However, if you did use any resources to help you with your interpretation, you must
cite them. Use proper APA format for citations.
Insert references here in the following format:
Author's Last Name, First Initial. Middle Initial. (Year of Publication). Title of book: Subtitle of book, edition. Place of Publication: Publisher.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help