KPE291_TermProject_Instructions
School: University of Toronto
Course: 291
Subject: Mathematics
Date: Jan 9, 2024
Pages: 3
KPE 291 Term Project (20 Marks)
This term project will allow you to apply the statistical theory and tests you are learning this term to a real dataset. By the end of this course, we hope that you will have gained not only an appreciation of the many real-world applications of statistics, but also an idea of how to begin answering a research question in practice using quantitative approaches.
Overview
While research objectives and hypotheses are typically decided before data collection, for the purposes of this assignment you will be given a choice of six different pre-constructed datasets. You will choose one dataset, identify a research question from it, and then use the techniques you have learned throughout the term to design a statistical approach to answer that question. Here is an example:
Example
Schooling Level    Physical Activity    Social Media Time
Undergraduate      30                   840
High School        450                  500
Undergraduate      120                  400
Undergraduate      600                  420
High School        100                  1000
Undergraduate      900                  210
Undergraduate      25                   1200
High School        480                  240
High School        100                  800
High School        60                   900
The above dataset consists of ten rows of participant data and three columns of variables, labelled “Schooling Level”, “Physical Activity”, and “Social Media Time”. The first part of the assignment consists of selecting a question. By looking at the variables given, we can generate several potential questions: 1) Is there a relationship between physical activity and social media time? 2) Does social media time differ between Undergraduate and High School students? … etc. Recall from this term that statistics will help you answer your question. The main objective of the assignment is therefore to identify a research question and apply a statistical test learned this term to answer it.
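As a minimal sketch, the ten-row example above could be entered into R as shown below. The column names (schooling_level, physical_activity, social_media_time) are illustrative assumptions and are not taken from the provided datasets, which you would normally read from a file instead of typing in by hand.

```r
# Example dataset from the table above, typed in by hand for illustration.
# The column names are assumptions chosen for this sketch.
example_data <- data.frame(
  schooling_level = c("Undergraduate", "High School", "Undergraduate",
                      "Undergraduate", "High School", "Undergraduate",
                      "Undergraduate", "High School", "High School",
                      "High School"),
  physical_activity = c(30, 450, 120, 600, 100, 900, 25, 480, 100, 60),
  social_media_time = c(840, 500, 400, 420, 1000, 210, 1200, 240, 800, 900)
)

# Store schooling level as a categorical (factor) variable
example_data$schooling_level <- factor(example_data$schooling_level)

# Inspect the structure: one factor column and two numeric columns
str(example_data)

# Count how many participants fall into each schooling level
group_counts <- table(example_data$schooling_level)
print(group_counts)
```

Checking the structure first makes it clear which variables are continuous and which are categorical, which in turn narrows down the questions you can ask of the data.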
Term Project Requirements
Rationale, Objective and Hypothesis (5 Marks)
1. Identify a question of interest (objective) from one of the six datasets provided and write a 250-word rationale justifying your research question. In this section, use prior research (i.e., references) to support your rationale. Remember, your question has to involve two variables, and one of the variables MUST be continuous. (1 mark for stating objective, 1 mark for referencing literature, 1 mark for rationale, 1 mark for style/grammar = 4 marks)
2. State the null and alternative hypotheses for your research question. (0.5 mark each = 1 mark)
Methods (3 Marks)
1. Identify and justify the statistical test you will use to answer your question. This section should be limited to 2-5 sentences. (0.5 mark for the correct test, 0.5 mark for justification = 1 mark)
2. Using RStudio and the ggplot2 package, create a plot of one of your continuous variables. Make a statement on the distribution of the variable and justify this statement with an explanation. (1 mark for correct plot with correct labelling, 0.5 mark for correct distribution assumption, 0.5 mark for justification = 2 marks)
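A distribution plot of this kind might be sketched as follows, using the Social Media Time values from the example in the Overview. The column name, the choice of a histogram, the number of bins, and the label text are all assumptions made for illustration. The example does not state measurement units, so none appear in the axis label.

```r
library(ggplot2)

# Social Media Time values copied from the example dataset in the Overview.
# The column name is an assumption chosen for this sketch.
example_data <- data.frame(
  social_media_time = c(840, 500, 400, 420, 1000, 210, 1200, 240, 800, 900)
)

# Histogram of one continuous variable. Five bins is an arbitrary choice
# for a ten-row example; a real dataset may call for a different number.
dist_plot <- ggplot(example_data, aes(x = social_media_time)) +
  geom_histogram(bins = 5, colour = "black", fill = "grey70") +
  labs(
    title = "Distribution of Social Media Time",
    x = "Social Media Time",
    y = "Number of Participants"
  )

print(dist_plot)
```

With only ten observations, a histogram says little about the shape of a distribution; the provided datasets should support a clearer statement.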
Results (7 Marks)
1. Variables: List each variable you are analyzing, along with the type of variable it is. (0.5 mark x 2 for correct type of variable = 1 mark)
2. Descriptive Statistics: For one of your variables, calculate the mean, median, mode, variance, standard deviation, and interquartile range in RStudio. Be sure to write these out in sentence form in your report. (0.25 mark each x 6 = 1.5 marks)
3. Graph: Create a graph in RStudio using the ggplot2 package to illustrate potential between-group differences or relationships between variables. This graph should be a visualization of the question you are answering in this assignment. Be sure to save the plot and include it in your report. (1 mark for correct plot with correct labelling, 1 mark for including appropriate titles, axis scaling, etc. = 2 marks)
4. Test Results: Using RStudio, perform the statistical test chosen to evaluate your hypothesis. Formally write up the results of this test. (1 mark for the correct result, 1.5 marks for the correct writing of test results = 2.5 marks)
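The Results steps can be sketched on the example dataset from the Overview as shown below. This is an illustration under stated assumptions, not a model answer: the column names and the stat_mode helper are hypothetical, and the Welch two-sample t-test is used only because it fits the example's second question (social media time by schooling level). The appropriate test for your project depends on your own variables and question.

```r
library(ggplot2)

# Example data copied from the Overview; column names are assumptions.
example_data <- data.frame(
  schooling_level = factor(c("Undergraduate", "High School", "Undergraduate",
                             "Undergraduate", "High School", "Undergraduate",
                             "Undergraduate", "High School", "High School",
                             "High School")),
  physical_activity = c(30, 450, 120, 600, 100, 900, 25, 480, 100, 60),
  social_media_time = c(840, 500, 400, 420, 1000, 210, 1200, 240, 800, 900)
)

# ---- Descriptive statistics for one variable ----
# Base R has no built-in function for the statistical mode, so this
# hypothetical helper returns the most frequent value(s).
stat_mode <- function(x) {
  counts <- table(x)
  as.numeric(names(counts)[counts == max(counts)])
}

pa <- example_data$physical_activity
descriptives <- list(
  mean     = mean(pa),
  median   = median(pa),
  mode     = stat_mode(pa),
  variance = var(pa),   # sample variance (n - 1 denominator)
  sd       = sd(pa),    # sample standard deviation
  iqr      = IQR(pa)
)
print(descriptives)

# ---- Graph of between-group differences ----
group_plot <- ggplot(example_data,
                     aes(x = schooling_level, y = social_media_time)) +
  geom_boxplot() +
  labs(
    title = "Social Media Time by Schooling Level",
    x = "Schooling Level",
    y = "Social Media Time"
  )
print(group_plot)

# ---- Statistical test (illustrative choice only) ----
# t.test() with a formula runs Welch's two-sample t-test by default.
test_result <- t.test(social_media_time ~ schooling_level,
                      data = example_data)
print(test_result)
```

Note that stat_mode returns every tied value when no single value is most frequent, so a variable with all-unique values has no meaningful mode.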
Interpretation and Conclusion (3 Marks)
1. In 300-400 words, interpret your results. To help with this section, incorporate the following: What was the answer to your question? Is this what you expected? Remember to return to your objective and hypothesis to help with this section. (1 mark for summarizing results, 1 mark for referring to hypothesis and outside literature, 1 mark for style/grammar = 3 marks)
Appendix (2 Marks):
Save your code as a .R file containing all information used in the project: graphs, descriptives, statistical tests, etc. Please use comments in your script to help organize your code. In addition, please be sure to complete all data preparation / wrangling in your R code as the TAs need to be able to run the entire script (i.e., no Excel formatting of the datasets beforehand). (1 mark for completion [all tests and graphs were completed in R, comments to organize code], 1 mark for accuracy [code is correct and outputs match what is reported in the report] = 2 marks)
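One possible layout for such a script is sketched below. Everything specific in it is an assumption: the file names, the column names, and the preparation steps are hypothetical, and the script writes the Overview example to a temporary CSV only so that it runs by itself. In your project you would read the provided dataset instead.

```r
# KPE 291 Term Project script (hypothetical layout)
library(ggplot2)

# ---- 0. Stand-in data so this sketch runs by itself ----
# Skip this step in a real project and read the provided dataset instead.
data_path <- file.path(tempdir(), "example_dataset.csv")  # hypothetical name
write.csv(
  data.frame(
    "Schooling Level" = c("Undergraduate", "High School", "Undergraduate",
                          "Undergraduate", "High School", "Undergraduate",
                          "Undergraduate", "High School", "High School",
                          "High School"),
    "Physical Activity" = c(30, 450, 120, 600, 100, 900, 25, 480, 100, 60),
    "Social Media Time" = c(840, 500, 400, 420, 1000, 210, 1200, 240, 800, 900),
    check.names = FALSE
  ),
  data_path,
  row.names = FALSE
)

# ---- 1. Load the data ----
raw_data <- read.csv(data_path, check.names = FALSE)

# ---- 2. Data preparation (done in R, not in Excel) ----
project_data <- raw_data
names(project_data) <- c("schooling_level", "physical_activity",
                         "social_media_time")
project_data$schooling_level <- factor(project_data$schooling_level)
project_data <- na.omit(project_data)  # drop rows with missing values

# ---- 3. Graph, saved to a file for the report ----
scatter_plot <- ggplot(project_data,
                       aes(x = physical_activity, y = social_media_time)) +
  geom_point() +
  labs(
    title = "Social Media Time vs. Physical Activity",
    x = "Physical Activity",
    y = "Social Media Time"
  )
plot_path <- file.path(tempdir(), "figure1.png")  # hypothetical name
ggsave(plot_path, plot = scatter_plot, width = 6, height = 4, dpi = 300)
```

Keeping the renaming, type conversion, and missing-value handling inside the script means anyone can rerun it from the original file and reproduce the reported outputs.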
Summary:
Your submission should include the following:
1. A Word document that contains the report (the written portion of this project).
2. A .R script containing all your code.
REMEMBER: The graphs should be included as figures in the written document, and the code to make the graph should be in the .R script.