Assignment 3 (5%)
School: Western University
Course: 2143
Subject: Statistics
Date: Apr 3, 2024
4/30/2021
Assignment 3 (5%)
https://owl.uwo.ca/access/content/attachment/9155fbbb-4ff5-4431-b269-d36c50cda88c/Announcements/186d6c5a-0960-49ac-9edc-11a3b367dbf6/So…
1/7
Assignment 3 (5%)
Instructions
Submit one PDF document per team with the names and student numbers of all members. The project is due
Friday, April 2 (10:00PM), and to be submitted via Gradescope.
In this assignment you will use a sample of top chess players to conduct a variety of parametric hypothesis tests.
Adapted from data published in August 2020 by the International Chess Federation (FIDE), the dataset
“project3_data” provides data for 1987 players:
Country
Gender
FIDE title
Name
Standard game rating (>90 minute game)
Rapid rating (10 to 60 minutes)
Blitz rating (<10 minutes)
For the purposes of the assignment, the original dataset was modified to keep titled players from India, Russia and
the United States only.
Answer each of the questions below with full sentences accompanied by reproducible code from the software of
your choice (e.g. Excel, RStudio, Python, WolframAlpha). Report answers with software precision.
Conduct all hypothesis tests at the 95% confidence level.
# Import data and load packages
data <- read.csv("~/ss2143/project3/project3_data.csv")
attach(data)
Question 1 (9 points):
Are the averages of the Rapid rankings across three countries equal? Identify the null and the alternative
hypotheses (1 point), the test statistic (1 point), the rejection region (1 point), and the conclusion (1 point).
Answer
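To compare more than two means, we use a single-factor ANOVA. We denote the mean Rapid game rating for
India, Russia, and the United States as $\mu_{IND}$, $\mu_{RUS}$, and $\mu_{USA}$, respectively.
The null and alternative hypotheses are
$H_0: \mu_{IND} = \mu_{RUS} = \mu_{USA}$
$H_a:$ at least two of the means differ.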
# Subsamples
xIND <- Rapid[Country=='IND']
xRUS <- Rapid[Country=='RUS']
xUSA <- Rapid[Country=='USA']
# Number of countries
ncountries <- length(unique(Country))
# Number of observations
nobs <- length(Rapid)
nobsIND <- length(xIND)
nobsRUS <- length(xRUS)
nobsUSA <- length(xUSA)
# Sums of squares
SST <- sum(xIND)^2/nobsIND + sum(xRUS)^2/nobsRUS + sum(xUSA)^2/nobsUSA - sum(Rapid)^2/nobs
SSErr <- sum(Rapid^2) - (sum(xIND)^2/nobsIND + sum(xRUS)^2/nobsRUS + sum(xUSA)^2/nobsUSA)
# Mean Squares
MST <- SST/(ncountries - 1)
MSErr <- SSErr/(nobs - ncountries)
# Test statistic
Fstat <- MST/MSErr
# Critical value
Fcritval <- qf(0.95,ncountries - 1,nobs - ncountries)
The test statistic for a single-factor ANOVA is the ratio of the mean square for treatments (MST) and the mean square
for error (MSE), $F = MST/MSE$, resulting in 82.7448513.
The rejection region at the 95% level is any value greater than 3.0002602.
Since the test statistic is in the rejection region, we reject the null hypothesis according to which the mean rating
for Rapid games is equal across India, Russia, and the United States.
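As a sanity check on the shortcut formulas for the sums of squares, the manual F statistic can be compared with R's built-in aov. The sketch below uses simulated ratings, not the FIDE data; the names rating, country, and the group sizes are assumptions chosen only for illustration.

```r
# Synthetic stand-in for the assignment data (NOT the FIDE dataset)
set.seed(1)
rating  <- c(rnorm(40, 2050, 150), rnorm(60, 2230, 150), rnorm(30, 2300, 150))
country <- factor(rep(c("IND", "RUS", "USA"), times = c(40, 60, 30)))

# Manual single-factor ANOVA using the same shortcut formulas as above
group_sum <- tapply(rating, country, sum)
group_n   <- tapply(rating, country, length)
k <- nlevels(country)   # number of groups
n <- length(rating)     # total number of observations
SST   <- sum(group_sum^2 / group_n) - sum(rating)^2 / n
SSErr <- sum(rating^2) - sum(group_sum^2 / group_n)
Fstat_manual <- (SST / (k - 1)) / (SSErr / (n - k))

# Built-in equivalent: F value from the ANOVA table
anova_table   <- summary(aov(rating ~ country))[[1]]
Fstat_builtin <- anova_table[["F value"]][1]

# Critical value and decision at the 5% significance level
Fcrit  <- qf(0.95, k - 1, n - k)
reject <- Fstat_manual > Fcrit
```

If the two F values agree, the hand-computed sums of squares are consistent with the ANOVA table that R produces.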
If the countries are statistically different, which country outperforms the other two (1 point)? Is the average
Rapid ranking greater than the average of the other two countries? Identify the null and the alternative
hypotheses (1 point), calculate the test statistic (1 point), state the rejection region (1 point), and draw the
conclusion (1 point).
Answer
xbarIND <- mean(xIND)
xbarRUS <- mean(xRUS)
xbarUSA <- mean(xUSA)
The sample averages for Rapid game ratings are 2059, 2227.2023653, and 2308.1156463 for India, Russia, and
the United States, respectively. The United States therefore seems to outperform the other two countries in Rapid
chess.
Let us test whether the United States has a greater average Rapid game rating than the other two countries. To
compare two means, we use a two-sample $t$-test. We denote the mean Rapid rating across India and Russia as $\mu_{IR}$. The null and alternative hypotheses are
$H_0: \mu_{USA} = \mu_{IR}$
$H_a: \mu_{USA} > \mu_{IR}$
# Sample of non-USA ratings
xIR <- Rapid[Country!='USA']
# Sample size
nobsIR <- length(xIR)
# Average
xbarIR <- mean(xIR)
# Standard deviations
sdUSA <- sd(xUSA)
sdIR <- sd(xIR)
# Test statistic (unequal variances)
Tstat <- (xbarUSA - xbarIR)/sqrt((sdUSA^2/nobsUSA + sdIR^2/nobsIR))
# Degrees of freedom
dof <- ((sdUSA^2/nobsUSA + sdIR^2/nobsIR))^2/( (sdUSA^2/nobsUSA)^2/(nobsUSA-1) + (sdIR^2/nobsIR)^2/(nobsIR-1) )
# Critical value
Tcritval <- qt(0.95,dof)
The test statistic for the two-sample $t$-test is 5.1743882.
The rejection region at the 95% level for a one-sided test is any value greater than 1.6539298. This value
corresponds to the 95th percentile of a $t$-distribution with 168.8182402 degrees of freedom.
Since the test statistic is in the rejection region, we reject the null hypothesis according to which the mean rating
for Rapid games is equal between the United States and other countries.
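The unequal-variance statistic and its fractional degrees of freedom can be cross-checked against t.test, which applies the same correction when var.equal = FALSE. The sketch below runs on simulated ratings; the vectors x and y and their sizes are assumptions for illustration, not the assignment data.

```r
# Simulated ratings (hypothetical values, NOT the FIDE dataset)
set.seed(2)
x <- rnorm(50, 2300, 200)    # stand-in for the USA sample
y <- rnorm(400, 2200, 250)   # stand-in for the India + Russia sample

# Manual test statistic and degrees of freedom
se2_x <- var(x) / length(x)
se2_y <- var(y) / length(y)
Tstat_manual <- (mean(x) - mean(y)) / sqrt(se2_x + se2_y)
dof_manual <- (se2_x + se2_y)^2 /
  (se2_x^2 / (length(x) - 1) + se2_y^2 / (length(y) - 1))
pvalue_manual <- pt(Tstat_manual, dof_manual, lower.tail = FALSE)

# Built-in one-sided test with unequal variances
welch <- t.test(x, y, alternative = "greater", var.equal = FALSE)
```

The fields welch$statistic, welch$parameter, and welch$p.value hold the test statistic, the degrees of freedom, and the one-sided p-value.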
Question 2 (3 points)
Do players tend to have higher rankings in Standard games than in Rapid games? Identify the null and the
alternative hypotheses (1 point), calculate the p-value (1 point), and draw the conclusion (1 point).
Answer
To compare the ratings of Standard and Rapid games, we use the paired $t$-test. We denote the mean difference
between Standard and Rapid games as $\mu_D = \mu_S - \mu_R$, where $\mu_S$ and $\mu_R$ denote average Standard and Rapid
ratings.
The null and alternative hypotheses are
$H_0: \mu_D = 0$
$H_a: \mu_D > 0$
# Sample of differences
diff <- Standard - Rapid
# Test statistic
Tstat <- mean(diff)/sd(diff)*sqrt(nobs)
# P-value
pvalue <- 1-pt(Tstat,nobs-1)
The test statistic for the paired $t$-test is 23.8802265.
The p-value can be defined as the smallest $\alpha$ such that the null hypothesis would be rejected at the $1-\alpha$ confidence
level. This corresponds to defining a critical value equal to the test statistic. The null hypothesis would therefore be rejected whenever the critical value $t_{\alpha, 1986}$ is at most 23.8802265.
This value is the quantile associated with probability $1-\alpha$, where $\alpha = 0$ at software precision,
for a Student's $t$-distribution with 1986 degrees of freedom.
Since the p-value is less than 5%, we reject the null hypothesis according to which the mean rating for Rapid
games is equal to the mean rating for Standard games.
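The paired test can be reproduced with t.test(..., paired = TRUE). The sketch below uses simulated ratings (the vectors rapid and standard are assumptions, not the assignment data) and computes the upper tail with lower.tail = FALSE, which avoids the rounding to 0 that 1 - pt(...) produces for very large test statistics.

```r
# Simulated paired ratings (hypothetical values, NOT the FIDE dataset)
set.seed(3)
rapid    <- rnorm(200, 2200, 150)
standard <- rapid + rnorm(200, 40, 80)

# Manual paired t-test on the differences
d <- standard - rapid
n <- length(d)
Tstat_manual  <- mean(d) / (sd(d) / sqrt(n))
pvalue_manual <- pt(Tstat_manual, n - 1, lower.tail = FALSE)

# Built-in equivalent, one-sided alternative (Standard > Rapid)
paired <- t.test(standard, rapid, paired = TRUE, alternative = "greater")
```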
Question 3 (4 points):
Is the proportion of Grand Masters (GM) among the Indian players different from the proportion of GMs
among Russian players? Identify the null and the alternative hypotheses (1 point), calculate the p-value (1
point), and draw the conclusion (1 point).
Answer
We denote the proportion of GMs among Indian and Russian players as $p_{IND}$ and $p_{RUS}$, respectively.
To test the difference between two proportions, the null and alternative hypotheses are
$H_0: p_{IND} = p_{RUS}$
$H_a: p_{IND} \neq p_{RUS}$
# Sample proportions
pIND <- mean(Title[Country=='IND']=='GM')
pRUS <- mean(Title[Country=='RUS']=='GM')
# Variances
vIND <- pIND*(1-pIND)/nobsIND
vRUS <- pRUS*(1-pRUS)/nobsRUS
# Test statistic
Zstat <- (pIND-pRUS)/sqrt(vIND+vRUS)
# P-value
pvalue <- 2*(1-pnorm(abs(Zstat)))
The test statistic for the two-sample $z$-test for proportions is 2.4852422.
The p-value can be defined as the smallest $\alpha$ such that the null hypothesis would be rejected at the $1-\alpha$ confidence
level. This corresponds to defining a critical value equal to the test statistic. The null hypothesis would therefore be rejected whenever the critical value $z_{\alpha/2}$ is at most 2.4852422.
This value is the standard normal quantile associated with probability $1-\alpha/2$,
where $\alpha = 0.0129463$.
Since the p-value is less than 5%, we reject the null hypothesis according to which the proportions of GMs among
Indian and Russian players are equal.
Alternatively, one can use the large sample testing procedure as outlined below.
# Proportion of GMs across Indian and Russian players
phat <- mean(Title[Country=='IND'|Country=='RUS']=='GM')
# Large sample procedure test statistic
Zstat2 <- (pIND-pRUS)/sqrt(phat*(1-phat)*(1/nobsIND + 1/nobsRUS))
# P-value
pvalue2 <- 2*(1-pnorm(abs(Zstat2)))
The test statistic for the two-sample $z$-test for proportions is 2.7366541. The corresponding p-value is
0.0062068.
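The pooled (large sample) version of the test can be checked against prop.test with the continuity correction switched off: its chi-squared statistic equals the square of the pooled $z$ statistic, and its p-value matches the two-sided normal p-value. The counts below are hypothetical and are not taken from the assignment data.

```r
# Hypothetical GM counts and group sizes (NOT the FIDE dataset)
x_gm <- c(30, 90)     # number of GMs in each group
n_pl <- c(200, 400)   # number of players in each group

# Pooled two-sample z-test for proportions
p_hat  <- x_gm / n_pl
p_pool <- sum(x_gm) / sum(n_pl)
Z_pooled <- (p_hat[1] - p_hat[2]) /
  sqrt(p_pool * (1 - p_pool) * sum(1 / n_pl))
pvalue_pooled <- 2 * pnorm(abs(Z_pooled), lower.tail = FALSE)

# Built-in equivalent without continuity correction
builtin <- prop.test(x_gm, n_pl, correct = FALSE)
```

Note that prop.test applies a continuity correction by default, so correct = FALSE is needed for the two results to coincide.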
What is the power of this test given the sample proportions? In other words, evaluate the power $1-\beta(p_{IND}, p_{RUS})$ at $p_{IND} = \hat{p}_{IND}$ and $p_{RUS} = \hat{p}_{RUS}$ (1 point).
Answer
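The two-sided test has type II error function
$\beta(p_{IND}, p_{RUS}) = \Phi\left( \frac{z_{\alpha/2}\sqrt{\bar{p}\bar{q}(1/m + 1/n)} - (p_{IND} - p_{RUS})}{\sigma} \right) - \Phi\left( \frac{-z_{\alpha/2}\sqrt{\bar{p}\bar{q}(1/m + 1/n)} - (p_{IND} - p_{RUS})}{\sigma} \right)$
where $\bar{p} = (m p_{IND} + n p_{RUS})/(m + n)$, $\bar{q} = (m q_{IND} + n q_{RUS})/(m + n)$ with $q = 1 - p$, $m$ and $n$ are the two
sample sizes, and $\sigma = \sqrt{p_{IND} q_{IND}/m + p_{RUS} q_{RUS}/n}$.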
barp <- (nobsIND*pIND + nobsRUS*pRUS)/(nobsIND + nobsRUS)
barq <- (nobsIND*(1-pIND) + nobsRUS*(1-pRUS))/(nobsIND + nobsRUS)
sigma <- sqrt(pIND*(1-pIND)/nobsIND + pRUS*(1-pRUS)/nobsRUS)
zalpha <- qnorm(0.975)
# Power calculation
beta <- pnorm( ( zalpha*sqrt(barp*barq*(1/nobsIND + 1/nobsRUS))-(pIND-pRUS) )/sigma ) - pnorm( ( -zalpha*sqrt(barp*barq*(1/nobsIND + 1/nobsRUS))-(pIND-pRUS) )/sigma )
power <- 1 - beta
If the sample proportions are good approximations of the true population proportions, the power of the test can be
approximated as 0.7597097. The power measures the probability of correctly rejecting
the null hypothesis (i.e. rejecting the null hypothesis when it should be rejected).
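The power calculation can be wrapped in a small function so that it is easy to evaluate at other proportions and sample sizes. This is a sketch only: the function name power_two_prop and the inputs used below are hypothetical, not values from the assignment. A useful check is that the power reduces to the significance level when the two proportions are equal.

```r
# Power of the two-sided, two-sample z-test for proportions
# (hypothetical helper; p1, p2 are the assumed true proportions,
#  m and n the two sample sizes)
power_two_prop <- function(p1, p2, m, n, alpha = 0.05) {
  q1 <- 1 - p1
  q2 <- 1 - p2
  pbar  <- (m * p1 + n * p2) / (m + n)
  qbar  <- (m * q1 + n * q2) / (m + n)
  sigma <- sqrt(p1 * q1 / m + p2 * q2 / n)
  z     <- qnorm(1 - alpha / 2)
  half_width <- z * sqrt(pbar * qbar * (1 / m + 1 / n))
  beta <- pnorm((half_width - (p1 - p2)) / sigma) -
          pnorm((-half_width - (p1 - p2)) / sigma)
  1 - beta
}

# Equal proportions: power should equal alpha
power_equal <- power_two_prop(0.2, 0.2, 200, 400)
# Well-separated proportions: power should be larger
power_far <- power_two_prop(0.15, 0.30, 200, 400)
```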
Question 4 (4 points):
Are the variances of Blitz rankings different for males and females? Identify the null and the alternative hypotheses
(1 point), calculate the test statistic (1 point), state the rejection region (1 point), and draw the conclusion (1 point).
Answer
The sample variances for female and male players are denoted $s_F^2$ and $s_M^2$, and population variances for female
and male players are denoted $\sigma_F^2$ and $\sigma_M^2$.
To test the difference between two variances, the null and alternative hypotheses are
$H_0: \sigma_M^2 = \sigma_F^2$
$H_a: \sigma_M^2 \neq \sigma_F^2$
# Sample variances
vMale <- var(Blitz[Gender=='M'])
vFemale <- var(Blitz[Gender=='F'])
# Sample size
nobsMale <- sum(Gender=='M')
nobsFemale <- sum(Gender=='F')
# Test statistic
Fstat <- vMale/vFemale
# Critical values
Fcritval <- qf(c(0.025,0.975),nobsMale,nobsFemale)
The sample variances for women's and men's Blitz ratings are stored in vFemale and vMale,
respectively.
The test statistic for a comparison of variances is $F = \frac{s_M^2/\sigma_M^2}{s_F^2/\sigma_F^2}$. Under the null hypothesis, we have $\sigma_M^2 = \sigma_F^2$,
which then yields $F = s_M^2/s_F^2$ = 0.7202129.
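The rejection region at the 95% level is any value outside of the interval (0.8649173, 1.1631673). These values
correspond to the 2.5th and 97.5th percentiles of an $F$-distribution with 1532 and 455
degrees of freedom, the group sample sizes passed to qf in the code. The conventional degrees of freedom are one less than each sample size (1531 and 454), which changes the critical values only slightly at these sample sizes.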
Since the test statistic is in the rejection region, we reject the null hypothesis according to which the variances of
Blitz ratings are equal for female and male players.
Alternatively, one can invert the ratio when computing the test statistic. One must then swap the order of the degrees of
freedom of the $F$-distribution when computing the critical values for the rejection region.
# Test statistic
Fstat2 <- vFemale/vMale
# Critical values
Fcritval2 <- qf(c(0.025,0.975),nobsFemale,nobsMale)
This yields a test statistic of 1.3884782. The rejection region is any value outside of the interval
(0.8597216, 1.15618). The conclusion of the test is the same.
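The variance-ratio test is also available as var.test, which uses $n - 1$ degrees of freedom for each sample. The sketch below compares a manual computation with that built-in on simulated ratings; the vectors blitz_m and blitz_f and their sizes are assumptions for illustration, not the assignment data.

```r
# Simulated Blitz ratings (hypothetical values, NOT the FIDE dataset)
set.seed(4)
blitz_m <- rnorm(300, 2200, 180)
blitz_f <- rnorm(120, 2100, 210)

# Manual F test for equality of variances
Fstat_manual <- var(blitz_m) / var(blitz_f)
df1 <- length(blitz_m) - 1   # numerator degrees of freedom
df2 <- length(blitz_f) - 1   # denominator degrees of freedom
Fcrit  <- qf(c(0.025, 0.975), df1, df2)
reject <- Fstat_manual < Fcrit[1] | Fstat_manual > Fcrit[2]

# Built-in two-sided test
builtin <- var.test(blitz_m, blitz_f)
```

Rejecting when the statistic falls outside the two critical values is equivalent to rejecting when the two-sided p-value in builtin$p.value is below 0.05.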
Remarks
Drawing conclusions from hypothesis tests
A common mistake when drawing conclusions from hypothesis tests is to confirm one of the two hypotheses. One
never accepts the null hypothesis, but only fails to reject it given a predetermined confidence level. Similarly,
rejecting the null hypothesis with a predetermined confidence level is not the same thing as saying that the
alternative hypothesis is true. If we think of hypothesis tests as a means for scientific inquiry, we are measuring the
strength of the evidence for a claim/hypothesis. No matter how strong the evidence, you can never be assertive
when you draw conclusions from statistical inference. Falsifiability is a fundamental principle of the philosophy of
science, which is why theories are never confirmed, but only unrefuted.
Here are some examples of incorrect conclusions:
$H_0$ is accepted.
$H_a$ is accepted.
$H_0$ is rejected so $H_a$ is true.
$H_0$ is not rejected so $H_0$ is true.
$H_0$ is rejected in favor of $H_a$.
$H_a$ is rejected in favor of $H_0$.
$H_0$ is incorrect.
$H_a$ is incorrect.
Here are some examples of correct conclusions:
$H_0$ is rejected at the $\alpha$ significance level. The data gives strong support for $H_a$.
$H_0$ is not rejected at the $\alpha$ significance level. The data does not give strong support for $H_a$.
Reporting reproducible code
Many students are decidedly reluctant to include reproducible code in their answers. Consistent with the previous
assignments’ marking scheme, one point was deducted for each question where instructions were ignored.
Software commands that refer to undefined values are not considered reproducible (e.g. calculating a formula with
the value in cell Z9 in Excel, where the value in cell Z9 is never defined).