STATS Homework 7
docx
keyboard_arrow_up
School
University of Minnesota-Twin Cities *
*We aren’t endorsed by this school
Course
3011
Subject
Statistics
Date
Jan 9, 2024
Type
docx
Pages
4
Uploaded by SuperHumanKookaburaPerson998
Problem 1
For each part below answer
(i) state the name of relevant hypothesis test:
• One sample
z
-test for proportion
p -
Normal hypothesis
• One sample
t
-test for mean
µ
-
t-test
• Matched pairs
t
-test for mean of difference
µ
D
-
Two-sided hypothesis test
• Independent two sample
t
-test for difference of means
µ
1
− µ
2
- Two sided hypothesis test
(ii) define parameter(s) of interest and state the null and alternative hypotheses.
a) It is known that 40% of college students in the U.S. identified Instagram as their favorite social media
application. To test if this information is still accurate in 2022, you surveyed 100 college students randomly
and found 32% of them identified Instagram as their favorite.
➔
H0 :
µ = 0.40
➔
Ha :
µ > 0.40
b) Forty-four sixth graders were randomly selected from a school district. Then, they were divided into 22
matched pairs, each pair having equal IQ’s. One member of each pair was randomly selected to receive
special training. Then all of the students were given an IQ test. The researcher wants to test if the special
training improves IQ scores.
➔
H0 :
µ
D
= 0
➔
Ha :
µ
D
≠ 0
c) An experiment investigated whether cell phone use increases drivers’ reaction times, using a sample of 100
college students. Students were randomly assigned to a cell phone group or to a control group, 50 to each.
In a simulation of driving situations, a target flashed red or green at irregular periods. Participants pressed a
brake button as soon as they detected a red light. The control group listened to the radio or audiobook while
they performed the simulated driving. The cell phone group carried out a phone conversation about a
political issue with someone in a separate room. The experiment measured each group’s mean response
time over many trials.
➔
H0 :
µ
= .50
➔
Ha :
µ
> .50
d) You want to know if students at the University of Minnesota sleeps more than 7 hours on average. To
investigate this, you sampled a few class mates and asked how many hours they sleep on a typical week
day.
➔
H0 :
µ
= .70
➔
Ha :
µ
> .70
e) Lucy wanted to test if the selling prices of e-book is cheaper than prices of hard cover book. She first took a
random sample of 7 books. Then she looked up e-book prices and hard copy prices of those 7 books.
➔
H0 :
µ
= 7
➔
Ha :
µ
≠ 7
Problem 2
98.6 F is commonly referred as the average body temperature. Is this information accurate? A study was
conducted in 1992 to examine this belief. The oral body temperatures of a random sample of 100 healthy
adults were measured. The mean and standard deviations are ¯
x
= 98
.
2
F
and
s
= 0
.
7
F
. Use this sample
information to test whether the true human body temperature is equal to 98.6 or not. Use
α
= 0
.
05.
• Assumption : the sample size is large enough, greater than 100. Second, the observations are
independent from each other. Lastly, the data is randomly sampled from the population of interest. This
follows normal distribution.
• Hypotheses : Ho:
μ =
98.6
Ha: μ
≠ 98.6
• Test statistic: What is the distribution of test statistics? Based on the sample information provided in
Problem 2, what is the value of the test statistic?
(98.2 - 98.6) / 0.7√100 = -5.71
• P-value: Interpret the p-value in the context of the problem.
❖
2 * pt (-5.71, df= 99, lower.tail = FALSE)
• Conclusion and interpretation in context.
❖
Since the P value is less than the significance level, we must reject the null hypothesis, H0. This
concludes that there is enough evidence to support that the human body temperature is not equal to
98.6.
Problem 3:
type 1 vs type 2 error
A researcher wants to see if a new drug with potentially dangerous side effects is significantly better than the
current drug. If it is found to be more effective, it will be prescribed to millions of people.
a) What does a Type I error mean in this scenario?
❖
This showcases the probability of accepting the new drug would be better than the current drug.
b) What does a type II error mean in this scenario?
❖
This is the possibility of rejecting that the new drug is going to be better than the current drug that is
given.
c) Which type of error is worse to make in this situation? Explain.
❖
Type 1 is the worst error because if the new drug is not better than the current drug, then that falls into
many categories of it being false to millions of people.
Problem 4
A graduate teaching assistant collected data to investigate whether study time per week (average number of
hour) differed between students who planned to go to graduate school and those who did not. The data were
as follows:
Graduate school : 15, 7, 15, 10, 5, 5, 2, 3, 12, 16, 15, 37, 8, 14, 10, 18, 3, 25, 15, 5, 5
No graduate school : 6, 8, 15, 6, 5, 14, 10, 10, 12, 5
Use the following R command to input data.
grad<-c(15, 7, 15, 10, 5, 5, 2, 3, 12, 16, 15, 37, 8, 14, 10, 18, 3, 25, 15, 5, 5) noGrad<-c(6, 8, 15, 6, 5,
14, 10, 10, 12, 5)
a) Use R to find sample mean and sample standard deviation for each group. Copy and paste your R output.
❖
> mean(grad)
❖
[1] 11.66667
❖
> sd(grad)
❖
[1] 8.338665
❖
> mean(noGrad)
❖
[1] 9.1
❖
> sd(noGrad)
❖
[1] 3.695342
b) Show formula with numbers plugged in to verify that standard error for difference between the sample
means is 2.16. Interpret this value.
√
8.33^2/ 21 +3.70/ 10^2 = T = .005
c) What is the value of t-multiplier,
t
α/
2
,df
=min(
n
1
−
1
,n
2
−
1)
, to construct a 99% confidence interval? Copy and paste
your R command and output.
❖
9t(.005, df = 10 -1) = -3.25
d) Show formula with numbers plugged in to verify that 99% confidence interval to estimate the difference
between the two population mean is (-4.4, 9.59). Interpret.
mean(grad)-mean(noGrad)+-3.249836*2.161 = -4.4
mean(grad)-mean(noGrad)--3.249836*2.161 = 9.59
This means that we are 99% confident that the difference between hour average for those who plan to go to
graduate school and those who will not is between these two points. (-4.4, 9.59)
Problem 5
Use the same information from Problem 4 to conduct a hypothesis test to test whether the average study hours
for students who plan to go to graduate school is greater than average study hours for students who don’t plan
to go to graduate school. Use
α
= 0
.
1. (Do not use t.test())
• Assumption : assume whether the average study hours for students who plan to go to graduate school
match those who do not plan to go.
• Hypotheses : H0 : mu 1 = mu2
Ha: mu1 > mu2
• Test statistic : What is the distribution of test statistic? Based on the sample information provided in the
Problem 4, what is the value of the test statistic?
• P-value
> pt(-3.25, df= .99)
[1] 0.09602097
• Conclusion and interpretation in context.
Since this has concluded that the p value is smaller than 0.1, we fail to reject the null hypothesis
.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Problem 6
Type two because it could have been due to not rejecting the null hypothesis due to it being false.
Problem 7
Use R command t.test() to repeat Problem 5. (Test whether students who plan to go to graduate school study
more than students who don’t at
α
= 0
.
1). Copy and paste your R command and output. Note to students : R
uses ”Welch’s degrees of freedom” instead of minimum of
n
1
−
1,
n
2
−
1, hence gives a slightly different p-
value from Problem 5.
Problem 8
P-value of hypothesis test
H
a
:
µ
1
̸
=
µ
2
is 0.03. Which of the following is always true? Explain. (A)
Using the same sample information, the 95% confidence interval for
µ
1
− µ
2
will contain 0.
(B)
Using the same sample information, the 99% confidence interval for
µ
1
− µ
2
will contain 0.
(C) Using the same sample information, the 95% confidence interval for
µ
1
−µ
2
will contain positive numbers
only.
P value is greater than
0.01 so we fail to reject h0, thus having the 99 percent interval contain 0.
Related Documents
Related Questions
URGENT
arrow_forward
For the following hypothesis, state the IV and DV with their respective measurement scales:
"The gender of the interviewer affects the interviewee's anxiety levels (low/med/high)."
Independent Variable =
IV scale of measurement =
Dependent Variable =
DV scale of measurement =
arrow_forward
I think I'm supposed to do a hypothesis test for a difference in means on this review problem, but I'm not completely sure. I'd be very grateful for some help!
arrow_forward
URGENT
arrow_forward
How many levels or categories does a study have if using a goodness of fit test and reporting a df=2
A study using the chi-square test of independence has 4 categories of one variables and 2 categories for its other variable. What are the degrees of freedom for the study?
If a chi-square test of Independence ends in a non-significant result it means that the variables depend on each other.
True
False
arrow_forward
You may need to use the appropriate appendix table or technology to answer this question.
The following results are for independent random samples taken from two populations.
Sample 1
Sample 2
n1 = 20
n2 = 30
x1 = 22.9
x2 = 20.1
s1 = 2.6
s2 = 4.6
(a)
What is the point estimate of the difference between the two population means? (Use
x1 − x2.
)
(b)
What is the degrees of freedom for the t distribution? (Round your answer down to the nearest integer.)
(c)
At 95% confidence, what is the margin of error? (Round your answer to one decimal place.)
(d)
What is the 95% confidence interval for the difference between the two population means? (Use
x1 − x2.
Round your answers to one decimal place.)
to
arrow_forward
The first blank for both questions is: "estimate of the standard ('distance' or 'score' or 'mean')"
Second blanks: "...between a ('sample mean' or 'sample mean difference' or 'score')"
Third Blanks: "...and the ('population mean difference' or 'population mean' or sample mean')"
The choices are the same for the corresponding blanks, first, second and third.
arrow_forward
You may need to use the appropriate appendix table or technology to answer this question.
The following results are for independent random samples taken from two populations.
Sample 1 Sample 2
n = 20
n2 = 30
X,22.5
X = 20.1
S1 = 2.9
S2 = 4.2
(a) What is the point estimate of the difference between the two population means? (Use x, - x.)
2.4
(b) What is the degrees of freedom for the t distribution? (Round your answer down to the nearest integer.)
46
(c) At 95% confidence, what is the margin of error? (Round your answer to one decimal place.)
22
Enter a number.
(d) What is the 95% confidence interval for the difference between the two population means? (Use x, - Xg. Round your answers to on
| to (46
arrow_forward
You wish to test the following claim (H.) at a significance level of a = 0.005. For the context of this
problem, µa = PostTest - PreTest where the first data set represents a pre-test and the second data
set representS a post-test. (Each row represents the pre and post test scores for an individual. Be careful
when you enter your data and specify what your uj and uz are so that the differences are computed
correctly.)
H,: Pd = 0
07 Prl :"H
You believe the population of difference scores is normally distributed, but you do not know the standard
deviation. You obtain the following sample of data:
post-test
65.7
54.6
62.4
43.8
61.4
58.8
46
47.7
61.3
pre-test
56.6
52.8
63.3
59.5
58.4
51
51.4
51
57.7
55.8
53.7
54.3
64.5
60.5
44.7
56.3
48.6
48.6
50.9
56.8
56.2
55.4
83.9
68.7
35.7
29
42.3
44.8
46.4
What is the test statistic for this sample?
test statistic=
(Report answer accurate to 4 decimal places.)
What is the p-value for this sample?
p-value
(Report answer accurate to 4 decimal places.)…
arrow_forward
I need help with all parts of this question 3
arrow_forward
Selecting the Appropriate Inferential Test:
For each of the studies below, please identify the following (or indicate N/A “not applicable" if the feature is not relevant to
the design):
a) Independent Variable(s) (IVs) and # of levels for any IVs (for Correlation/Regression, identify the predictor and
predicted variable rather than IV and DV)
b) Dependent Variable (DV)
c) Scale of Measurement for the Dependent Variable: (Nominal, Ordinal, Interval, Ratio) for
Correlation/Regression please make sure you identify the scale of measurement for both variables
d) Appropriate Statistical Test (If it is a Factorial ANOVA, make sure to name the design: e.g., 2 X 2, 2 X 3, etc.)
1. A researcher is interested in how electrical stimulation of the brain influences emotion regulation capability. She
delivers electrical stimulation to a different brain region (hippocampus, hypothalamus, amygdala, cerebellum) in each of
four separate groups of young adult subjects. She then records the amount of time…
arrow_forward
A sample of 46 observations gave sample mean 20 and coefficient of variation 0.6 Then the sample variance is :
arrow_forward
When you are testing the difference between two means, it is not important to
distinguish whether the samples are independent of each other.
True
arrow_forward
What conditions are necessary in order to use the dependent samples
t-test
for the mean of the difference of two populations?
arrow_forward
For this study, we should use
The null and alternative hypotheses would be: z-test for a population proportion or t-test for a population mean Ho: (p or u) (please enter a decimal) H1: (p or u) (Please enter a decimal)
The test statistic z = (please show your answer to 3 decimal places.)
The critical value = (Please show your answer to 3 decimal places.)
The test statistic (is or is not) in the rejection region.
Based on this, we should (reject or accept or fail to accept) the null hypothesis.
Thus, the final conclusion is that ...
The data suggest the population proportion is not significantly larger 50% at � = 0.05, so there is not sufficient evidence to conclude that the proportion of voters who prefer the Democratic candidate is larger 50%.
The data suggest the populaton proportion is significantly larger 50% at � = 0.05, so there is sufficient evidence to conclude that the proportion of voters who prefer the Democratic candidate is…
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you
data:image/s3,"s3://crabby-images/b9e14/b9e141b888912793d57db61a53fa701d5defdb09" alt="Text book image"
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Related Questions
- URGENTarrow_forwardFor the following hypothesis, state the IV and DV with their respective measurement scales: "The gender of the interviewer affects the interviewee's anxiety levels (low/med/high)." Independent Variable = IV scale of measurement = Dependent Variable = DV scale of measurement =arrow_forwardI think I'm supposed to do a hypothesis test for a difference in means on this review problem, but I'm not completely sure. I'd be very grateful for some help!arrow_forward
- URGENTarrow_forwardHow many levels or categories does a study have if using a goodness of fit test and reporting a df=2 A study using the chi-square test of independence has 4 categories of one variables and 2 categories for its other variable. What are the degrees of freedom for the study? If a chi-square test of Independence ends in a non-significant result it means that the variables depend on each other. True Falsearrow_forwardYou may need to use the appropriate appendix table or technology to answer this question. The following results are for independent random samples taken from two populations. Sample 1 Sample 2 n1 = 20 n2 = 30 x1 = 22.9 x2 = 20.1 s1 = 2.6 s2 = 4.6 (a) What is the point estimate of the difference between the two population means? (Use x1 − x2. ) (b) What is the degrees of freedom for the t distribution? (Round your answer down to the nearest integer.) (c) At 95% confidence, what is the margin of error? (Round your answer to one decimal place.) (d) What is the 95% confidence interval for the difference between the two population means? (Use x1 − x2. Round your answers to one decimal place.) toarrow_forward
- The first blank for both questions is: "estimate of the standard ('distance' or 'score' or 'mean')" Second blanks: "...between a ('sample mean' or 'sample mean difference' or 'score')" Third Blanks: "...and the ('population mean difference' or 'population mean' or sample mean')" The choices are the same for the corresponding blanks, first, second and third.arrow_forwardYou may need to use the appropriate appendix table or technology to answer this question. The following results are for independent random samples taken from two populations. Sample 1 Sample 2 n = 20 n2 = 30 X,22.5 X = 20.1 S1 = 2.9 S2 = 4.2 (a) What is the point estimate of the difference between the two population means? (Use x, - x.) 2.4 (b) What is the degrees of freedom for the t distribution? (Round your answer down to the nearest integer.) 46 (c) At 95% confidence, what is the margin of error? (Round your answer to one decimal place.) 22 Enter a number. (d) What is the 95% confidence interval for the difference between the two population means? (Use x, - Xg. Round your answers to on | to (46arrow_forwardYou wish to test the following claim (H.) at a significance level of a = 0.005. For the context of this problem, µa = PostTest - PreTest where the first data set represents a pre-test and the second data set representS a post-test. (Each row represents the pre and post test scores for an individual. Be careful when you enter your data and specify what your uj and uz are so that the differences are computed correctly.) H,: Pd = 0 07 Prl :"H You believe the population of difference scores is normally distributed, but you do not know the standard deviation. You obtain the following sample of data: post-test 65.7 54.6 62.4 43.8 61.4 58.8 46 47.7 61.3 pre-test 56.6 52.8 63.3 59.5 58.4 51 51.4 51 57.7 55.8 53.7 54.3 64.5 60.5 44.7 56.3 48.6 48.6 50.9 56.8 56.2 55.4 83.9 68.7 35.7 29 42.3 44.8 46.4 What is the test statistic for this sample? test statistic= (Report answer accurate to 4 decimal places.) What is the p-value for this sample? p-value (Report answer accurate to 4 decimal places.)…arrow_forward
- I need help with all parts of this question 3arrow_forwardSelecting the Appropriate Inferential Test: For each of the studies below, please identify the following (or indicate N/A “not applicable" if the feature is not relevant to the design): a) Independent Variable(s) (IVs) and # of levels for any IVs (for Correlation/Regression, identify the predictor and predicted variable rather than IV and DV) b) Dependent Variable (DV) c) Scale of Measurement for the Dependent Variable: (Nominal, Ordinal, Interval, Ratio) for Correlation/Regression please make sure you identify the scale of measurement for both variables d) Appropriate Statistical Test (If it is a Factorial ANOVA, make sure to name the design: e.g., 2 X 2, 2 X 3, etc.) 1. A researcher is interested in how electrical stimulation of the brain influences emotion regulation capability. She delivers electrical stimulation to a different brain region (hippocampus, hypothalamus, amygdala, cerebellum) in each of four separate groups of young adult subjects. She then records the amount of time…arrow_forwardA sample of 46 observations gave sample mean 20 and coefficient of variation 0.6 Then the sample variance is :arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw Hill
data:image/s3,"s3://crabby-images/b9e14/b9e141b888912793d57db61a53fa701d5defdb09" alt="Text book image"
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill