2019 H2 Prelim (Hypothesis Testing) Qns with Sol
pdf
keyboard_arrow_up
School
Nanyang Technological University *
*We aren’t endorsed by this school
Course
8865
Subject
Statistics
Date
Nov 24, 2024
Type
Pages
32
Uploaded by CaptainGalaxyTapir10
2019 H2 MA Prelim Compilation - Hypothesis Testing (17 Questions with Solutions) ACJC JC2 Prelim 9758/2019/02/Q7 A concert promoter claims that the mean price of a ticket to a pop concert is $200. A media company collected information on ticket price, $
x
, for 50 randomly chosen people who bought pop concert tickets. The results are summarised as follows. (
200)
450
x
2
(
200)
55 000
x
(i)
Test, at the 4% level of significance, the concert promoter’s claim that the mean price of a ticket to a pop concert is $200. You should state your hypotheses and define any symbols you use. [5] (ii)
The media company took another random sample of n
tickets, and found that the average ticket price for this sample is $206. If the standard deviation of ticket price is now known to be $32.25, find the maximum value of n
such that there is insufficient evidence at the 4% level of significance to reject t
he concert promoter’s claim.
[2] ACJC JC2 Prelim 9758/2019/02/Q7 (Solutions) 7(i) (
200)
200
209
50
x
x
2
2
450
1
50950
55000
49
50
49
s
or 1039.8 (5 s.f.) Let μ be the mean price of all concert tickets. To test 0
H :
μ
200
against 1
H :
μ
200
at the 4% level of significance. Under 0
H
, Z =
200
~N 0,1
50
X
s
or 50950
~N 200, 49(50)
X
approximately by central limit theorem since sample size, 50, is large. Value of test statistic 1.97
z
p
-value = 0.0484 > 0.04 (do not reject 0
H
) There is insufficient
evidence at the 4% level of significance
that the mean concert ticket price is not $200. (ii) Now given the standard deviation is 32.25 Under 0
H
, 2
32.25
~N 200, X
n
by CLT since n
is large
For H
0
to not be rejected, the test statistic, z
, must not be in critical region. Hence 2.0537
2.0537 z
i.e. 206
200
2.0537
2.0537 32.25
n
i.e. 6
2.0537
2.0537 32.25
n
i.e. 11.039
11.039
n
Hence largest value of n
is 121. ASRJC JC2 Prelim 9758/2019/02/Q10 Along a 3km stretch of a road, the speed in km/h of a vehicle is a normally distributed random variable T
. Over a long period of time, it is known that the mean speed of vehicles traveling along that stretch of the road is 90.0 km/h. To deter speeding, the traffic governing body introduced a speed monitoring camera. Subsequently, the speeds of a random sample of 60 vehicles are recorded. The results are summarised as follows.
2
5325,
2000.
t
t
t
(i)
Find unbiased estimates of the population mean and variance, giving your answers to 2 decimal places. [2] (ii)
Test, at the 5% significance level, whether the speed-monitoring camera is effective in deterring the speeding of vehicles on the stretch of road. [4] In another sample of size n
(
n
> 30) that was collected independently, it is given that t
= 89.0. The result of the subsequent test using this information and the unbiased estimate of the population variance in part
(i)
is that the null hypothesis is not rejected. Obtain an inequality involving n
, and hence find the largest possible value n
can take. [4] ASRJC JC2 Prelim 9758/2019/02/Q10 (Solutions) 10
(iii)
Unbiased estimate of the population mean = 5325
88.75
60
t
2%
-2.0537
2.0537
2%
Unbiased estimate of the population variance =
2
2
2000
59
59
t
t
s
= 33.89830508 = 33.90 (iv)
Let
denote the population mean speed of the vehicles traveling along the stretch of the road. To test H
0
: μ = 90.0
Against H
1
: μ < 90.0
Conduct a one-
tail test at 5% level of significance, i.e., α = 0.05
Under H
0
, since n
= 60 and is sufficiently large, by Central Limit Theorem, 33.89830508
~
90.0,
60
T
N
approximately. Using GC, p-value = 0.0481545117 Since p-value < 0.05, we reject H
0. There is sufficient evidence at 5% level of significance to conclude that that the mean speed along the 3km stretch of road has been reduced. (v)
If the null hypothesis is not rejected, z
calc
must lie outside the critical region. Critical Region: 1.644853626 z
Test Statistics, 90.0
33.89830508
T
Z
n
~ N(0, 1)
z
calc
= 89
90.0
33.89830508
n
> –
1.644853626 1.644853626 33.89830508
n
9.576708062
n
n
< 91.713 Since n
is an integer, the largest possible value n can take is 91.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
CJC JC2 Prelim 9758/2019/02/Q9 The security guard of a particular school claims that the average speed of the cars in the school compound is greater than the speed limit 25 km/h. To investigate the security guard’s claim, the traffic police randomly selected 50 cars and the speed was recorded. The total speed and the standard deviation of the 50 cars are found to be 1325 km/h and 7.75 km/h respectively. (i) Find the unbiased estimates of the population mean and variance. [2] (ii)
Test at 5% level of significance whether there is sufficient evidence to support the security guard’s claim.
[4] It is now known that the speed of the cars is normally distributed with mean 25 km/h and standard deviation of 6 km/h. (iii)
A new sample of n
cars is obtained and the sample mean is found to be unchanged. Using this sample, the traffic police conducts another test at 10% level of significance and concludes that the security guard’s claim is valid. Find the set of values that n
can take. [3] (iv) What is the speed exceeded by 75% of the cars? [1] CJC JC2 Prelim 9758/2019/02/Q9 (Solutions) 9 (i) Let X
be the random variable denoting the speed of a car in the school compound. Unbiased estimate of population mean
1325
26.5
50
Unbiased estimate of population variance
2
50
7.75
61.288
61.3(3sf)
49
(ii) 0
H :
25
1
H :
25
, where
is the population mean of X. Under 0
H
, Since n
is large, by CLT, 61.28
0
~
2
N
8
5,
5
X
approx. 26.5
25
61.288
50
test
z
Since
-value
0.0877345
0.05
p
, we do not reject 0
H
and conclude that we have insufficient evidence at 5% level of significance that the mean speed of the cars is more than 25. (iii)
6
~ N 25,3
X
6
~ N
3
25,
X
n
26.5
25
0.25
36
test
z
n
n
Since 0
H
is rejected at 10 % level of significance, 1.28155,
test
z
0.25
1.28155
26.278
{ :
,
27}
n
n
n n
n
(iv)
25,36
P(
)
0.75
21.0 (3 s.f.)
~ N
X
X
t
t
DHS JC2 Prelim 9758/2019/02/Q9 The time taken, T
(in minutes), for a 17-year-old student to complete a 5-km run is a random variable with mean 30. After a new training programme is introduced for these students, a random sample of n
students is taken. The mean time and standard deviation for the sample are found to be 28.9 minutes and 4.0 minutes respectively. (a)
Find the unbiased estimate of the population variance in terms of n
. [1] (b)
Using 40,
n
(i)
carry out a test at the 10% significance level to determine if the mean time taken has changed. State appropriate hypotheses for the test and define any symbols you use. [4] (ii)
State what it means by the p
-value in this context. [1] (iii)
Give a reason why no assumptions about the population are needed in order for the test to be valid. [1] (c)
The trainer claims instead that the new training programme is able to improve the mean of T
, 30 minutes, by at least 5%. The school wants to test his claim. (i)
Write down the null and alternative hypothesis. [1] 1.28155 0.10
(ii)
Using the existing sample, the school carried out a test at 1% significance level and found that there was sufficient evidence to reject the trainer’s claim. Find the set of values that n
can take, stating any necessary assumption(s) needed to carry out the test. [4] DHS JC2 Prelim 9758/2019/02/Q9 (Solutions) Qn Suggested Solution
9 (a) An unbiased estimate for the population variance :
2
2
16
4
1
1
n
n
s
n
n
minutes
2
(b) (i) Let
be the population mean time taken for a 17-year-old student to complete a 5 km run. To test at 10 % significance level, 0
1
:
30.0 min
:
30.0 min
H
H
For 40,
n
2
9
16(40)
6
39
40
3
s
Test Statistic: Under 0
H
, 640
39
~ N 30.0,
40
T
approximately by Central Limit Theorem since n
is large p
-value =
2P
28.9
0.0859
0.10
T
, we reject 0
H
and conclude that there is sufficient evidence at the 10 % significance level that the population mean time taken has changed. (ii) The p
-value is the probability of obtaining a sample mean at least as extreme as the given sample, assuming that the population mean time taken has not changed from 30.0 min. OR
The p
-value is the smallest significance level to conclude that the population mean time has changed from 30.0 min. (iii) Since the sample size of 40 is large, by Central Limit Theorem, T
follows a normal distribution approximately. Thus no assumptions are needed. (c) (i) New population mean timing = 0.95
30
28.5
min To test at 5 % significance level, 0
1
:
28.5 min
:
28.5 min
H
H
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
(ii) Assumption: n
is large for Central Limit Theorem to apply. Test Statistic: Under 0
H
, 2
4.0
~ N 28.5,
1
T
n
approximately by Central Limit Theorem For 0
H
to be rejected, we need
P
28.9
0.01
T
4
1
28.9
28.5
P
0.01
n
Z
1
P
0.01
10
n
Z
1
2.3263
10
542.2
n
n
Thus required set =
:
543
n
n
EJC JC2 Prelim 9758/2019/02/Q10 A company manufactures packets of potato chips with X
mg of sodium in each packet. It is known that the mean amount of sodium per packet is 1053 mg. After some alterations to the production workflow, 50 randomly chosen packets of potato chips were selected for analysis. The amount of sodium in each packet was measured, and the results are summarised as follows:
1050
58.0
x
,
2
1050
2326
x
(i)
Test at 5% level of significance whether the mean amount of sodium in a packet of potato chips has changed, after the alterations to the workflow.
[6] (ii)
Explain what ‘5% level of significance’ means in this context. [1] (iii)
A second tester conducted the same test at %
level of significance, for some integer
. However, he came to a different conclusion from the first tester. What is the range of
for which the second tester could have taken? [1] (iv)
Without performing another hypothesis test, explain whether the conclusion in part (i)
would be different if the alternative hypothesis was that the mean amount of sodium had decreased after the alterations to the workflow. [1]
It is given instead that the standard deviation of amount of sodium in a packet of potato chips is 6.0 mg. The mean amount of sodium of a second randomly chosen sample of 40 packets of potato chips is y
. (v)
A hypothesis test on this sample, at 5% level of significance, led to a conclusion that the amount of sodium has decreased. Find the range of values of y
, to 1 decimal place. Explain if it is necessary to make any assumptions about the distribution of the amount of sodium in each packet of potato chips. [3] EJC JC2 Prelim 9758/2019/02/Q10 (Solutions) 1 (i) 58.0
1051.16
50
1050
x
2
2
2
2
1
1
1
58.0
2326
49
50
46.096 (5 1050
105
s. .
0
f )
n
n
x
s
x
To test H
0
: 1053
against H
1
: 1053
at 5% level of significance Since 50
n
is large, by Central Limit Theorem, under H
0
, 46.096
5
~ N 1053
0
,
X
approximately either
p
-value: 0.055322 or z-
value: −1.9163 and critical region: 0.025
1.96
z
Since p
-value > 0.05 (or |z-value| < 1.96), we do not reject H
0
and conclude at 5% level of significance that there is insufficient evidence that the population mean amount of sodium per packet has changed after alterations to the workflow. (ii) The probability of wrongly concluding that the mean amount of sodium is not 1053mg, when it is in fact 1053mg, is 0.05.
(iii) Since p
-value is 0.055322, 6
(iv) If we tested H
1
: 1053
, Either -value
0.027661
0.05
p
or -value
1.9163
1.645
z
So we may reject H
0
and conclude at 5% level of significance that the population mean amount of sodium had decreased. (v) To test H
0
: 1053
against H
1
: 1053
at 5% level of significance Since 50
n
is large, by Central Limit Theorem, under H
0
, 2
6.0
4
~ N 10
,
0
53
X
approximately To reject H
0
, p
-value < 0.05 1051.4 (to 1 d.p.)
y
It is not necessary to assume anything about the population distribution, as sample size (= 40) is large enough, so the Central Limit Theorem says the sample mean amount of sodium approximately follows a normal distribution. HCI JC2 Prelim 9758/2019/02/Q9 A company purchased a machine to pack shower gel into its bottles. The expected mean volume of shower gel in a bottle is 950 ml. (a)
The floor supervisor believes that the machine is packing less amount of shower gel than expected. A random sample of 80 bottles is taken and the data is as follows: Volume of shower gel in a bottle (correct to nearest ml) 948 949 950 951 952 953 955
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Number of bottles 9 22 36 6 4 1 2 (i)
Find unbiased estimates of the population mean and variance, giving your answers correct to 2 decimal places. [2] (ii)
Write down the appropriate hypotheses to test the floor supervisor’s belief. You should define any symbols used. [2] (iii)
Using the given data, find the p
-value of the test. State what is meant by this p
-value in the context of this question. [2] (iv)
It was concluded at %
level of significance that the machine is indeed packing less amount of shower gel than expected. State the set of values of
. [1] (b)
Due to a change in marketing policy, the machine is being recalibrated to pack smaller bottles of shower gel with mean volume of 250 ml. The volume of a recalibrated bottle of shower gel is denoted by Y
ml. A random sample of 50 bottles of y
ml each is taken and the data obtained is summarised by: 2
(
250)
25, (
250)
. y
y
k
Another test was conducted at the 1% significance level. The test concluded that the machine had been calibrated incorrectly. Find the range of values of k
, correct to 1 decimal place. [4] (c)
Explain why there is no need for the floor supervisor to know anything about the population distribution of the volume of shower gel in a bottle for both parts (a)
and (b)
. [1] HCI JC2 Prelim 9758/2019/02/Q9 (Solutions) 9(i) Using GC, Unbiased estimate of population mean is
949.84 2 d.p.
x
Unbiased estimate of population variance is
2
2
1.31634
1.73 2 d.p.
s
Note: If student write 949.84
or 2
2
1.31634
1.73
, annotate but do not deduct. 9 (b)(
ii) Let X
be the volume of shower gel dispensed by the machine. Let
denote the population mean volume of shower gel dispensed by the machine. 2
~ N
,
s
X
n
approx. by Central Limit Theorem. 2
~ N 950,
s
X
n
H
0
: 950
H
1
: 950
9 (a)(i
ii) Under H
0
, using GC, value
0.13474
0.135
p
The p - value is the probability of the sample mean volume of shower gel in a bottle is as extreme as 949.84
when the mean is actually 950. 9 (a)(i
v) Since we reject H
0
, value
/100
p
13.476
:13.5
100
9(b) Let Y
be the volume of shower gel dispensed by the machine after recalibration. Let
denote the population mean volume of refill dispensed by the machine. 2
~ N
,
s
Y
n
approx. by Central Limit Theorem
H
0
: 250
H
1
: 250
Test statistic: Y
Z
s
n
Level of significance: 1% Reject H
0 if value
0.01
p
249.5
250
249.5
250
2.5758293 or 2.5758293 (NA)
50
50
0
0.19411
50
0
1.3726
s
s
s
s
2
2
25
1
0
1.3726
49
50
12.5
104.8
k
k
9(c) There is no need for the floor supervisor to assume the volume of shower gel follow a normal distribution as the sample sizes in both part (a) and (b) are large. The sample mean volume of shower gel can be approximated to follow a normal distribution by Central Limit Theorem.
JPJC JC2 Prelim 9758/2019/02/Q7 A company sells peanut butter in jars. Each jar is labelled as containing m
grams of peanut butter on average. A consumer group suspects that the average mass of peanut butter in a jar is overstated. To test this suspicion, the consumer group checks a random sample of 50 jars and the mass of peanut butter per jar, x
grams, are summarized by
(
390)
120
x
and 2
(
390)
3100
x
. The consumer group uses the above data to carry out a test at the 2% level of significance. The result leads the consumer group to conclude that the company has overstated the average mass of peanut butter in a jar. (i)
Explain why the consumer group is able to carry out a hypothesis test without knowing anything about the distribution of the mass of the peanut butter of the jars. [1] (ii)
Find the least possible value of m
, to the nearest gram that leads to the result of the hypothesis test as stated above. [7]
JPJC JC2 Prelim 9758/2019/02/Q7 (Solutions) 7 (i)
Central Limit Theorem states that sample means will follow a normal distribution approximately when sample size is big enough. (ii) (
390)
120
x
and 2
(
390)
3100
x
120
390
392.4
50
x
2
2
1
120
3100
57.3877551
57.388
50
1
50
s
0
H :
m
vs 1
H :
m
Since n
= 50 is large, by Central Limit Theorem, 57.388
N(
,
)
50
X
m
approximately Level of significance: 2% Critical region: Reject 0
H
when 2.0537
z
Consider 392.4
2.0537
57.388
50
m
394.6
m
(5 s.f.) The least possible value of m
is 395grams. MI PU3 Prelim 9758/2019/02/Q9 A chemist is conducting experiments to analyse the amount of active ingredient A
in a particular type of health supplement tablets. Each tablet is said to contain an average amount of 50 mg of ˗
2.0537
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
active ingredient A
. A random sample of 40 tablets is taken and the amount of active ingredient A
per tablet is recorded. The sample mean is 50.6 mg and the sample variance is 2.15 mg
2
respectively. (i)
Explain the meaning of ‘a random sample’ in the context of the question.
[1] (ii)
Test, at the 1% level of significance, whether the mean amount of active ingredient A
in a tablet has changed. You should state your hypotheses and define any symbols you use. [6] In a revised formula of the same type of health supplement tablets, the amount of active ingredient A
now follows a normal distribution with population variance 1.5 mg
2
. The chemist wishes to test his claim that the mean amount of active ingredient A
per tablet is more than 50 mg. A second random sample of n such tablets is analysed and its mean is found to be 50.4 mg. Find the set of values that n
can take such that the chemist’s claim is valid at 2.5% level of significance.
[4] MI PU3 Prelim 9758/2019/02/Q9 (Solutions) 9(i) A random sample means that each of the tablets were selected
independently
from each other
and
each tablet has an equal chance
of being selected
. 9(ii) Let X
be the amount in grams of active ingredient A
in a particular type of health supplement tablets. Define
: Let
be the population mean amount/mass of active ingredient A
in a particular type of health supplement tablets. Given that sample mean 50.6, sample variance = 2.15
x
Unbiased estimate
of population variance
40
2.15
2.2051
39
0
1
H
:
50
H :
50
Under H
0
, since n
= 40 is large, by Central Limit Theorem
, 2.2051
~ N 50, 40
X
approximately. Use z
-test at 0.01
Using GC
, p
-value = 0.010605 = 0.0106 (3 sf) Since p
-value >
, do not reject H
0
. There is insufficient evidence
at 1% level of significance to conclude that the mean amount of active ingredient
A
has changed (or is not 50 mg).
9 last part Let Y
be the amount in grams of active ingredient A
in the revised formula 0
1
H
:
50
H :
50
Under H
0
, 1.5
~ N 50, Y
n
Use z
-test at 0.025
. Method 1 Test-statistic distribution:
50
~
N
0, 1
1.5
Y
Z
n
Corresponding test-statistic: 50.4
50
0.4
1.5
1.5
n
z
n
Critical region: 1.95996
z
Since the chemist claim is valid, H
0
is rejected. Test-statistic falls within the critical region.
0.4
1.95996
1.5
36.0135
37
:
37
n
n
n
n
n
Method 2 (Using GC, Table/Graph method) For H
0
to be rejected, - value
0.025
p
. p
-value =
P
50.4
0.025
Y
37
n
:
37
n
n
NJC JC2 Prelim 9758/2019/02/Q11 In view of the rise in the number of accidents involving Personal Mobility Devices (PMDs) in the past year, regulations on PMDs are being tightened to step up safety for pedestrians on footpaths. Since February 2019, the speed limit for riding a PMD on footpaths has been lowered to 10 km/h. In June 2019, a concerned citizen believes that the mean speed of PMD riders still exceeds the new speed limit. To test his belief, a team of enforcement officers capture the speeds, x km/h, of a random sample of 250 PMD riders at five hotspots. The findings are summarised by
2
10
38.9, 10
390.31
x
x
.
Assume that the speed of a PMD rider follows a normal distribution.
(i)
Calculate unbiased estimates of the population mean and variance of the speed of PMD riders on footpaths. [2]
(ii)
Test at the 4% level of significance whether the belief should be accepted. [4]
(iii)
Explain, in the context of the question, the meaning of ‘at the 4% level of significance’.
[1]
(iv)
Without any further test, explain if the conclusion in part (ii)
will change if instead we are testing that the mean speed of PMD riders on footpaths is not 10 km/h. [1]
Three months later, to investigate if the mean speed of PMD riders on footpaths differs from the regulated speed limit, another large random sample is collected at the same hotspots. It is found that the mean speed captured is 9.5 km/h, with standard deviation 3.2 km/h. (v) Find the smallest sample size that will provide significant evidence at the 5% level of significance that the mean speed of PMD riders on footpaths differs from the regulated speed limit. [4] NJC JC2 Prelim 9758/2019/02/Q11 (Solutions) 11 Suggested Solution (i) Unbiased estimate of the population mean, 38.9
10
10.1556
10.2
250
x
Unbiased estimate of the population variance,
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
2
2
1
38.9
390.31
1.543201
1.54
249
250
s
(ii) Test 0
H :
10
against 1
H :
10
Level of significance: 4% (upper-tailed) Under
0
H
, 1.543201
~ N
10,
approximately.
250
X
10
~ N 0,1 approximately
1.543201
250
X
Z
From GC, p
-value 0.0238
< 0.04 OR
critical value, 0.96
1.75
z
and observed
test statistic 1.98
> 1.75
Thus, we reject 0
H
and conclude that there is sufficient evidence at the 4 % level of significance that the mean speed of PMD riders exceeds the new speed limit. (iii) There is a probability of 0.04 (or 4% chance) of wrongly rejecting the claim that the mean speed of PMD riders does not exceed (or is) 10 km/h. (iv) Test 0
H :
10
against 1
H :
10
Level of significance: 4% (2-tailed) For a 2-tailed test, p
-value
2 0.0238
0.0476
> 0.04 The conclusion will change. (v) Let n
be the sample size.
2
2
10.24
3.2
1
1
n
n
s
n
n
Test 0
H :
10
against 1
H :
10
Under
0
H
, 10.24
~ N
10,
approximately.
1
X
n
Test statistic:
10
~ N 0,1 approximately
10.24
1
X
Z
n
Level of significance: 5% (2-tailed) To reject 0
H
, 9.5
10
9.5
10
1.96 or 1.96
10.24
10.24
1
1
1
1
3.92 3.92 (N.A.)
10.24
10.24
158.35
159
n
n
n
n
n
n
Thus, smallest n
= 159. NYJC JC2 Prelim 9758/2019/02/Q10 In the manufacturing of a computer device, there is a process which coats a computer part with a material that is supposed to be 100 microns thick. If the coating is too thin, the proper insulation of the computer device will not occur and it will not function reliably. Similarly, if the coating is too thick, the device will not fit properly with other computer components. The manufacturer has calibrated the machine that applies the coating so that it has an average coating depth of 100 microns with a standard deviation of 10 microns. When calibrated this way, the process is said to be “in control”.
Due to wear out of mechanical parts, there is a tendency for the process to drift. Hence the process has to be monitored to make sure that it is in control. (i)
After running the process for a reasonable time, a random sample of 50 computer devices is drawn. The sample mean is found to be 103.4 microns. Test at the 5% level of significance whether the sample suggests that the process is not in control. State any assumptions for this test to be valid. [4] (ii)
To ease the procedure of checking, the supervisor of this process would like to find the range of values of the sample mean of a random sample of size 50 that will suggests that the process is not in control at 5% level of significance. Find the required range of values of the sample mean, leaving your answer to 1 decimal places. [3] On another occasion, a random sample of 40 computer devices is taken. The data can be summarised by 2
(
100)
164,
(
100)
9447
y
y
. (iii)
Calculate the unbiased estimate for the population mean and population variance of the thickness of a coating on the computer device. [2] (iv)
Give, in context, a reason why we may not be able to use 10 microns for the standard deviation of the thickness of a coating on the computer device. [1]
(v)
Assume that the standard deviation has changed, test at the 4% level of significance whether the sample suggests that the process is not in control. [3] NYJC JC2 Prelim 9758/2019/02/Q10 (Solutions) 10(i) Let X
be the thickness of the coating on a randomly chosen computer device. Let
be the mean thickness of the coating of a computer device. Assume that the standard deviation of the coating of a computer device remains unchanged. To test :
0
1
:
100
:
100
H
H
Level of Significance: 5% Under 0
H
, since sample size 50
n
is large, by Central Limit Theorem, 100
~
(0,1) approx.
50
10 /
X
N
Z
Reject 0
H
if 0.05
p
value
. Calculations: 103.4
x
0.0162
p
value
Conclusion: Since 0.05
p
value
, we reject 0
H
and conclude that there is significant evidence at 5% level of significance that the process is not in control. 10(ii) Reject 0
H
is |
| 1.960
calc
z
For 0
H
to be rejected, 100
1.95996
50
10
10
100
1.95996
or 100
1.95996
50
50
97.228 or 102
/
2
0
.77
1
x
x
x
x
x
Thus the required range of values of x
is 0
97.2 or 102.8
x
x
. 10(iii) 4164
104.1
40
y
(
100)
4164
4000
164
y
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
2
2
2
2
(
100
1
(
100)
39
1
164
9447
3
40
40
9
43873
224.9897
195
y
s
y
10(iv) The standard deviation may have changed due to the wear out of mechanical parts as well. 10(v) To test :
0
1
:
100
:
100
H
H
Level of Significance: 4% Under 0
H
, since sample size 40
n
is large, by Central Limit Theorem, 100
~
(0,1) approx.
40
/
Y
N
Z
S
Reject 0
H
if 0.04
p
value
. Calculations: 2
104.1,
224.9897
x
s
0.0839
p
value
Conclusion: Since 0.04
p
value
, we do not reject 0
H
and conclude that there is insignificant evidence at 4% level of significance that the process is not in control. RI JC2 Prelim 9758/2019/02/Q7 Based on past records, at government polyclinics, on average each medical consultation lasts 15 minutes with a standard deviation of 10 minutes.
(i) Give a reason why a normal distribution, with this mean and standard deviation, would not give a good approximation to the distribution of consultation duration.
[2]
From recent patient and doctor feedback, a polyclinic administrator claims that the average consultation duration at government polyclinic is taking longer. State suitable null and alternative hypotheses to test this claim. [1]
(ii) Given that the null hypothesis will be rejected if the sample mean from a random sample of size 30 is at least 18, find the smallest level of significance of the test. State clearly any assumptions required to determine this value. [3]
The administrator suspects the average consultation duration at private clinics is actually less than 15 minutes. A survey is carried out by recording the consultation duration, y
, in minutes, from 80 patients as they enter and leave a consultation room at a private clinic. The results are summarized by
(
15)
50
y
, 2
(
15)
555
y
. (iii) Calculate unbiased estimates of the population mean and variance of the consultation duration at private clinics. Determine whether there is sufficient evidence at the 5% level of significance to support the administrator’s claim.
[4]
RI JC2 Prelim 9758/2019/02/Q7 (Solutions) 7(i) [2] Let X be the time in minutes of a medical consultation. If 2
~ N(15,10 )
X
, then P(
0)
0.066807 (5 s.f.)
0.0668 (3 s.f.)
X
This means that about 7 patients out of every 100 take a “negative” amount of time which is not possible. [1] 0
H :
15
vs 1
H :
15
where
denotes the population mean consultation time in minutes. 7(ii) [3] Under 0
H
, since n
= 30 is large, 2
10
~ N 15,
30
X
approximately by Central Limit Theorem. We also assume that the population standard deviation is unchanged. P(
18)
0.050174 (5 s.f.)
0.0502 (3 s.f.)
X
The smallest level of significance is 5.02% 7(iii) [4] (
15)
50
y
, 2
(
15)
555
y
. An unbiased estimate of the population mean is, 50
115
15
80
8
y
An unbiased estimate of the population variance is, 2
2
1
(50)
2095
555
79
80
316
s
0
H :
15
vs 1
H :
15
Under 0
H
, since n
= 80 is large, 2095
316
~ N 15,
80
Y
approximately by Central Limit Theorem. Since
p
-value = 115
P
8
Y
= 0.0149623748 = 0.0150 (3 sf) < 0.05.
We reject 0
H
, and conclude that there is sufficient evidence, at the 5% significance level, to support the administrator’s claim that the average consultation time at private clinics is less than 15 minutes. RVHS JC2 Prelim 9758/2019/02/Q10 Engineers claimed that a newly developed engine is efficient. The number of hours x
the engine ran on one litre of fuel was measured on 70 occasions. The results are summarized by
46
27
x
,
2
46
30939.
x
(i) Calculate unbiased estimates for the population mean and variance of the number of hours the engine runs on one litre of fuel. [2] (ii) Determine the greatest mean number of hours the engineers should claim so that there is sufficient evidence at the 5% level of significance that the engine is efficient. [3] (iii) The engineers collected five more data points: 56, 34, 63, 50, 54. The engineers found that when all 75 data points are considered, they can no longer claim that the engine is efficient at the
% level of significance. Show that the unbiased estimate for the population variance, for the set of 75 data points is 426.378, correct to 3 decimal places. Find the greatest integer value of
. [4] RVHS JC2 Prelim 9758/2019/02/Q10 (Solutions) Question 10 [9] Hypo Test (i) 27
46
45.61428
45.6 (to 3s.f.)
70
x
2
2
27
1
30939
448.2403727
448 (to 3 s.f.)
69
70
s
(ii) Let 0
be the mean number of hours the engineer claimed. Test H
0
:
= 0
against H
1
:
> 0
at 5% significance level Test statistic: Under H
0
,
0
~ N 0,1
/
70
X
Z
s
To reject H
0
,
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
0
0
0
0
1.644853632
/
70
1.644853632
70
41.45197671
41.5 (to 3 s.f.)
x
s
s
x
Thus, the greatest mean number of hours the engineers should claim is 41.5. (iii) Based on n
= 70, 27
46
70
3193
70
x
Based on n
= 75,
2
2
2
2
2
2
2
2
2
3193
56
34
63
50
54
46
75
46
30939
(56
46)
(34
46)
(63
46)
(50
46)
(54
46)
31552
426.3783783784
426.378 (3.s.f)
74
x
x
x
x
x
x
s
Test H
0
:
= 41.5 against H
1
:
> 41.5 at
% significance level Test statistic: Under H
0
,
41.5
~ N 0,1
/
75
X
Z
s
By GC, p
-value = 0.029559 Do not reject H
0
, 100
0.029559
100
2.9559
p
value
The greatest integer value of
is 2. [ If 0
41.4
, p
-value = 0.026649, then the greatest integer value of
is 2.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
If 0
41.45197671
, p
-value = 0.028230374, then the greatest integer value of
is 2. ] SAJC JC2 Prelim 9758/2019/02/Q11 The time T
seconds required for a computer to boot up, from the moment it is switched on, is a normally distributed random variable. The specifications for the computer state that the population mean time should not be more than 30 seconds
. A Quality Control inspector checks the boot up time using a sample of 25 randomly chosen computers.
A particular sample yielded t
802.5 and
2
26360.25
t
. (i) Calculate the unbiased estimates of the population mean and variance. [2] (ii) What do you understand by the term “unbiased estimate”?
[1] (iii) Test, at the 5% level of significance level, whether the specification is being met. Explain in the context of the question, the meaning of “5% level of significance”.
[5] (iv) Find the
range of values of t
such that the specification will be met in the test carried out in part (iii)
.
[1] (v)
A new Quality Control policy is that when the specification is not met, all the computers will be sent back to the manufacturer for upgrading. The inspector tested a second random sample of 25 computers, and the boot up time, y
seconds, of each computer is measured, with 32.4
y
. Using a hypothesis test at the 5% level of significance, find the range of values of the population standard deviation such that the computers will not be sent back for upgrading. [3] SAJC JC2 Prelim 9758/2019/02/Q11 (Solutions) 11(i) Let T
be the random variable “ time taken in seconds for a computer to boot up”, with population mean
. Unbiased estimate of the population mean, 802.5
32.1
25
t
Unbiased estimate of the population variance, 2
2
1
802.5
26360.25
25
24
25
s
(ii) A statistic is said to be an unbiased estimate of a given parameter when the mean of the sampling distribution of the statistic can be shown to be equal to the parameter being estimated. For example, E(
)
X
. (iii) Test H
0
: 30
against 1
:
30
H
at the 5% level of significance. Under H
0
, 25
~
(30,
)
25
T
N
. Using GC,
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
t
= 32.1 gives rise to z
calc
= 2.1 and p
-value = 0.0179 Since p
-value = 0.0179
0.05, we reject H
0
and conclude that there is sufficient evidence at the 5% significance level that the specification is not being met (or the computer requires more than 30 seconds to boot up). “5% significance level” is the probability of wrongly concluding that the mean boot up time for the computer is more than 30 seconds when in fact it is not more than 30 seconds. (iv) The critical value for the test is 31.645. For the specification to be met, H
0
is not rejected. 31.6 (3 s.f.)
t
Since t
> 0, Answer is 0
31.6.
t
(v) Under H
0, 2
~
30,
25
Y
N
(30)
~ N(0,1)
25
Y
Z
Using a 1 –
tailed z
test, calc
5
32.4
30
12
z
, crit
1.64485
z
In order not to reject H
0, 12
1.64485
7.2955
7.30
TJC JC2 Prelim 9758/2019/02/Q10 (a)
Two random samples of different sample sizes of households in the town of Aimek were taken to find out the mean number of computers per household there. The first sample of 50 households gave the following results. Number of computers 0 1 2 3 4 Number of households 5 12 18 10 5 The results of the second sample of 60 households were summarised as follows. 118
y
2
314
y
, where y
is the number of computers in a household.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
(i)
By combining the two samples, find unbiased estimates of the population mean and variance of the number of computers per household in the town. [4] (ii)
Describe what you understand by ‘population’ in the context of this question.
[1] (b)
Past data has shown that the working hours of teachers in a city are normally distributed with mean 48 hours per week. In a recent study, a large random sample of n teachers in the city was surveyed and the number of working hours per week was recorded. The sample mean was 46 hours and the sample variance was 131.1 hours
2
. A hypothesis test is carried out to determine whether the mean working hours per week of teachers has been reduced. (i) State appropriate hypotheses for the test. [1] The calculated value of the test statistic is z
= −1.78133 correct to 5 decimal places.
(ii) Deduce the conclusion of the test at the 2.5 % level of significance. [2] (iii) Find the value of n
. [3] (iv) In another test, using the same sample, there is significant evidence at the %
level that there is a change in the mean working hours per week of teachers in that city. Find the smallest possible integral value of
. [2]
TJC JC2 Prelim 9758/2019/02/Q10 (Solutions) 10(a)(i) Let X
be the number of computers in each household in the first sample. Using GC, 98
fx
,
2
254
fx
Unbiased estimate of the population mean 98
118
108
(
1.96)
50
60
55
Unbiased estimate of the population variance
2
98
118
1
7912
254
314
(
1.32)
110
1
110
5995
(a)(ii) The population in this question refers to all the households in the town.
(b)(i) 0
H :
48
1
H :
48
(ii) Critical region =
:
1.960
z
z
Since z
cal = 1.781
> −1.960, we do not reject H
0
OR
p
-value =
P
1.781
0.0375
Z
> 0.025, we do not reject H
0 There is insufficient evidence at 2.5% level of significance that the mean working hours per week of teachers in the city has been reduced. (iii)
2
131.1
1
n
s
n
Given z
= 1.781
46
48
1.781
131.1
1
n
[Turn over
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
131.1
2
1.781
1
n
1.781 131.1
1
2
n
105
n
(iv) p
-value for the two tailed test =
2
P
1.781
0.0749
Z
(accept 0.075) For H
0
to be rejected, p
-
value ≤ 100
7.49
Smallest value of
= 8
TMJC JC2 Prelim 9758/2019/02/Q7 MHL bakery sells mini breads that weighs an average of 45 grams each. A customer claims that the bakery is overstating the average weight of mini breads. To test this claim, a random sample of 80 mini breads are selected from MHL bakery and the weight, x
grams, of each mini bread is measured. The results are summarised as follows. 80
n
3571
x
2
159701
x
Calculate unbiased estimates of the population mean and variance of the weight of mini breads. [2] Test, at the 4% level of significance, whether there is sufficient evidence to support the customer’s claim. [4] From past records, it is known that the weights of the mini breads from MHL bakery are normally distributed with standard deviation 1.5 grams. To further investigate the customer’s claim, the bakery records the weights of another 20 randomly selected mini breads and the average weight for the second sample is k
grams. Based on the combined sample of 100 mini breads, find the range of values of k
such that the customer’s claim is valid at the 4% level of significance. [4] TMJC JC2 Prelim 9758/2019/02/Q7 (Solutions) Qn Solution Mark Scheme 7 Hypothesis Testing [10]
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Let X
be the weight of a randomly chosen mini bread (in grams). Let
denote the population mean weight of mini breads (in grams) 3571
Unbiased estimate of population mean, 80
44.6375
x
2
2
3571
1
Unbiased estimate of population variance, 159701
79
80
3.8036
3.80 (3 s.f.)
s
0
1
H : 45
H : 45
Under 0
H
, since 80
n
is large, by Central Limit Theorem, 3.8036
~ N 45,
80
X
approximately Test Statistic: Z
= 45
3.8036
80
X
Level of significance : 4 % Reject H
0
if p - value < 0.04 Using GC, p
-value = 0.0482
Since p
-value = 0.0482
0.04
, we do not reject 0
H
and conclude that there is insufficient evidence, at the 4% level of significance, that the population mean weight is less than 45 grams. Thus, the customer’s claim is not supported at the 4% significance level. Sample mean based on combined sample = 20
3571
20
80
20
100
x
k
k
0
1
H : 45
H : 45
Under H
0
,
2
2
1.5
~ N 45,1.5
~ N 45,
100
X
X
Test statistic: 2
45
1.5
100
X
Z
Level of significance : 4 % Reject H
0
if value
1.7507
z
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
2
45
value
1.5
100
x
z
Since there is sufficient evidence that the customer’s claim
is valid at 4% level of significance, H
0
is rejected 2
45
1.7507
1.5
100
44.737395
3571
20
44.737395
100
45.137
45.1
(3 s.f)
x
x
k
k
k
VJC JC2 Prelim 9758/2019/02/Q9 Exposure to Volatile Organic Compounds (VOCs), which have been identified in indoor air, is suspected as a cause for headaches and respiratory symptoms. Indoor plants have not only a positive psychological effect on humans, but may also improve the air quality. Certain species of indoor plants were found to be effective removers of VOCs. A commonly known VOC is Benzene
. The following data gives the benzene levels, x
(in ppm) in 40 test chambers containing the indoor plant Epipremnum aureum
. , , . The initial mean Benzene level (in ppm) without Epipremnum aureum
was found to be 26.0. (i)
Test, at the 5% level of significance, the claim that the mean Benzene level, (in ppm), has decreased as a result of the indoor plant Epipremnum aureum
. You should state your hypotheses clearly. [5] (ii) State, giving a reason, whether there is a need to make any assumptions about the population distribution of the Benzene level in order for the test to be valid. [2] The Benzene levels of another 50 test chambers containing the indoor plant Epipremnum aureum
were recorded, The sample mean is ppm and the sample variance is 8.33 ppm
2
. (iii) The acceptance region of a test of the null hypothesis is . State the alternative hypothesis and find the level of significance of the test. [4] 40
n
26.0
30.1
x
2
26.0
214.61
x
x
26.0
25.1
x
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
(iv) If the null hypothesis is , where , would the significance level of a test with the same acceptance region in part (iii)
be larger or smaller than that found in part (iii)
? Give a reason for your answer. [2]
VJC JC2 Prelim 9758/2019/02/Q9 (Solutions) 9i 0
1
H : 26.0
H : 26.0
Level of significance: 5% Test Statistic: Since n
is sufficiently large, by CLT, X
is approximately normal.
0
26.0
When H
is true N 0,1 X
Z
S
n
approx Computation:
2
2
30.1
40, 26
25.2475
40
30.1
1
214.61
4.9220
39
40
n
x
s
value
0.015969
0.0160
p
Conclusion: Since
value
0.0160
p
< 0.05, 0
H
is rejected at the 5% level of significance. Therefore, there is sufficient evidence to conclude that the mean benzene level has decreased as a result of the indoor plant Epipremnum aureum at the 5% level of significance. 9ii There is no need to assume the population distribution of the benzene level because n
= 40 is sufficiently large, so by Central Limit Theorem, the sample mean benzene level, X
, follows a normal distribution approximately
. 9iii 0
1
H : 26.0
H : 26.0
Test Statistic: Since n
is sufficiently large, by Central Limit Theorem, X
is approximately normal.
0
26.0
When H
is true N 0,1
X
Z
S
n
approx. Computation:
2
50
8.33
8.5
49
s
0
0
26.0
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Rejection region is 25.1
x
Level of significance
0
0
P
25.1 when H
is true
OR P
2.1828 when H
is true
0.014524
X
Z
Level of significance of this test is 1.45%. 9iv 0
0
0
1
0
H : ,
where 26.0
H :
Level of significance
0
P
25.1 when X
P
25.1 when 26.0 X
(refer to diagram above) Level of significance is smaller than that in (iii)
. YIJC JC2 Prelim 9758/2019/02/Q10 A previous study revealed that the average time taken to assemble a certain type of electrical component is at least 15 minutes. The manager wants to investigate if the results of the study is valid. A random sample of 40 components is taken and the times taken to assemble the components are summarised in the following table: Time to assemble a component (min) 9 10 12 13 15 16 17 18 Number of components 1 6 3 8 5 7 8 2 (i) Find unbiased estimates of the population mean and variance. [2] (ii) Test at the 5% level of significance whether the results of the study is valid. You should state your hypotheses and define any symbols you use. [5] (iii) Explain why the manager is able to conduct the test without knowing anything about the distribution of the times taken to assemble the electrical components. [1] (iv) Explain what is meant by the phrase “5% level of significance” in this context. [1] 0.05 25.1 26.0
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
The manager claims that the average time taken to assemble another type of electrical component is 30 minutes. A random sample of 50 components of this type is chosen and the time taken to assemble each component is recorded. The mean and standard deviation of the sample are 29.7 minutes and k
minutes respectively. Find the range of possible values of k
if a test at the 8% significance level shows that there is sufficient evidence that the manager’s claim is valid.
[4] YIJC JC2 Prelim 9758/2019/02/Q10 (Solutions) 10(i) An unbiased estimate of
is x
14.2 2
2
An unbiased estimate of is 7.0871
7.09
s
10(ii)
is the population mean time taken to assemble an electrical component. H
0
: 15
(results of study) H
1
: 15
Under H
0
, the test statistic is ~ N(0,1)
X
Z
S
n
approximately (by CLT), where 15
, 2
7.0871
s
, n
= 40, 14.2
x
. From GC, p
-value = 0.0287 Since the p-
value = 0.0287
0.05, we reject H
0
and conclude that at the 5% level, there is sufficient evidence that the results of the study is not valid. 10(iii) Since sample size is large, by Central Limit Theorem, the sample mean time taken to assemble an electrical component will be approximately normal. 10(iv) “5% level of significance” means that there is a probability of 0.05 of concluding that the results of the study is not valid when the mean time taken to assemble an electrical component is indeed 15 minutes. H
0
: 30
(claim) H
1
: 30
Under H
0
, the test statistic is ~ N(0,1)
X
Z
S
n
approximately (by CLT), where 30
, 2
2
50
49
s
k
, n
= 50, 29.7
x
. Critical Regions: z
1.750686 or z
1.750686
Given that H
0
is not rejected
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
1.750686
1.750686
0.250098
0.3 and 0.3
0.250098
1.19953 1.19
2
95
.
7
3
9 7
30
k
k
k
k
k
1.20
k
(3 s.f)
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Recommended textbooks for you
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Recommended textbooks for you
- Glencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL