chap 9 Expect_The_Unexpected_A_First_Course_In_Biostatist..._----_(Statistics) (2)

pdf

School

University of Ottawa *

*We aren’t endorsed by this school

Course

2379

Subject

Statistics

Date

Jan 9, 2024

Type

pdf

Pages

Uploaded by GrandUniverseHyena41

Chapter 9 Hypothesis Testing In this chapter, we introduce another statistical method for drawing conclu- sions about the values of a parameter. This method consists in confronting two hypotheses which speak about the parameter values. It is used when one wants to gain support (or evidence) towards a desired statement, called “the research hypothesis”, and denoted by H 1 . The other hypothesis, which the researcher wants to reject, is called the “null hypothesis”, and is de- noted by H 0 . When using this method, we formulate the two hypotheses with the goal of rejecting H 0 , and gaining evidence towards H 1 . 9.1 Hypothesis Testing for the Mean: Large Samples In this section, we introduce the method of hypothesis testing, when the parameter of interest is the population mean μ , and the sample size n is large, i.e. n ≥ 40. The null hypothesis H 0 says that the unknown parameter μ is equal to a specified numerical value μ 0 : H 0 : μ = μ 0 . Under new experimental conditions, the mean measurement μ is thought to deviate from μ 0 , which is a value obtained under standard conditions. The alternative hypothesis H 1 (that we would like to gain evidence for) specifies the direction of this change in μ . This hypothesis can take three different forms: (1) μ is larger than μ 0 . In this case, we write H 1 : μ > μ 0 , and we say that we perform a right-tailed test . This set-up is used when one wants to gain evidence that μ exceeds the hypothesized value μ 0 . 141 Balan, R., & Lamothe, G. (2017). Expect the unexpected : A first course in biostatistics (second edition). World Scientific Publishing Company. Created from ottawa on 2023-09-29 20:14:56. Copyright © 2017. World Scientific Publishing Company. All rights reserved.

142 Expect the Unexpected: A First Course in Biostatistics (2) μ is smaller than μ 0 . In this case, we write H 1 : μ < μ 0 and we say that we perform a left-tailed test . This set-up is used when one wants to gain evidence that μ diminishes compared to μ 0 . (3) μ is different than μ 0 . In this case, we write H 1 : μ 6 = μ 0 and we say that we perform a two-tailed test . This set-up is used when the direction of the change in μ is unknown. Setting up the hypothesis in the desired way (i.e. choosing the appropri- ate alternative hypothesis H 1 , among the three possibilities listed above) is the first and most important step of a statistical testing procedure. Before performing the test, the statistician has to decide what is the alternative hypothesis H 1 . This decision dictates automatically which of the three cases above has to be used for the problem at hand. The conclusion of a test of hypothesis is one of the following: (i) We reject H 0 . In this case, we say that there is enough evidence in favour of H 1 . (We may say that H 1 is true.) (ii) We fail to reject H 0 . In this case, we say that there is not enough evidence in favour of H 1 . (We avoid saying that H 0 is true, although this may help with the logic.) As a consequence, hypothesis testing can result in two types of errors: • Type I error (whose probability is denoted by α ) is encountered if we reject H 0 , when H 0 is true. • Type II error (whose probability is denoted by β ) is encountered if we fail to reject H 0 , when H 1 is true. Ideally, both probabilities α and β should be small. The table below illus- trates all 4 possibilities: Reject H 0 Fail to Teject H 0 H 0 True Type I error Correct decision (probability α ) (probability 1 - α ) H 1 True Correct decision Type II error (probability 1 - β ) (probability β ) Fig. 9.1 Probabilities associated with a test of hypothesis Balan, R., & Lamothe, G. (2017). Expect the unexpected : A first course in biostatistics (second edition). World Scientific Publishing Company. Created from ottawa on 2023-09-29 20:14:56. Copyright © 2017. World Scientific Publishing Company. All rights reserved.

Hypothesis Testing 143 Example 9.1. The effects of inhaling particle matter (PM) have been widely studied in humans. The smaller particles PM 10 (particles with di- ameter of less that 10 micrometers) are especially dangerous, and possibly related to asthma and lung cancer. As of January 1, 2005 the European Commission has set the limit for the PM 10 in the air at 50 μg/m 3 (daily average). Local health organizations in a large European city are concerned that the PM 10 level in the outdoor air is higher than the 50 μg/m 3 permis- sible. To test the validity of this statement, levels of PM 10 were measured on 40 different days, yielding an average ¯ x = 52 . 5 μg/m 3 and a sample variance s 2 = 33 . 5. To set-up correctly the two hypotheses, we keep in mind that we want to reject H 0 , in favor of H 1 . Therefore, we set H 0 : “the average level of PM 10 is equal to 50” and H 1 : “the average level of PM 10 exceeds 50”. We are confronting the following two hypotheses: H 0 : μ = 50 versus H 1 : μ > 50 . A type I error occurs when we decide that the PM 10 level is higher than 50, when in fact it is not. This does not have a negative health impact, but may result in falsely alarming the public. A type II error occurs when we are unable to gain evidence that the PM 10 level is higher than 50, when in fact it is. This may have a negative health effect on the population. Example 9.2. Cholesterol is one of the body’s fats, used for making cell membranes, vitamin D and hormones. High levels of low-density lipoprotein (LDL) cholesterol in the blood can cause the build up of plaque in the artery walls, which is a major risk factor for heart disease and stroke. The Canadian Heart and Stroke Foundation advises a diet low in saturated fats and regular physical activities as effective measures for reducing the LDL blood cholesterol levels. To gain evidence for this statement, we use a sample of 52 Canadians with a high level of LDL blood cholesterol of 4.0 nmol/L, who were on a low-fat diet for 30 days, combined with 30 minutes of daily cardio exercises. After this period, the average LDL blood cholesterol level for this sample was found to be ¯ x = 3 . 5, (which is lower than the initial value μ 0 = 4 . 0), with a sample standard deviation s = 1 . 12. We now set-up the two hypotheses in the desired direction. The goal is to reject H 0 , and gain evidence for H 1 . The null hypothesis H 0 : μ = 4 . 0 says that despite the new measures, the average LDL blood cholesterol level stays the same. The alternative hypothesis H 1 : μ < 4 . 0 says that the LDL Balan, R., & Lamothe, G. (2017). Expect the unexpected : A first course in biostatistics (second edition). World Scientific Publishing Company. Created from ottawa on 2023-09-29 20:14:56. Copyright © 2017. World Scientific Publishing Company. All rights reserved.

Your preview ends here