MNET 315 Ch 11 Text Nonparametric Tests (Missing from book)

pdf

School

New Jersey Institute Of Technology *

*We aren’t endorsed by this school

Course

315

Subject

Statistics

Date

Jan 9, 2024

Type

pdf

Pages

Uploaded by LieutenantKookabura2603

C H A P T E R 11 582 Nonparametric Tests 11.1 The Sign Test 11.2 The Wilcoxon Tests Case Study 11.3 The Kruskal-Wallis Test 11.4 Rank Correlation 11.5 The Runs Test Uses and Abuses Real Statistics—Real Decisions Technology In a recent year, the most common form of reported identity theft was employment- or tax-related fraud, which accounted for 34% of cases. The second most common form was credit card fraud, which accounted for 33% of cases.

583 Where You’re Going In this chapter, you will study additional statistical tests that do not require the population distribution to meet any specific conditions. Each of these tests has usefulness in real-life applications. With the data above, the number of fraud complaints F and the number of identity theft victims V can be related by the regression equation V = 0.145 F + 429.103. The correlation coefficient is approximately 0.915, so there is a strong positive correlation. You can determine that the correlation is significant by using Table 11 in Appendix B. Further analysis of the data, however, can show that the variables do not appear to have a bivariate normal distribution, which is one of the requirements for using the Pearson correlation coefficient. So, although a simple correlation test might indicate a relationship between the number of fraud complaints and the number of identity theft victims, one might question the results because the data do not fit the requirements for the test. Similar tests you will study in this chapter, such as Spearman’s rank correlation test, will give you additional information. The Spearman’s rank correlation coefficient for this data is approximately 0.962. At a = 0.01, there is in fact a significant correlation between the number of fraud complaints and the number of identity theft victims for each state. Fraud complaints Identity theft victims x y Number of Fraud Complaints and Identity Theft Victims for 25 States 20,000 40,000 60,000 80,000 100,000 120,000 5,000 10,000 15,000 20,000 25,000 Where You’ve Been Up to this point in the text, you have studied dozens of different statistical formulas and tests that can help you in a decision-making process. Specific conditions had to be satisfied in order to use these formulas and tests. Suppose it is believed that as the number of fraud complaints in a state increases, the number of identity theft victims also increases. Can this belief be supported by actual data? The table below shows the numbers of fraud complaints and the numbers of identify theft victims for 25 randomly selected states in a recent year. (Source: Federal Trade Commission) Fraud complaints 39,344 45,528 33,745 21,117 7593 117,189 5768 7800 14,635 Identity theft victims 4007 8748 6203 4933 1484 12,787 789 1348 2532 Fraud complaints 5642 48,594 107,557 4600 25,636 7525 112,006 77,213 Identity theft victims 1170 8251 17,430 711 3993 1352 20,205 11,009 Fraud complaints 20,350 22,385 7206 2775 51,036 12,750 40,423 9948 Identity theft victims 3337 4312 1216 503 5718 2540 8310 1093

The Sign Test 11.1 584 CHAPTER 11 Nonparametric Tests What You Should Learn How to use the sign test to test a population median How to use the paired-sample sign test to test the difference between two population medians (dependent samples) The Sign Test for a Population Median The Paired-Sample Sign Test The Sign Test for a Population Median Many of the hypothesis tests studied so far have imposed one or more requirements for a population distribution. For instance, some tests require that a population must have a normal distribution, and other tests require that population variances be equal. What should you do when such requirements cannot be met? For these cases, statisticians have developed hypothesis tests that are “distribution free.” Such tests are called nonparametric tests. A nonparametric test is a hypothesis test that does not require any specific conditions concerning the shapes of population distributions or the values of population parameters. DEFINITION Nonparametric tests are usually easier to perform than corresponding parametric tests. They are, however, usually less efficient than parametric tests. Stronger evidence is required to reject a null hypothesis using the results of a nonparametric test. Consequently, whenever possible, you should use a parametric test. One of the easiest nonparametric tests to perform is the sign test. The only condition necessary to use a sign test is that the sample is randomly selected. The sign test is a nonparametric test that can be used to test a population median against a hypothesized value k. DEFINITION The sign test for a population median can be left-tailed, right-tailed, or two-tailed. The null and alternative hypotheses for each type of test are shown below. Left-tailed test: H 0 : median Ú k and H a : median 6 k Right-tailed test: H 0 : median … k and H a : median 7 k Two-tailed test: H 0 : median = k and H a : median ≠ k To use the sign test, first compare each entry in the sample with the hypothesized median k . When the entry is below the median, assign it a - sign; when the entry is above the median, assign it a + sign; and when the entry is equal to the median, assign it a 0. Then compare the number of + and - signs. (The 0’s are ignored.) When there is a large difference between the number of + signs and the number of - signs, it is likely that the median is different from the hypothesized value and you should reject the null hypothesis. Study Tip For many nonparametric tests, statisticians test the median instead of the mean.

Your preview ends here