Concept explainers
The two-way table for the given situation may vary. One of the possible answers is given below:
To give: The two-way for the given situation.
To show: The obese smokers and obese non-smokers who are more likely to die earlier than people who are not obese.
To check: Whether the Simpson’s paradox holds good for the given situation.
Answer to Problem 6.26E
The two-way table for smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 8 | 57 |
No | 192 | 1,443 |
Total | 200 | 1,500 |
The two-way table for non-smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 8 | 6 |
No | 592 | 594 |
Total | 600 | 600 |
The result tells that 4% of obese smokers had died, 1.3% of obese non-smokers had died. However, only 3.8% of not obese smokers had died and 1% of not obese non-smokers had died.
The Simpson’s paradox holds good for the given situation because the two individual tables tell that obese people are more prone to death but the combined table tells that non-obese people who are smoking are more prone to death.
Explanation of Solution
Given info:
Two-way table can be constructed using obese (yes or no) and death (yes or no) for smokers and non-smokers. Also, combine the two-way table of smokers and non-smokers with respect to obese and death.
Calculation:
The two-way table for smokers is constructed by taking death as a row variable and obese as a column variable.
The two-way table for smokers is given below:
Table 1
Obese | ||
Death | Yes | No |
Yes | 8 | 57 |
No | 192 | 1,443 |
Total | 200 | 1,500 |
The two-way table for non-smokers is given below:
Table 2
Obese | ||
Death | Yes | No |
Yes | 8 | 6 |
No | 592 | 594 |
Total | 600 | 600 |
Percentage of obese smokers who died is given below:
Thus, 4% of obese smokers had died.
Percentage of obese nonsmokers who died is given below:
Thus, 1.3% of obese non-smokers had died.
Percentage of not obese smokers who died is given below:
Thus, 3.8% of not obese smokers had died.
Percentage of non-smokers who are not obese died is given below:
Thus, 1% of not obese non-smokers had died.
Combined table:
The combined table is obtained by combining the corresponding values from Table 1 and Table 2.
Table 3
Obese | ||
Death | Yes | No |
Yes | 16 | 63 |
No | 784 | 2,037 |
Total | 800 | 2,100 |
Percentage of non-obese smokers who died is given below:
Thus, 3% of non-obese smokers had died.
Percentage of obese smokers who died is given below:
Thus, 2% of obese smokers had died.
The percentage of deaths for smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 4% | 3.8% |
The percentage of deaths for non-smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 1.3% | 1% |
The percentage of deaths under combined table is given below:
Obese | ||
Death | Yes | No |
Yes | 2% | 3% |
Justification:
Simpson paradox:
Conclusion drawn from aggregated table might go wrong by drawing conclusions from the individual tables.
Thus, the scenario of Simpson’s paradox is stated. The individual tables tell that obese people are more prone to death, but the combined table tells that non-obese people who are smoking are more prone to death.
Want to see more full solutions like this?
Chapter 6 Solutions
BASIC PRAC OF STATISTICS+LAUNCHPAD+REE
- F Make a box plot from the five-number summary: 100, 105, 120, 135, 140. harrow_forward14 Is the standard deviation affected by skewed data? If so, how? foldarrow_forwardFrequency 15 Suppose that your friend believes his gambling partner plays with a loaded die (not fair). He shows you a graph of the outcomes of the games played with this die (see the following figure). Based on this graph, do you agree with this person? Why or why not? 65 Single Die Outcomes: Graph 1 60 55 50 45 40 1 2 3 4 Outcome 55 6arrow_forward
- lie y H 16 The first month's telephone bills for new customers of a certain phone company are shown in the following figure. The histogram showing the bills is misleading, however. Explain why, and suggest a solution. Frequency 140 120 100 80 60 40 20 0 0 20 40 60 80 Telephone Bill ($) 100 120arrow_forward25 ptical rule applies because t Does the empirical rule apply to the data set shown in the following figure? Explain. 2 6 5 Frequency 3 сл 2 1 0 2 4 6 8 00arrow_forward24 Line graphs typically connect the dots that represent the data values over time. If the time increments between the dots are large, explain why the line graph can be somewhat misleading.arrow_forward
- 17 Make a box plot from the five-number summary: 3, 4, 7, 16, 17. 992) waarrow_forward12 10 - 8 6 4 29 0 Interpret the shape, center and spread of the following box plot. brill smo slob.nl bagharrow_forwardSuppose that a driver's test has a mean score of 7 (out of 10 points) and standard deviation 0.5. a. Explain why you can reasonably assume that the data set of the test scores is mound-shaped. b. For the drivers taking this particular test, where should 68 percent of them score? c. Where should 95 percent of them score? d. Where should 99.7 percent of them score? Sarrow_forward
- 13 Can the mean of a data set be higher than most of the values in the set? If so, how? Can the median of a set be higher than most of the values? If so, how? srit to estaarrow_forwardA random variable X takes values 0 and 1 with probabilities q and p, respectively, with q+p=1. find the moment generating function of X and show that all the moments about the origin equal p. (Note- Please include as much detailed solution/steps in the solution to understand, Thank you!)arrow_forward1 (Expected Shortfall) Suppose the price of an asset Pt follows a normal random walk, i.e., Pt = Po+r₁ + ... + rt with r₁, r2,... being IID N(μ, o²). Po+r1+. ⚫ Suppose the VaR of rt is VaRq(rt) at level q, find the VaR of the price in T days, i.e., VaRq(Pt – Pt–T). - • If ESq(rt) = A, find ES₁(Pt – Pt–T).arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman