
Concept explainers
The two-way table for the given situation may vary. One of the possible answers is given below:
To give: The two-way for the given situation.
To show: The obese smokers and obese non-smokers who are more likely to die earlier than people who are not obese.
To check: Whether the Simpson’s paradox holds good for the given situation.

Answer to Problem 6.26E
The two-way table for smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 8 | 57 |
No | 192 | 1,443 |
Total | 200 | 1,500 |
The two-way table for non-smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 8 | 6 |
No | 592 | 594 |
Total | 600 | 600 |
The result tells that 4% of obese smokers had died, 1.3% of obese non-smokers had died. However, only 3.8% of not obese smokers had died and 1% of not obese non-smokers had died.
The Simpson’s paradox holds good for the given situation because the two individual tables tell that obese people are more prone to death but the combined table tells that non-obese people who are smoking are more prone to death.
Explanation of Solution
Given info:
Two-way table can be constructed using obese (yes or no) and death (yes or no) for smokers and non-smokers. Also, combine the two-way table of smokers and non-smokers with respect to obese and death.
Calculation:
The two-way table for smokers is constructed by taking death as a row variable and obese as a column variable.
The two-way table for smokers is given below:
Table 1
Obese | ||
Death | Yes | No |
Yes | 8 | 57 |
No | 192 | 1,443 |
Total | 200 | 1,500 |
The two-way table for non-smokers is given below:
Table 2
Obese | ||
Death | Yes | No |
Yes | 8 | 6 |
No | 592 | 594 |
Total | 600 | 600 |
Percentage of obese smokers who died is given below:
Thus, 4% of obese smokers had died.
Percentage of obese nonsmokers who died is given below:
Thus, 1.3% of obese non-smokers had died.
Percentage of not obese smokers who died is given below:
Thus, 3.8% of not obese smokers had died.
Percentage of non-smokers who are not obese died is given below:
Thus, 1% of not obese non-smokers had died.
Combined table:
The combined table is obtained by combining the corresponding values from Table 1 and Table 2.
Table 3
Obese | ||
Death | Yes | No |
Yes | 16 | 63 |
No | 784 | 2,037 |
Total | 800 | 2,100 |
Percentage of non-obese smokers who died is given below:
Thus, 3% of non-obese smokers had died.
Percentage of obese smokers who died is given below:
Thus, 2% of obese smokers had died.
The percentage of deaths for smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 4% | 3.8% |
The percentage of deaths for non-smokers is given below:
Obese | ||
Death | Yes | No |
Yes | 1.3% | 1% |
The percentage of deaths under combined table is given below:
Obese | ||
Death | Yes | No |
Yes | 2% | 3% |
Justification:
Simpson paradox:
Conclusion drawn from aggregated table might go wrong by drawing conclusions from the individual tables.
Thus, the scenario of Simpson’s paradox is stated. The individual tables tell that obese people are more prone to death, but the combined table tells that non-obese people who are smoking are more prone to death.
Want to see more full solutions like this?
Chapter 6 Solutions
EBK THE BASIC PRACTICE OF STATISTICS
- 3. A bag of Skittles contains five colors: red, orange, green, yellow, and purple. The probabilities of choosing each color are shown in the chart below. What is the probability of choosing first a red, then a purple, and then a green Skittle, replacing the candies in between picks? Color Probability Red 0.2299 Green 0.1908 Orange 0.2168 Yellow 0.1889 Purple 0.1736arrow_forwardName: Quiz A 5.3-5.4 Sex Female Male Total Happy 90 46 136 Healthy 20 13 33 Rich 10 31 41 Famous 0 8 8 Total 120 98 218 Use the following scenario for questions 1 & 2. One question on the Census at School survey asks students if they would prefer to be happy, healthy, rich, or famous. Students may only choose one of these responses. The two-way table summarizes the responses of 218 high school students from the United States by sex. Preferred status 1. Define event F as a female student and event R as rich. a. Find b. Find or c. Find and 2. Define event F as a female student and event R as rich. a. Find b. Find c. Using your results from a and b, are these events (female student and rich) independent? Use the following scenario for questions 3 & 4. At the end of a 5k race, runners are offered a donut or a banana. The event planner examined each runner's race bib and noted whether Age Less than 30 years old At least 30 years old Total Choice Donut Banana 52 54 106 5 72 77 Total 57 126…arrow_forwardI need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forward
- I need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forwardI need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forwardI need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forward
- I need help with this problem and an explanation of the solution for the image described below. (Statistics: Engineering Probabilities)arrow_forward3. Consider the following regression model: Yi Bo+B1x1 + = ···· + ßpxip + Єi, i = 1, . . ., n, where are i.i.d. ~ N (0,0²). (i) Give the MLE of ẞ and σ², where ẞ = (Bo, B₁,..., Bp)T. (ii) Derive explicitly the expressions of AIC and BIC for the above linear regression model, based on their general formulae.arrow_forwardHow does the width of prediction intervals for ARMA(p,q) models change as the forecast horizon increases? Grows to infinity at a square root rate Depends on the model parameters Converges to a fixed value Grows to infinity at a linear ratearrow_forward
- Consider the AR(3) model X₁ = 0.6Xt-1 − 0.4Xt-2 +0.1Xt-3. What is the value of the PACF at lag 2? 0.6 Not enough information None of these values 0.1 -0.4 이arrow_forwardSuppose you are gambling on a roulette wheel. Each time the wheel is spun, the result is one of the outcomes 0, 1, and so on through 36. Of these outcomes, 18 are red, 18 are black, and 1 is green. On each spin you bet $5 that a red outcome will occur and $1 that the green outcome will occur. If red occurs, you win a net $4. (You win $10 from red and nothing from green.) If green occurs, you win a net $24. (You win $30 from green and nothing from red.) If black occurs, you lose everything you bet for a loss of $6. a. Use simulation to generate 1,000 plays from this strategy. Each play should indicate the net amount won or lost. Then, based on these outcomes, calculate a 95% confidence interval for the total net amount won or lost from 1,000 plays of the game. (Round your answers to two decimal places and if your answer is negative value, enter "minus" sign.) I worked out the Upper Limit, but I can't seem to arrive at the correct answer for the Lower Limit. What is the Lower Limit?…arrow_forwardLet us suppose we have some article reported on a study of potential sources of injury to equine veterinarians conducted at a university veterinary hospital. Forces on the hand were measured for several common activities that veterinarians engage in when examining or treating horses. We will consider the forces on the hands for two tasks, lifting and using ultrasound. Assume that both sample sizes are 6, the sample mean force for lifting was 6.2 pounds with standard deviation 1.5 pounds, and the sample mean force for using ultrasound was 6.4 pounds with standard deviation 0.3 pounds. Assume that the standard deviations are known. Suppose that you wanted to detect a true difference in mean force of 0.25 pounds on the hands for these two activities. Under the null hypothesis, 40 0. What level of type II error would you recommend here? = Round your answer to four decimal places (e.g. 98.7654). Use α = 0.05. β = 0.0594 What sample size would be required? Assume the sample sizes are to be…arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman





