The two-way table for the given situation may vary. One of the possible answers is given below:
To give: The two-way table for the given situation.
To show: That obese smokers and obese non-smokers are more likely to die earlier than their non-obese counterparts.
To check: Whether Simpson’s paradox holds for the given situation.

Answer to Problem 6.26E
The two-way table for smokers is given below:
Death | Obese: Yes | Obese: No |
Yes | 8 | 57 |
No | 192 | 1,443 |
Total | 200 | 1,500 |
The two-way table for non-smokers is given below:
Death | Obese: Yes | Obese: No |
Yes | 8 | 6 |
No | 592 | 594 |
Total | 600 | 600 |
The tables show that 4% of obese smokers and 1.3% of obese non-smokers died, compared with only 3.8% of non-obese smokers and 1% of non-obese non-smokers. Within each smoking group, obese people were therefore more likely to die.
Simpson’s paradox holds for the given situation: each individual table shows that obese people are more likely to die, but the combined table shows the reverse, with 3% of non-obese people dying compared with 2% of obese people.
Explanation of Solution
Given info:
A two-way table of obesity (yes or no) by death (yes or no) can be constructed separately for smokers and for non-smokers. The two tables are then combined into a single two-way table of obesity by death.
Calculation:
The two-way table for smokers is constructed by taking death as a row variable and obese as a column variable.
The two-way table for smokers is given below:
Table 1
Death | Obese: Yes | Obese: No |
Yes | 8 | 57 |
No | 192 | 1,443 |
Total | 200 | 1,500 |
The two-way table for non-smokers is given below:
Table 2
Death | Obese: Yes | Obese: No |
Yes | 8 | 6 |
No | 592 | 594 |
Total | 600 | 600 |
Percentage of obese smokers who died is given below:
(8 / 200) × 100 = 4%
Thus, 4% of obese smokers had died.
Percentage of obese non-smokers who died is given below:
(8 / 600) × 100 ≈ 1.3%
Thus, 1.3% of obese non-smokers had died.
Percentage of non-obese smokers who died is given below:
(57 / 1,500) × 100 = 3.8%
Thus, 3.8% of non-obese smokers had died.
Percentage of non-obese non-smokers who died is given below:
(6 / 600) × 100 = 1%
Thus, 1% of non-obese non-smokers had died.
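The four percentages above can be reproduced with a short Python sketch. The counts are copied from Table 1 and Table 2; the names `tables`, `death_rate`, and `rates` are illustrative and not part of the original solution.

```python
# Deaths and group totals copied from Table 1 (smokers) and Table 2 (non-smokers).
# Each entry is (number who died, column total).
tables = {
    "smokers": {"obese": (8, 200), "not obese": (57, 1500)},
    "non-smokers": {"obese": (8, 600), "not obese": (6, 600)},
}


def death_rate(deaths, total):
    """Return the percentage of a group that died."""
    return 100 * deaths / total


# Percentage who died for every (smoking group, obesity status) pair.
rates = {
    (group, status): death_rate(deaths, total)
    for group, cells in tables.items()
    for status, (deaths, total) in cells.items()
}

for (group, status), pct in rates.items():
    print(f"{status} {group}: {pct:.1f}%")
```

Each percentage is a column percentage: deaths in a column divided by that column's total, so the two obesity groups are compared on equal footing within each smoking group.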
Combined table:
The combined table is obtained by combining the corresponding values from Table 1 and Table 2.
Table 3
Death | Obese: Yes | Obese: No |
Yes | 16 | 63 |
No | 784 | 2,037 |
Total | 800 | 2,100 |
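Percentage of non-obese people (smokers and non-smokers combined) who died is given below:
(63 / 2,100) × 100 = 3%
Thus, 3% of non-obese people had died.
Percentage of obese people (smokers and non-smokers combined) who died is given below:
(16 / 800) × 100 = 2%
Thus, 2% of obese people had died.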
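As a rough check, the combined table and its two percentages can be rebuilt by adding the corresponding cells of Table 1 and Table 2. This is a minimal sketch; the dictionary layout and the names `smokers`, `non_smokers`, `combined`, and `combined_rates` are assumptions made for illustration.

```python
# Cell counts copied from Table 1 (smokers) and Table 2 (non-smokers).
smokers = {
    "obese": {"died": 8, "survived": 192},
    "not obese": {"died": 57, "survived": 1443},
}
non_smokers = {
    "obese": {"died": 8, "survived": 592},
    "not obese": {"died": 6, "survived": 594},
}

# Table 3: add the matching cells of the two tables.
combined = {
    status: {
        outcome: smokers[status][outcome] + non_smokers[status][outcome]
        for outcome in ("died", "survived")
    }
    for status in ("obese", "not obese")
}

# Percentage who died in each column of the combined table.
combined_rates = {
    status: 100 * cells["died"] / (cells["died"] + cells["survived"])
    for status, cells in combined.items()
}

print(combined)
print(combined_rates)
```

The combined percentages describe all obese and all non-obese people, regardless of smoking status, which is why they differ from the within-group percentages.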
The percentage of deaths for smokers is given below:
Death | Obese: Yes | Obese: No |
Yes | 4% | 3.8% |
The percentage of deaths for non-smokers is given below:
Death | Obese: Yes | Obese: No |
Yes | 1.3% | 1% |
The percentage of deaths in the combined table is given below:
Death | Obese: Yes | Obese: No |
Yes | 2% | 3% |
Justification:
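Simpson’s paradox:
An association that holds within each of several groups can reverse direction when the data from the groups are combined into a single aggregated table. Conclusions drawn from the aggregated table can therefore contradict those drawn from the individual tables.
Thus, the scenario illustrates Simpson’s paradox. The individual tables show that obese people are more likely to die among both smokers and non-smokers, but the combined table shows that non-obese people are more likely to die (3% versus 2%).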
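The reversal can be checked mechanically: compare the obese and non-obese death rates within each smoking group, then again after pooling the groups. The sketch below uses the counts from Table 1 and Table 2; the helper `obese_worse` and the other names are hypothetical and chosen only for this illustration.

```python
# (deaths, total) per obesity status, copied from Table 1 and Table 2.
strata = {
    "smokers": {"obese": (8, 200), "not obese": (57, 1500)},
    "non-smokers": {"obese": (8, 600), "not obese": (6, 600)},
}


def obese_worse(cells):
    """True if the obese death rate exceeds the non-obese death rate."""
    obese_deaths, obese_total = cells["obese"]
    other_deaths, other_total = cells["not obese"]
    return obese_deaths / obese_total > other_deaths / other_total


# Direction of the association within each smoking group.
within = {group: obese_worse(cells) for group, cells in strata.items()}

# Pool the groups by summing deaths and totals for each obesity status.
pooled = {
    status: tuple(sum(strata[group][status][i] for group in strata) for i in (0, 1))
    for status in ("obese", "not obese")
}

# Simpson's paradox here: obese worse in every group, but not after pooling.
simpsons_paradox = all(within.values()) and not obese_worse(pooled)

print(within)
print(pooled)
print(simpsons_paradox)
```

The counts also suggest why the direction flips: 1,500 of the 2,100 non-obese people are smokers, against only 200 of the 800 obese people, so the pooled non-obese rate is pulled up by the higher death rate among smokers.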
Chapter 6 Solutions
BASIC PRACTICE OF STATISTICS





