Introduction To Statistics And Data Analysis
Introduction To Statistics And Data Analysis
6th Edition
ISBN: 9781337793612
Author: PECK, Roxy.
Publisher: Cengage Learning,
bartleby

Videos

Textbook Question
Book Icon
Chapter 6.4, Problem 52E

The paper “Good for Women, Good for Men, Bad for People: Simpson’s Paradox and the Importance of Sex-Specific Analysis in Observational Studies” (Journal of Women’s Health and Gender-Based Medicine [2001]: 867-872) described the results of a medical study in which one treatment was shown to be better for men and better for women than a competing treatment. However, if the data for men and women are combined, it appears as though the competing treatment is better.

To see how this can happen, consider the accompanying data tables constructed from information in the paper. Subjects in the study were given either Treatment A or Treatment B, and survival was noted. Let S be the event that a patient selected at random survives, A be the event that a patient selected at random received Treatment A, and B be the event that a patient selected at random received Treatment B.

  1. a. The following table summarizes data for men and women combined:

Chapter 6.4, Problem 52E, The paper Good for Women, Good for Men, Bad for People: Simpsons Paradox and the Importance of , example  1

  1. i. Find P(S).
  2. ii. Find P(S|A).
  3. iii. Find P(S|B).
  4. iv. Which treatment appears to be better?
  5. b. Now consider the summary data for the men who participated in the study:

Chapter 6.4, Problem 52E, The paper Good for Women, Good for Men, Bad for People: Simpsons Paradox and the Importance of , example  2

  1. v. Find P(S).
  2. vi. Find P(S|A).
  3. vii. Find P(S|B).
  4. viii. Which treatment appears to be better?
  5. c. Now consider the summary data for the women who participated in the study:

Chapter 6.4, Problem 52E, The paper Good for Women, Good for Men, Bad for People: Simpsons Paradox and the Importance of , example  3

  1. ix. Find P(S). looks like Treatment B is better. This is an
  2. x. Find P(S|A).
  3. xi. Find P(S|B).
  4. xii. Which treatment appears to be better?
  5. d. You should have noticed from Parts (b) and (c) that for both men and women, Treatment A appears to be better. But in Part (a), when the data for men and women are combined, it looks like Treatment B is better. This is an example of what is called Simpson’s paradox. Write a brief explanation of why this apparent inconsistency occurs for this data set. (Hint: Do men and women respond similarly to the two treatments?)

a.

Expert Solution
Check Mark
To determine

i. Compute P(S).

ii. Obtain P(S|A).

iii. Calculate P(S|B).

iv. Find the better treatment.

Answer to Problem 52E

i. The value of P(S) is 0.76.

ii. The value of P(S|A) is 0.717.

iii. The value of P(S|B) is 0.803.

iv. Treatment B is better than Treatment A.

Explanation of Solution

Calculation:

The given information is the summary table of the survey. Event S denotes the event that a patient selected at random and survives, event A denotes that a patient selected at random received Treatment A, and B denotes the event that a patient selected at random and received Treatment B.

i.

The probability of any event A is given below:

P(A)=Number of outcomes in ATotal number of outcomes in the samplespace

The total number of randomly selected patient is 600.

The total number of patient selected at random survives is 456.

The probability of a randomly selected patients and who survive is calculated as follows:

P(S)=456600=0.76

Thus, the probability of a randomly selected patients  who survive is 0.76.

ii.

Conditional rule:

The formula for probability of E given F is, P(E|F)=n(EF)n(F).

The total number of patient selected at random and received Treatment A is 300.

The number of patient selected at random and received Treatment A and survive is 215.

The probability that the selected patients at random received Treatment A, given that the patient selected at random survives. It is calculated as follows:

P(S|A)=215300=0.717

Thus, the value of P(S|A) is equal to 0.717.

iii.

The total number of patient selected at random and received Treatment B is 300.

The number of patient selected at random that received Treatment B and survive is 241.

The probability that the selected patient at random received Treatment B, given that the patient selected at random survives. It is calculated as follows:

P(S|B)=241300=0.803

Thus, the value of P(S|B) is equal to 0.803.

iv.

The probability of patient who received Treatment B survived more than that of Treatment A.

Thus, Treatment B is better than Treatment A.

b.

Expert Solution
Check Mark
To determine

i. Compute P(S).

ii. Obtain P(S|A).

iii. Calculate P(S|B).

iv. Find the better treatment.

Answer to Problem 52E

i. The value of P(S) is 0.583.

ii. The value of P(S|A) is 0.6.

iii. The value of P(S|B) is 0.5.

iv. Treatment A is better than Treatment B.

Explanation of Solution

Calculation:

The given information is the summary table of the survey.

i.

The total number of randomly selected patient is 240.

The total number of patient selected at random and survives is 140.

The probability of a randomly selected patients who survive is calculated as follows:

P(S)=140240=0.583

Thus, the probability of a randomly selected patients who survive is 0.583.

ii.

Conditional rule:

The formula for probability of E given F is, P(E|F)=n(EF)n(F).

The total number of patients selected at random that received Treatment A is 200.

The number of patient selected at random that received Treatment A and survives is 120.

The probability that the selected patient at random received Treatment A, given that the patient selected at random survives. It is calculated as follows:

P(S|A)=120200=0.6

Thus, the value of P(S|A) is equal to 0.6.

iii.

The total number of patients selected at random that received Treatment B is 40.

The number of patient selected at random that received Treatment B and survive is 20.

The probability that the selected patient at random received Treatment B, given that the patient selected at random survives. It is calculated as follows:

P(S|B)=2040=0.5

Thus, the value of P(S|B) is equal to 0.5.

iv.

The probability of patient who received Treatment A survived more than that of Treatment B.

Thus, Treatment A is better than Treatment B.

c.

Expert Solution
Check Mark
To determine

i. Compute P(S).

ii. Obtain P(S|A).

iii. Calculate P(S|B).

iv. Find the better treatment.

Answer to Problem 52E

i. The value of P(S) is 0.878.

ii. The value of P(S|A) is 0.95.

iii. The value of P(S|B) is 0.85.

iv. Treatment A is better than Treatment B.

Explanation of Solution

Calculation:

The given information is the summary table of the survey.

i.

The total number of randomly selected patient is 360.

The total number of patient selected at random that survive is 316.

The probability of a randomly selected patients who survive is calculated as follows:

P(S)=316360=0.878

Thus, the probability of a randomly selected patients who survive is 0.878.

ii.

Conditional rule:

The formula for probability of E given F is, P(E|F)=n(EF)n(F).

The total number of patient selected at random that received Treatment A is 100.

The number of patient selected at random that received Treatment A and survive is 95.

The probability that the selected patients at random received Treatment A, given that the patient selected at random survives. It is calculated as follows:

P(S|A)=95100=0.95

Thus, the value of P(S|A) is equal to 0.95.

iii.

The total number of patient selected at random that received Treatment B is 260.

The number of patient selected at random that received Treatment B and survive is 221.

The probability that the selected patients at random received Treatment B, given that the patient selected at random survives. It is calculated as follows:

P(S|B)=221260=0.85

Thus, the value of P(S|B) is equal to 0.85.

iv.

The probability of patients who received Treatment A survived more than that of Treatment B.

Thus, Treatment A is better than Treatment B.

d.

Expert Solution
Check Mark
To determine

Explain the reason for the existence of apparent inconsistency in the data.

Explanation of Solution

From part (a), (b) and (c), it can be observed that Treatment A performs better than that of Treatment B, except part (a). In part (a), the data for men and women are combined. Thus, Treatment B performs better than that of Treatment A.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
Which one of the following statements is/are true or false regarding cohort and case-control studies? (I) Case-control studies are controlled experiments, whilst cohort studies are observational studies. (II) Case-control studies are better for rare diseases, compared to cohort studies.
Eighteen participants took part in an experimental study that consisted of three levels: normal classroom teaching, normal classroom teaching with tutors, and an experimental teaching method to increase student scores on a statistics exam. Your data set includes the following variables: Treatment (1 - normal classroom teaching; 2 - normal classroom teaching with tutors; 3 - experimental teaching method) Gender (1 – female; 2 – male) Pre-test score (higher scores represent better student performance on a statistics exam) Post-test score (higher scores represent better student performance on a statistics exam) Stanford-Binet Intelligence Test scores (IQ) The number of hours the student studied per week. Do test scores at pre-treatment differ from test scores at post-treatment? Note, this analysis should be conducted ignoring the treatment group they are in. Select an appropriate statistical test and state teh hullhypothesis.
Are nursing salaries in Tampa, Florida, lower than those in Dallas, Texas? Salary data show staff nurses in Tampa earn less than staff nurses in Dallas (The Tampa Tribune, January 15, 2007). Suppose that in a follow-up study of 40 staff nurses in Tampa and 50 staff nurses in Dallas you obtain the following results. Tampa n1  = 40    X1  = $56,100  S1 = $6000 Dallas  n2  = 50     X2  = $59,400  S2  = $7000 Complete the confidence interval of the data set

Chapter 6 Solutions

Introduction To Statistics And Data Analysis

Ch. 6.1 - Refer to the previous exercise and now suppose...Ch. 6.1 - A family consisting of three peopleP1, P2, and...Ch. 6.1 - Prob. 13ECh. 6.1 - An engineering construction firm is currently...Ch. 6.1 - For the events described in the previous exercise,...Ch. 6.1 - Consider a Venn diagram picturing two events A and...Ch. 6.3 - A large department store offers online ordering....Ch. 6.3 - Consider the chance experiment described in the...Ch. 6.3 - The manager of an online music store has kept...Ch. 6.3 - Consider the chance experiment described in the...Ch. 6.3 - A bookstore sells two types of books (fiction and...Ch. 6.3 - Consider the chance experiment described in the...Ch. 6.3 - Medical insurance statuscovered (C) or not covered...Ch. 6.3 - Roulette is a game of chance that involves...Ch. 6.3 - Phoenix is a hub for a large airline. Suppose that...Ch. 6.3 - A customer satisfaction survey is planned. The...Ch. 6.3 - A professor assigns five problems to be completed...Ch. 6.3 - Refer to the following information on full-term...Ch. 6.3 - The report Teens, Social Media Technology...Ch. 6.3 - According to The Chronicle for Higher Education...Ch. 6.3 - The same issue of The Chronicle for Higher...Ch. 6.3 - A deck of 52 playing cards is mixed well, and 5...Ch. 6.3 - After all students have left the classroom, a...Ch. 6.3 - Use the information given in the previous exercise...Ch. 6.3 - The student council for a school of science and...Ch. 6.3 - A student placement center has requests from five...Ch. 6.3 - Suppose that a six-sided die is weighted so that...Ch. 6.4 - Two different airlines have a flight from Los...Ch. 6.4 - The article Chances Are You Know Someone with a...Ch. 6.4 - The accompanying data are from the article...Ch. 6.4 - Using the probabilities calculated in the previous...Ch. 6.4 - The following graphical display is similar to one...Ch. 6.4 - The article Americans Growing More Concerned About...Ch. 6.4 - The events E and T are defined as E = the event...Ch. 6.4 - The newspaper article Folic Acid Might Reduce Risk...Ch. 6.4 - Suppose that an individual is randomly selected...Ch. 6.4 - Is ultrasound a reliable method for determining...Ch. 6.4 - The paper Accuracy and Reliability of...Ch. 6.4 - The report 2015 Utah Seat Belt Use Survey (Utah...Ch. 6.4 - The National Highway Traffic Safety Administration...Ch. 6.4 - Use the information given in the previous exercise...Ch. 6.4 - The paper Good for Women, Good for Men, Bad for...Ch. 6.5 - Many fire stations handle emergency calls for...Ch. 6.5 - Refer to the information given in the previous...Ch. 6.5 - The paper Predictors of Complementary Therapy Use...Ch. 6.5 - The report TV Drama/Comedy Viewers and Health...Ch. 6.5 - The report Great Jobs, Great Lives. The...Ch. 6.5 - In a small city, approximately 15% of those...Ch. 6.5 - Jeanie is a bit forgetful, and if she doesnt make...Ch. 6.5 - Consider a system consisting of four components,...Ch. 6.5 - Consider the system described in the previous...Ch. 6.5 - In a January 2016 Harris Poll, each of 2252...Ch. 6.5 - Consider the following events: T = event that a...Ch. 6.5 - The following case study was reported in the...Ch. 6.5 - Three friends (A, B, and C) will participate in a...Ch. 6.5 - A store sells two different brands of dishwasher...Ch. 6.5 - The National Public Radio show Car Talk used to...Ch. 6.5 - Refer to the previous exercise. Suppose now that...Ch. 6.6 - A university has 10 vehicles available for use by...Ch. 6.6 - Prob. 70ECh. 6.6 - There are two traffic lights on Darlenes route...Ch. 6.6 - Let F denote the event that a randomly selected...Ch. 6.6 - According to a July 31, 2013 posting on cnn.com, a...Ch. 6.6 - Suppose that Blue Cab operates 15% of the taxis in...Ch. 6.6 - A large cable company reports the following: 80%...Ch. 6.6 - Refer to the information given in the previous...Ch. 6.6 - The authors of the paper Do Physicians Know When...Ch. 6.6 - A study of how people are using online services...Ch. 6.6 - The report Twitter in Higher Education: Usage...Ch. 6.6 - Use the information given in the previous exercise...Ch. 6.6 - Prob. 81ECh. 6.6 - Use the table of estimated probabilities from the...Ch. 6.6 - Suppose that we define the following events: C =...Ch. 6.6 - The article U.S. Investors Split Between Digital...Ch. 6.6 - Prob. 85ECh. 6.6 - The paper referenced in the previous exercise also...Ch. 6.6 - In an article that appears on the web site of the...Ch. 6.7 - The report Airline Quality Rating 2016...Ch. 6.7 - Five hundred first-year students at a state...Ch. 6.7 - Use the information given in the previous exercise...Ch. 6.7 - The table given below describes (approximately)...Ch. 6.7 - On April 1, 2010, the Bureau of the Census in the...Ch. 6.7 - Refer to the information given in the previous...Ch. 6.7 - Refer to the information given in Exercises 6.92...Ch. 6 - False positive results are not uncommon with...Ch. 6 - A company uses three different assembly linesA1,...Ch. 6 - Consider the following information about...Ch. 6 - Use the information given in the previous exercise...Ch. 6 - Use the information given in exercise 6.102 to...Ch. 6 - Prob. 105CRCh. 6 - The following table summarizing data on smoking...Ch. 6 - A study of the impact of seeking a second opinion...Ch. 6 - A company sends 40% of its overnight mail parcels...Ch. 6 - Prob. 109CRCh. 6 - Prob. 110CRCh. 6 - In a school machine shop, 60% of all machine...Ch. 6 - There are five faculty members in a certain...Ch. 6 - The general addition rule for three events states...Ch. 6 - A theater complex is currently showing four...Ch. 6 - Prob. 117CRCh. 6 - Suppose that a box contains 25 light bulbs, of...Ch. 6 - Return to Exercise 6.118, and suppose that 4 bulbs...Ch. 6 - A transmitter is sending a message using a binary...
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
SEE MORE QUESTIONS
Recommended textbooks for you
Text book image
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Text book image
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
Text book image
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Hypothesis Testing using Confidence Interval Approach; Author: BUM2413 Applied Statistics UMP;https://www.youtube.com/watch?v=Hq1l3e9pLyY;License: Standard YouTube License, CC-BY
Hypothesis Testing - Difference of Two Means - Student's -Distribution & Normal Distribution; Author: The Organic Chemistry Tutor;https://www.youtube.com/watch?v=UcZwyzwWU7o;License: Standard Youtube License