Question

In this question, we formulate a measure that quantifies the level of association between two categorical variables. This measure is used in a statistical test, the chi-square test, which assesses whether two categorical variables are associated. The question also motivates the concept of independence and connects it back to what we have learnt in the course.

Let's revisit the example from the course: how is diet type (high cholesterol versus low cholesterol) related to the risk of coronary heart disease? Data for 23 individuals:


                          Heart Disease    No Heart Disease    Total
High Cholesterol Diet     (i) 11           (iii) 4             15
Low Cholesterol Diet      (ii) 2           (iv) 6              8
Total                     13               10                  23



From the table, the probability of having heart disease is 13/23 and the probability of having a high cholesterol diet is 15/23. Similarly, the probability of not having heart disease is 10/23 and the probability of having a low cholesterol diet is 8/23.
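The marginal probabilities above can be reproduced with a short sketch. The dictionary layout and the helper name `marginal` are illustrative choices, not part of the original question; exact fractions are used so that no rounding is involved.

```python
from fractions import Fraction

# Observed counts from the table, keyed by (diet, outcome).
# The key labels are hypothetical names chosen for this sketch.
observed = {
    ("high", "disease"): 11,
    ("high", "no_disease"): 4,
    ("low", "disease"): 2,
    ("low", "no_disease"): 6,
}
total = sum(observed.values())  # 23 individuals


def marginal(position, level):
    """Share of all individuals whose key has `level` at index `position`."""
    count = sum(n for key, n in observed.items() if key[position] == level)
    return Fraction(count, total)


p_disease = marginal(1, "disease")
p_no_disease = marginal(1, "no_disease")
p_high = marginal(0, "high")
p_low = marginal(0, "low")

print(p_disease, p_no_disease, p_high, p_low)  # 13/23 10/23 15/23 8/23
```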

Part a
If there is no association between the two variables (i.e., the two are independent), the probability of having both heart disease and a high cholesterol diet is the product of the two marginal probabilities: [Round to four decimal places].


Part b
If the two variables are independent, we should expect the number of individuals with heart disease and a high cholesterol diet to be the probability in Part a multiplied by the 23 individuals, which is: [Round to two decimal places].
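As a rough check of Parts a and b, the following sketch applies the product rule for independent events, P(A and B) = P(A) × P(B), to the marginal probabilities 13/23 and 15/23 and then scales by the sample size. The variable names are illustrative assumptions.

```python
from fractions import Fraction

n = 23                       # total number of individuals
p_disease = Fraction(13, n)  # P(heart disease)
p_high = Fraction(15, n)     # P(high cholesterol diet)

# Part a: under independence the joint probability is the product
# of the two marginal probabilities.
p_joint = p_disease * p_high

# Part b: expected count for cell (i) is the joint probability times n.
expected_cell_i = p_joint * n

print(round(float(p_joint), 4))          # 0.3686
print(round(float(expected_cell_i), 2))  # 8.48
```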


Part c
Repeating Part b for the remaining cells, the expected numbers of individuals for cells (ii), (iii), and (iv) of the table are 4.52, 6.52, and 3.48, respectively.
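The same calculation can be sketched for every cell at once. Multiplying n by (row total / n) and (column total / n) simplifies to row total × column total / n, which is the form used below. The dictionary names are illustrative assumptions.

```python
# Row and column totals taken from the table.
row_totals = {"high": 15, "low": 8}
col_totals = {"disease": 13, "no_disease": 10}
n = 23

# Expected count under independence: row total * column total / n.
expected = {
    (diet, outcome): r * c / n
    for diet, r in row_totals.items()
    for outcome, c in col_totals.items()
}

for cell, value in expected.items():
    print(cell, round(value, 2))
# ('high', 'disease') 8.48       cell (i)
# ('high', 'no_disease') 6.52    cell (iii)
# ('low', 'disease') 4.52        cell (ii)
# ('low', 'no_disease') 3.48     cell (iv)
```

The expected counts sum to 23, the same total as the observed counts.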

The following measure, called the chi-square test statistic:

χ² = ∑ (Observed − Expected)² / Expected

quantifies the level of association between two categorical variables. The symbol ∑ denotes a sum. "Observed" refers to the observed counts in the table, while "Expected" refers to the counts expected if the two variables are independent. The sum is taken over all cells (i) to (iv) of the table.

If there is no association, the observed counts should not differ much from the expected counts, which results in a relatively small value of χ². A large χ² value indicates disagreement between the expected and observed counts, which suggests that the assumption of independence does not hold and that the two variables are likely to be associated.

Compute χ². [Round to two decimal places].
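A minimal sketch of this computation follows, assuming the observed counts are stored as a nested list with diet type in rows and disease status in columns. It sums (Observed − Expected)² / Expected over the four cells, with each expected count computed as row total × column total / n.

```python
# Observed counts: rows are diet (high, low), columns are
# outcome (heart disease, no heart disease).
observed = [
    [11, 4],
    [2, 6],
]

row_totals = [sum(row) for row in observed]        # [15, 8]
col_totals = [sum(col) for col in zip(*observed)]  # [13, 10]
n = sum(row_totals)                                # 23

chi_square = 0.0
for i, row in enumerate(observed):
    for j, obs in enumerate(row):
        # Expected count for this cell if the variables are independent.
        exp = row_totals[i] * col_totals[j] / n
        chi_square += (obs - exp) ** 2 / exp

print(round(chi_square, 2))  # 4.96
```

If SciPy is available, `scipy.stats.chi2_contingency(observed, correction=False)` should report the same statistic; by default that function applies a continuity correction to 2×2 tables, which gives a smaller value than the formula above.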
