Workers at an aluminium plant are placed into categories based on their duration of exposure (in years) to aluminium disease and level of disease prevalence. Category 1: heavily diseased; category 2: lightly diseased; category 3: normal. There is a need to determine if there is a relationship between the duration of exposure and the categories.

Counts of workers by duration of exposure (years):

| Disease category | <1 | 1 to 5 | >5 |
|---|---|---|---|
| Heavily diseased | 10 | 8 | 23 |
| Lightly diseased | 9 | 19 | 11 |
| Normal | 70 | 136 | 206 |
Correlation
Correlation describes the relationship between two variables. It measures the degree to which the variables move in relation to each other. When two sets of data tend to change together, there is a correlation between them.
Linear Correlation
Linear correlation measures the strength and direction of a straight-line relationship between two numerical variables. In other words, it indicates how closely the two quantities are connected to one another. Correlation analysis is the study of how variables are related.
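As a rough illustration of linear correlation, the sketch below computes the Pearson correlation coefficient by hand. The function name `pearson_r` and the toy values are hypothetical and are not taken from the aluminium-plant data.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares of the deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical toy values, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)
print(f"r = {r:.3f}")
```

A value of r close to +1 or -1 suggests a strong linear relationship, while a value near 0 suggests little or no linear relationship.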
Regression Analysis
Regression analysis is a statistical method that estimates the relationship between a dependent variable and one or more independent variables. In simple terms, the dependent variable is called the outcome variable and the independent variables are called predictors. Regression analysis is one of the methods used to find trends in data. Each predictor provides information about the associated dependent variable for a particular outcome.
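To make the idea of a predictor and an outcome concrete, here is a minimal sketch of simple linear regression with one predictor, fitted by ordinary least squares. The function name `fit_line` and the toy values are hypothetical and are not taken from the aluminium-plant data.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for the model y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical toy values: predictor (independent variable) and outcome (dependent variable).
predictor = [1, 2, 3, 4, 5]
outcome = [2, 4, 5, 4, 5]
slope, intercept = fit_line(predictor, outcome)
print(f"outcome = {intercept:.2f} + {slope:.2f} * predictor")
```

The slope estimates how much the outcome changes, on average, for a one-unit increase in the predictor.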
Do the following:
- Explain the hypothesis test you would use in this analysis.
- Determine whether you can conclude that the proportions of workers in the various disease categories differ among exposure levels. First set up the null and alternative hypotheses, and state the significance level at which the test is carried out.
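
One reasonable choice for a two-way table of counts like this is Pearson's chi-square test of independence. Under that choice, the null hypothesis H0 is that disease category is independent of duration of exposure (the category proportions are the same at every exposure level), and the alternative H1 is that the proportions differ for at least one exposure level. The sketch below is not the page's own solution. It assumes a 5% significance level, since the question leaves the level open, and the names `chi_square_test` and `chi2_sf_dof4` are hypothetical helpers.

```python
from math import exp

# Observed counts from the question; columns are <1, 1 to 5, >5 years of exposure.
observed = [
    [10, 8, 23],     # heavily diseased
    [9, 19, 11],     # lightly diseased
    [70, 136, 206],  # normal
]


def chi_square_test(table):
    """Return (statistic, degrees of freedom, expected counts) for a two-way table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    # Expected count under independence: row total * column total / grand total.
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
    statistic = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, dof, expected


def chi2_sf_dof4(x):
    """Upper-tail probability of a chi-square variable with exactly 4 degrees of freedom."""
    return exp(-x / 2) * (1 + x / 2)


ALPHA = 0.05  # assumed significance level; the question does not fix one

statistic, dof, expected = chi_square_test(observed)
assert dof == 4  # the closed-form tail probability above is only valid for 4 dof
p_value = chi2_sf_dof4(statistic)
reject_null = p_value < ALPHA

print(f"chi-square = {statistic:.2f}, dof = {dof}, p-value = {p_value:.4f}")
print("Reject H0" if reject_null else "Fail to reject H0")
```

Under these assumptions the statistic comes out near 10.8 on 4 degrees of freedom, which exceeds the 5% critical value of 9.488, so H0 would be rejected: the data suggest that the proportions of workers in the disease categories differ among exposure levels. All expected counts are above 5, so the usual chi-square approximation is reasonable here.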