5.2. Give the estimated logistic regression decided upon and give the odds ratios for regression coefficients b₁ and b₂ of this regression.

Please help with 5.2 only.
Question 5
5.1. A study was conducted to ascertain the effectiveness of a treatment of diseased plants. A recorded
value of Y = 1 indicates that the treatment is effective and a value of Y = 0 is recorded when no
effect is noticed. The outcome of the effectiveness of the treatment considers the treatment
dosage, the above-ground biomass, the rate of growth and the height of a plant, recorded as 1 if
above average height and 0 if below average height. The following partial data set is
provided.

EFFECTS (Y)   DOSAGE (X₁)   BIOMASS (X₂)   RATE (X₃)   HEIGHT (X₄)
[Table values were scrambled in extraction and the row alignment is not recoverable.
Values as extracted: 0.2138, 0.0525, 1.1, 0.0264, 1.2, 0.0920, 1.3, 1, 0, 0.3907, 0.2138;
HEIGHT column: 0, 1, 1.]

The output for running the full logistic regression model is given below:

Coefficients   Estimate   Std. Error   z value   Pr(>|z|)
(Intercept)     -1.23       0.6795     -1.653     0.0984
HEIGHT           0.31       0.6821      0.456     0.6484
BIOMASS          0.33       5.9374      2.594     0.0095
RATE             0.34       2.4112      3.167     0.0076
DOSAGE          -1.86       2.1420     -0.866     0.3864
Log Likelihood: -2.579

The output for the reduced model is given below:

Coefficients   Estimate   Std. Error   z value   Pr(>|z|)
(Intercept)     -1.25       0.4129     -2.488     0.0144
BIOMASS          0.21       4.7390      2.572     0.0101
RATE             0.41       2.4223      3.161     0.0076
Log Likelihood: -4.246

Answer the following questions.
5.1.1. State the full and reduced logistic regression models.
5.1.2. Give a complete and formal test to establish whether or not predictors HEIGHT and
DOSAGE are necessary in the model. Test at α = 0.05.
5.2. Give the estimated logistic regression decided upon and give the odds ratios for regression
coefficients b₁ and b₂ of this regression.
5.3. Use the estimated logistic regression from Question 5.2 to find the probability of the treatment
being effective if the amount of biomass above ground is 3 grams and the rate of growth is 1.3.
Interpret the result.
5.4. To establish whether RATE is indeed needed to predict the effectiveness of a treatment, a formal
test can be used. Give the name of this test, state the hypothesis, test statistic formula and the
decision rule.
5.5. Logistic regression's power lies also in the ability to predict a new observation's class. In order to
do that, a cut-off point for the different classes must be established. In this study, an effective
treatment of a diseased plant was recorded as (Y = 1) and if not, it was recorded as (Y = 0).
The following table shows the predictions by the classification method.

                              Predicted
True Classification     Y = 0     Y = 1     Total
Y = 0                     45        17        62
Y = 1                     10        28        38

5.5.1. The above table was created based on the following rule: Predict 1 if π̂ > 0.6 and predict
0 if π̂ < 0.6. Explain this rule.
5.5.2. Discuss the above table. Also give the error prediction rate.
5.5.3. Give the sensitivity and 1 - specificity values.
5.5.4. What are the motivations for using a receiver operating characteristic (ROC) curve
and the AUC, area under the curve, in classification? Would you consider the
classification based on the rule in Question 5.5.1 as good enough based on the values in
Question 5.5.3?
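
For 5.2, the arithmetic can be sketched in a few lines of Python. This is a rough sketch rather than a model answer, and it rests on two assumptions that are not stated in the question: that "decided upon" refers to the outcome of the likelihood-ratio test in 5.1.2, and that b₁ and b₂ are the BIOMASS and RATE coefficients of the reduced model. The function names are made up for illustration. The test has 2 degrees of freedom because two predictors (HEIGHT and DOSAGE) are dropped, and for exactly 2 degrees of freedom the chi-square upper-tail probability is exp(-x/2), so only the standard library is needed.

```python
import math

# Values copied from the regression output in the question.
LOGLIK_FULL = -2.579     # full model: HEIGHT, BIOMASS, RATE, DOSAGE
LOGLIK_REDUCED = -4.246  # reduced model: BIOMASS, RATE
# Assumption: b1 and b2 are the reduced-model slopes.
REDUCED_COEFS = {"BIOMASS": 0.21, "RATE": 0.41}


def lr_statistic(loglik_reduced, loglik_full):
    """Likelihood-ratio statistic G = -2 * (LL_reduced - LL_full)."""
    return -2.0 * (loglik_reduced - loglik_full)


def chi2_sf_df2(x):
    """Upper-tail probability of a chi-square variable with 2 degrees of freedom.

    Valid only for 2 degrees of freedom, where the tail is exp(-x / 2).
    """
    return math.exp(-x / 2.0)


def odds_ratios(coefs):
    """Odds ratio exp(b) for each coefficient b."""
    return {name: math.exp(b) for name, b in coefs.items()}


g_stat = lr_statistic(LOGLIK_REDUCED, LOGLIK_FULL)
p_value = chi2_sf_df2(g_stat)  # 2 df: HEIGHT and DOSAGE are dropped
ratios = odds_ratios(REDUCED_COEFS)

print(f"G = {g_stat:.3f}, p-value = {p_value:.4f}")
for name, ratio in ratios.items():
    print(f"odds ratio for {name}: {ratio:.4f}")
```

With the numbers given, G is about 3.334, which is below the chi-square critical value of 5.991 (2 degrees of freedom, α = 0.05), so HEIGHT and DOSAGE would not be judged necessary and the reduced model would be kept: logit(π̂) = -1.25 + 0.21 BIOMASS + 0.41 RATE. Under that reading, the odds ratios are roughly exp(0.21) ≈ 1.23 for BIOMASS and exp(0.41) ≈ 1.51 for RATE, meaning a one-unit increase in the predictor multiplies the odds of an effective treatment by that factor when the other predictor is held fixed.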