A university is applying classification methods in order to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100 classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set: Predicted Donation No Donation Donation 265 5,430 23 23,384 No Donation The following table lists some information on individual observations from the validation set: Actual Observation ID Actual Class No Donation No Donation Donation Donation (a) Choose the correct explanation for how the probability of Donation was computed for the three observations. (1) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "Donation" and those that classified it as "No Donation." (i) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "No Donation and those that classified it as "Donation." (ii) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "Donation." (iv) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "No Donation." Option (iii) Why were Observations B and C classified as Donation and Observation A was classified as No Donation? Donation No Donation Probability of Donation 0.4 0.8 0.6 If required, round your answers to one decimal place. The probability of Donation for Observation A is The probability of Donation for Observation B is The probability of Donation for Observation C is If required, round your answer to three decimal places. Accuracy 0.810 Predicted Class (b) Compute the values of accuracy, sensitivity, specificity, and precision. Explain why accuracy is a misleading measure to consider in this case. Evaluate the performance of the random forest, particularly commenting on the precision measure. 0.0462 0.4. It is less 0.8. It is greater 0.6. It is greater If required, round your answers to the nearest whole percentage. Accuracy is not the best measure to use for unbalanced data sets because less than The value of precision seems disturbingly small is not than 0.5, so Observation A is classified as No Donation by the random forest. than 0.5, so Observation B is classified as Donation by the random forest. ✓than 0.5, so Observation C is classified as Donation by the random forest. If required, round your answers for Sensitivity and Specificity to three decimal places and round your answer for Precision to four decimal places. Sensitivity 0.920 Specificity 0.812 Precision- % of the alumni in the data have donated. V. The precision measure represents the percentage of alumni classified by the random forest as Donations Va tremendous improvement in the ability to target alumni who may be more likely to donate. ✓that are donors. Comparing the value of precision with the proportion of observations corresponding to donations, there

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question
100%

Please check and answer the blank %

A university is applying classification methods in order to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100
classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set:
Actual
No Donation
23
23,384
No Donation
The following table lists some information on individual observations from the validation set:
Donation
Observation ID
A
B
с
Predicted
Donation
265
5,430
Actual Class
No Donation
Donation
No Donation
Probability of
Donation
0.4
0.8
0.6
If required, round your answers to one decimal place.
The probability of Donation for Observation A is
The probability of Donation for Observation B is
The probability of Donation for Observation C is
(a) Choose the correct explanation for how the probability of Donation was computed for the three observations.
(i) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "Donation" and those that classified it as "No Donation."
(ii) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "No Donation" and those that classified it as "Donation."
(iii) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "Donation."
(iv) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "No Donation."
Option (iii)
Why were Observations B and C classified as Donation and Observation A was classified as No Donation?
If required, round your answer to three decimal places.
Accuracy =
0.810
Predicted
Class
0.812
0.0462
No Donation
Donation
Donation
The value of precision seems disturbingly small
is not
0.4
0.8
(b) Compute the values of accuracy, sensitivity, specificity, and precision. Explain why accuracy is a misleading measure to consider in this case. Evaluate the performance of the random forest, particularly commenting on the precision measure.
It is less
It is greater
0.6 It is greater
If required, round your answers to the nearest whole percentage.
Accuracy is not the best measure to use for unbalanced data sets because less than
than 0.5, so Observation A is classified as No Donation by the random forest.
V than 0.5, so Observation B is classified as Donation by the random forest.
than 0.5, so Observation C is classified as Donation by the random forest.
If required, round your answers for Sensitivity and Specificity to three decimal places and round your answer for Precision to four decimal places.
Sensitivity =
0.920
Specificity =
Precision =
% of the alumni in the data have donated.
The precision measure represents the percentage of alumni classified by the random forest as Donations
V a tremendous improvement in the ability to target alumni who may be more likely to donate.
that are donors. Comparing the value of precision with the proportion of observations corresponding to donations, there
Transcribed Image Text:A university is applying classification methods in order to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100 classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set: Actual No Donation 23 23,384 No Donation The following table lists some information on individual observations from the validation set: Donation Observation ID A B с Predicted Donation 265 5,430 Actual Class No Donation Donation No Donation Probability of Donation 0.4 0.8 0.6 If required, round your answers to one decimal place. The probability of Donation for Observation A is The probability of Donation for Observation B is The probability of Donation for Observation C is (a) Choose the correct explanation for how the probability of Donation was computed for the three observations. (i) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "Donation" and those that classified it as "No Donation." (ii) The probability of Donation for each observation is the ratio of the individual classification trees that classified the observation as "No Donation" and those that classified it as "Donation." (iii) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "Donation." (iv) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "No Donation." Option (iii) Why were Observations B and C classified as Donation and Observation A was classified as No Donation? If required, round your answer to three decimal places. Accuracy = 0.810 Predicted Class 0.812 0.0462 No Donation Donation Donation The value of precision seems disturbingly small is not 0.4 0.8 (b) Compute the values of accuracy, sensitivity, specificity, and precision. Explain why accuracy is a misleading measure to consider in this case. Evaluate the performance of the random forest, particularly commenting on the precision measure. It is less It is greater 0.6 It is greater If required, round your answers to the nearest whole percentage. Accuracy is not the best measure to use for unbalanced data sets because less than than 0.5, so Observation A is classified as No Donation by the random forest. V than 0.5, so Observation B is classified as Donation by the random forest. than 0.5, so Observation C is classified as Donation by the random forest. If required, round your answers for Sensitivity and Specificity to three decimal places and round your answer for Precision to four decimal places. Sensitivity = 0.920 Specificity = Precision = % of the alumni in the data have donated. The precision measure represents the percentage of alumni classified by the random forest as Donations V a tremendous improvement in the ability to target alumni who may be more likely to donate. that are donors. Comparing the value of precision with the proportion of observations corresponding to donations, there
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 3 steps

Blurred answer
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman