A university is applying classification methods to identify alumni who may be interested in donating money. The university has a database of 58,205 alumni profiles containing numerous variables. Of these 58,205 alumni, only 576 have donated in the past. The university has oversampled the data and trained a random forest of 100 classification trees. For a cutoff value of 0.5, the following confusion matrix summarizes the performance of the random forest on a validation set:

                         Predicted
Actual            Donation    No Donation
Donation               265             23
No Donation          5,430         23,384

The following table lists some information on individual observations from the validation set:

Observation ID    Actual Class     Probability of Donation    Predicted Class
A                 [not legible]    0.4                        No Donation
B                 [not legible]    0.8                        Donation
C                 [not legible]    0.6                        Donation

Entries in square brackets below are the answers entered on the page.

(a) Choose the correct explanation for how the probability of Donation was computed for the three observations.

(i) The probability of Donation for each observation is the ratio of the number of individual classification trees that classified the observation as "Donation" to the number that classified it as "No Donation."
(ii) The probability of Donation for each observation is the ratio of the number of individual classification trees that classified the observation as "No Donation" to the number that classified it as "Donation."
(iii) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "Donation."
(iv) The probability of Donation for each observation is the proportion of the 100 individual classification trees that classified the observation as "No Donation."

[Option (iii)]

Why were Observations B and C classified as Donation and Observation A was classified as No Donation? If required, round your answers to one decimal place.

The probability of Donation for Observation A is [0.4]. It is [less] than 0.5, so Observation A is classified as No Donation by the random forest.
The probability of Donation for Observation B is [0.8]. It is [greater] than 0.5, so Observation B is classified as Donation by the random forest.
The probability of Donation for Observation C is [0.6]. It is [greater] than 0.5, so Observation C is classified as Donation by the random forest.

(b) Compute the values of accuracy, sensitivity, specificity, and precision. Explain why accuracy is a misleading measure to consider in this case. Evaluate the performance of the random forest, particularly commenting on the precision measure.

If required, round your answers for Accuracy, Sensitivity, and Specificity to three decimal places and round your answer for Precision to four decimal places.

Accuracy       [0.810]
Sensitivity    [0.920]
Specificity    [0.812]
Precision      [0.0462]

If required, round your answers to the nearest whole percentage.

Accuracy is not the best measure to use for unbalanced data sets because less than ____% of the alumni in the data have donated. The value of precision seems disturbingly small. The precision measure represents the percentage of alumni classified by the random forest as Donations [that are donors]. Comparing the value of precision with the proportion of observations corresponding to donations, there [is not] a tremendous improvement in the ability to target alumni who may be more likely to donate.
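The voting rule in option (iii) of part (a) can be sketched in a few lines of Python. This is only an illustration: the per-tree vote counts (40, 80, and 60 out of 100) are hypothetical values chosen to match the stated probabilities, the helper names are made up for this sketch, and treating a probability of exactly 0.5 as No Donation is an assumption the question does not settle.

```python
N_TREES = 100  # size of the random forest stated in the question


def donation_probability(votes):
    """Proportion of trees that voted 'Donation' (1 = Donation, 0 = No Donation)."""
    return sum(votes) / len(votes)


def classify(probability, cutoff=0.5):
    """Apply the cutoff; a tie at the cutoff goes to 'No Donation' (assumption)."""
    return "Donation" if probability > cutoff else "No Donation"


# Hypothetical numbers of trees voting 'Donation' for each observation,
# chosen to reproduce the probabilities 0.4, 0.8, and 0.6.
donation_votes = {"A": 40, "B": 80, "C": 60}

results = {}
for obs_id, n_yes in donation_votes.items():
    votes = [1] * n_yes + [0] * (N_TREES - n_yes)
    p = donation_probability(votes)
    results[obs_id] = (p, classify(p))
    print(f"Observation {obs_id}: P(Donation) = {p:.1f} -> {results[obs_id][1]}")
```

Under these assumptions, Observation A falls below the 0.5 cutoff while B and C fall above it, which matches the classifications described in part (a).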
Please check my answers and fill in the blank percentage (%).
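One way to check the entries is to recompute each measure directly from the confusion matrix in the question. The sketch below assumes the matrix is read as 265 true positives, 23 false negatives, 5,430 false positives, and 23,384 true negatives (a reading consistent with the entered sensitivity of 0.920 and specificity of 0.812); the variable names are illustrative only. Under that reading, accuracy comes out near 0.813 and precision near 0.0465, slightly different from the 0.810 and 0.0462 shown, and the share of alumni who have donated is just under 1%.

```python
# Confusion matrix counts, assuming rows are actual classes and
# columns are predicted classes (Donation is the positive class).
tp, fn = 265, 23        # actual Donation: predicted Donation / No Donation
fp, tn = 5430, 23384    # actual No Donation: predicted Donation / No Donation

total = tp + fn + fp + tn

accuracy = (tp + tn) / total      # share of all predictions that are correct
sensitivity = tp / (tp + fn)      # share of actual donors that are caught
specificity = tn / (tn + fp)      # share of actual non-donors that are caught
precision = tp / (tp + fp)        # share of predicted donors who are donors

# Proportion of donors in the full alumni database (576 of 58,205).
base_rate = 576 / 58205

# How many times more concentrated donors are among predicted donors
# than in the database as a whole.
lift = precision / base_rate

print(f"Accuracy:    {accuracy:.3f}")
print(f"Sensitivity: {sensitivity:.3f}")
print(f"Specificity: {specificity:.3f}")
print(f"Precision:   {precision:.4f}")
print(f"Donor share: {base_rate:.2%}")
print(f"Lift:        {lift:.1f}x")
```

Because fewer than 1% of alumni have donated, a model that predicts No Donation for everyone would already be about 99% accurate, which is why accuracy is misleading here. Precision looks small in absolute terms, but under these assumptions it is several times the base rate, which points toward "is" rather than "is not" for the final dropdown.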