The dating web site Oollama.com requires its users to create profiles based on a survey in which they rate their interest (on a scale from 0 to 3) in five categories: physical fitness, music, spirituality, education, and alcohol consumption. A new Oollama customer, Erin O'Shaughnessy, has reviewed the profiles of 40 prospective dates and classified whether she is interested in learning more about them. Based on Erin's classification of these 40 profiles, Oollama has applied a logistic regression to predict Erin's interest in other profiles that she has not yet viewed. The resulting logistic regression model is as follows:   For the 40 profiles (observations) on which Erin classified her interest, this logistic regression model generates that following probability of Interested.     Probability of     Probability of Observation Interested Interested Observation Interested Interested 35   1 1.000 13   0 0.412 21   1 0.999 2   0 0.285 29   1 0.999 3   0 0.219 25   1 0.999 7   0 0.168 39   1 0.999 9   0 0.168 26   1 0.990 12   0 0.168 23   1 0.981 18   0 0.168 33   1 0.974 22   1 0.168 1   0 0.882 31   1 0.168 24   1 0.882 6   0 0.128 28   1 0.882 20   0 0.128 36   1 0.882 15   0 0.029 16   0 0.791 5   0 0.020 27   1 0.791 14   0 0.015 30   1 0.791 19   0 0.011 32   1 0.791 8   0 0.008 34   1 0.791 10   0 0.001 37   1 0.791 17   0 0.001 40   1 0.791 4   0 0.001 38   1 0.732 11   0 0.000   (a) Using a cutoff value of 0.5 to classify a profile observation as Interested or not, construct the confusion matrix for this 40-observation training set.         Predicted Actual 0 1 0     1

MATLAB: An Introduction with Applications
6th Edition
ISBN:9781119256830
Author:Amos Gilat
Publisher:Amos Gilat
Chapter1: Starting With Matlab
Section: Chapter Questions
Problem 1P
icon
Related questions
Question
100%

The dating web site Oollama.com requires its users to create profiles based on a survey in which they rate their interest (on a scale from 0 to 3) in five categories: physical fitness, music, spirituality, education, and alcohol consumption. A new Oollama customer, Erin O'Shaughnessy, has reviewed the profiles of 40 prospective dates and classified whether she is interested in learning more about them.

Based on Erin's classification of these 40 profiles, Oollama has applied a logistic regression to predict Erin's interest in other profiles that she has not yet viewed. The resulting logistic regression model is as follows:

 

For the 40 profiles (observations) on which Erin classified her interest, this logistic regression model generates that following probability of Interested.

    Probability of     Probability of
Observation Interested Interested Observation Interested Interested
35   1 1.000 13   0 0.412
21   1 0.999 2   0 0.285
29   1 0.999 3   0 0.219
25   1 0.999 7   0 0.168
39   1 0.999 9   0 0.168
26   1 0.990 12   0 0.168
23   1 0.981 18   0 0.168
33   1 0.974 22   1 0.168
1   0 0.882 31   1 0.168
24   1 0.882 6   0 0.128
28   1 0.882 20   0 0.128
36   1 0.882 15   0 0.029
16   0 0.791 5   0 0.020
27   1 0.791 14   0 0.015
30   1 0.791 19   0 0.011
32   1 0.791 8   0 0.008
34   1 0.791 10   0 0.001
37   1 0.791 17   0 0.001
40   1 0.791 4   0 0.001
38   1 0.732 11   0 0.000

 

(a) Using a cutoff value of 0.5 to classify a profile observation as Interested or not, construct the confusion matrix for this 40-observation training set.
   
 
  Predicted
Actual 0 1
0    
1    
   
  Compute sensitivity, specificity, and precision measures and interpret them within the context of Erin's dating prospects.
   
  If required, round your answers to two decimal places. Do not round intermediate calculations.
 

The sensitivity of the model is  . This suggests that the model is reasonably  at identifying the profiles that Erin is interested in.

The specificity of the model is  . This suggests that the model is reasonably  at avoiding recommending profiles to Erin that she will not be interested in.

The precision of the model is  . This suggests that the model is reasonably  at suggesting profiles of interest to Erin.

   
(b) Oollama understands that its clients have a limited amount of time for dating and therefore use decile-wise lift charts to evaluate their classification models. For the training data, what is the first decile lift resulting from the logistic regression model? Interpret this value.
   
 

The first decile lift of this classification is  . It means that the first decile of the logistic regression model  the number of profiles that Erin is interested in versus random selection.

   
(c) A recently posted profile has values of Fitness = 3, Music = 1, Education = 3, and Alcohol = 1. Use the estimated logistic regression equation to compute the probability of Erin's interest in this profile.
   
  If required, round your answers to three decimal places. Do not round intermediate calculations.
 

Log odds = 

Probability of Interest = 

   
(d) Now that Oollama has trained a logistic regression model based on Erin's initial evaluations of 40 profiles, what should its next steps be in the modeling process?
   
  Oollama should use their model to suggest profiles  to Erin in order to compute classification accuracy measures on a validation set.
   
The dating web site Oollama.com requires its users to create profiles based on a survey in which they rate their interest (on a scale from 0 to 3) in five categories: physical fitness, music, spirituality, education, and alcohol consumption. A new Oollama customer, Erin O'Shaughnessy, has reviewed the profiles of 40 prospective dates and
classified whether she is interested in learning more about them.
Based on Erin's classification of these 40 profiles, Oollama has applied a logistic regression to predict Erin's interest in other profiles that she has not yet viewed. The resulting logistic regression model is as follows:
Log odds of Interested = -0.920 +0.325 x Fitness - 3.611 x Music +5.535 x Education - 2.927 x Alcohol
For the 40 profiles (observations) on which Erin classified her interest, this logistic regression model generates that following probability of Interested.
Probability of
Interested
1.000
0.999
0.999
0.999
0.999
0.990
0.981
0.974
0.882
0.882
0.882
0.882
0.791
0.791
0.791
0.791
0.791
0.791
0.791
0.732
Observation
35
21
29
25
39
26
23
33
1
24
28
36
16
27
30
32
34
37
40
38
Actual
0
Interested
1
1
1
1
1
1
1
1
0
1
1
1
0
1
1
1
1
1
1
1
1
0
Predicted
1
Observation
13
2
3
7
9
12
18
22
31
6
20
15
5
14
19
8
10
17
4
11
The first decile lift of this classification is
Interested
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
(a) Using a cutoff value of 0.5 to classify a profile observation as Interested or not, construct the confusion matrix for this 40-observation training set.
0
0
0
Probability of
Interested
0.412
0.285
0.219
0.168
0.168
0.168
0.168
0.168
0.168
0.128
0.128
0.029
0.020
0.015
0.011
0.008
0.001
0.001
0.001
0.000
Compute sensitivity, specificity, and precision measures and interpret them within the context of Erin's dating prospects.
If required, round your answers to two decimal places. Do not round intermediate calculations.
The sensitivity of the model is
The specificity of the model is
This suggests that the model is reasonably
This suggests that the model is reasonably
This suggests that the model is reasonably
The precision of the model is.
Select your answer -
Select your answer
Select your answer
(b) Oollama understands that its clients have a limited amount of time for dating and therefore use decile-wise lift charts to evaluate their classification models. For the training data, what is the first decile lift resulting from the logistic regression model? Interpret this value.
at identifying the profiles that Erin is interested in.
at avoiding recommending profiles to Erin that she will not be interested in.
at suggesting profiles of interest to Erin.
It means that the first decile of the logistic regression model Select your answer the number of profiles that Erin is interested in versus random selection.
If required, round your answers to three decimal places. Do not round intermediate calculations.
Log odds =
Probability of Interest =
(c) A recently posted profile has values of Fitness = 3, Music = 1, Education = 3, and Alcohol = 1. Use the estimated logistic regression equation to compute the probability of Erin's interest in this profile.
Transcribed Image Text:The dating web site Oollama.com requires its users to create profiles based on a survey in which they rate their interest (on a scale from 0 to 3) in five categories: physical fitness, music, spirituality, education, and alcohol consumption. A new Oollama customer, Erin O'Shaughnessy, has reviewed the profiles of 40 prospective dates and classified whether she is interested in learning more about them. Based on Erin's classification of these 40 profiles, Oollama has applied a logistic regression to predict Erin's interest in other profiles that she has not yet viewed. The resulting logistic regression model is as follows: Log odds of Interested = -0.920 +0.325 x Fitness - 3.611 x Music +5.535 x Education - 2.927 x Alcohol For the 40 profiles (observations) on which Erin classified her interest, this logistic regression model generates that following probability of Interested. Probability of Interested 1.000 0.999 0.999 0.999 0.999 0.990 0.981 0.974 0.882 0.882 0.882 0.882 0.791 0.791 0.791 0.791 0.791 0.791 0.791 0.732 Observation 35 21 29 25 39 26 23 33 1 24 28 36 16 27 30 32 34 37 40 38 Actual 0 Interested 1 1 1 1 1 1 1 1 0 1 1 1 0 1 1 1 1 1 1 1 1 0 Predicted 1 Observation 13 2 3 7 9 12 18 22 31 6 20 15 5 14 19 8 10 17 4 11 The first decile lift of this classification is Interested 0 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 (a) Using a cutoff value of 0.5 to classify a profile observation as Interested or not, construct the confusion matrix for this 40-observation training set. 0 0 0 Probability of Interested 0.412 0.285 0.219 0.168 0.168 0.168 0.168 0.168 0.168 0.128 0.128 0.029 0.020 0.015 0.011 0.008 0.001 0.001 0.001 0.000 Compute sensitivity, specificity, and precision measures and interpret them within the context of Erin's dating prospects. If required, round your answers to two decimal places. Do not round intermediate calculations. The sensitivity of the model is The specificity of the model is This suggests that the model is reasonably This suggests that the model is reasonably This suggests that the model is reasonably The precision of the model is. Select your answer - Select your answer Select your answer (b) Oollama understands that its clients have a limited amount of time for dating and therefore use decile-wise lift charts to evaluate their classification models. For the training data, what is the first decile lift resulting from the logistic regression model? Interpret this value. at identifying the profiles that Erin is interested in. at avoiding recommending profiles to Erin that she will not be interested in. at suggesting profiles of interest to Erin. It means that the first decile of the logistic regression model Select your answer the number of profiles that Erin is interested in versus random selection. If required, round your answers to three decimal places. Do not round intermediate calculations. Log odds = Probability of Interest = (c) A recently posted profile has values of Fitness = 3, Music = 1, Education = 3, and Alcohol = 1. Use the estimated logistic regression equation to compute the probability of Erin's interest in this profile.
Expert Solution
trending now

Trending now

This is a popular solution!

steps

Step by step

Solved in 4 steps

Blurred answer
Follow-up Questions
Read through expert solutions to related follow-up questions below.
Follow-up Question

How to calculate decile-wise lift chart?

Solution
Bartleby Expert
SEE SOLUTION
Similar questions
  • SEE MORE QUESTIONS
Recommended textbooks for you
MATLAB: An Introduction with Applications
MATLAB: An Introduction with Applications
Statistics
ISBN:
9781119256830
Author:
Amos Gilat
Publisher:
John Wiley & Sons Inc
Probability and Statistics for Engineering and th…
Probability and Statistics for Engineering and th…
Statistics
ISBN:
9781305251809
Author:
Jay L. Devore
Publisher:
Cengage Learning
Statistics for The Behavioral Sciences (MindTap C…
Statistics for The Behavioral Sciences (MindTap C…
Statistics
ISBN:
9781305504912
Author:
Frederick J Gravetter, Larry B. Wallnau
Publisher:
Cengage Learning
Elementary Statistics: Picturing the World (7th E…
Elementary Statistics: Picturing the World (7th E…
Statistics
ISBN:
9780134683416
Author:
Ron Larson, Betsy Farber
Publisher:
PEARSON
The Basic Practice of Statistics
The Basic Practice of Statistics
Statistics
ISBN:
9781319042578
Author:
David S. Moore, William I. Notz, Michael A. Fligner
Publisher:
W. H. Freeman
Introduction to the Practice of Statistics
Introduction to the Practice of Statistics
Statistics
ISBN:
9781319013387
Author:
David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:
W. H. Freeman