MIE1626 Midterm Practice Problems

pdf

School

University of Toronto *

*We aren’t endorsed by this school

Course

1626

Subject

Statistics

Date

Feb 20, 2024

Type

pdf

Pages

3

Uploaded by MagistrateThunderPolarBear15

Report
Practice Problems MIE1626 Page 1 of 3 QUESTION 1 [2 marks]: In the expression Sales ≈ f(TV, Radio, Newspaper), "Sales" is the: A) Response B) Training Data C) Independent Variable D) Feature QUESTION 2 [2 marks]: In a predictive modeling project using regression, you fit a linear model to your data set. Which of the following is most likely true if you fit a quadratic model to the data set? A) Using the Quadratic Model will decrease your Irreducible Error. B) Using the Quadratic Model will decrease the Bias of your model. C) Using the Quadratic Model will decrease the Variance of your model D) Using the Quadratic Model will decrease your Reducible Error QUESTION 3 [2 marks]: One way of carrying out the bootstrap is to average equally over all possible bootstrap samples from the original data set (where two bootstrap data sets are different if they have the same observations but in different order). Unlike the usual implementation of the bootstrap, this method has the advantage of not introducing extra noise due to resampling randomly. To carry out this implementation on a data set with n data points, how many bootstrap data sets would we need to average over? A) 2 𝑛 B) 𝑛 2 C) 𝑛 𝑛 D) 𝑛! QUESTION 4 [2 marks]: Which of the following statements is more accurate about classification methods: Logistic Regression, Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), and naive Bayes? A) Logistic regression is not a suitable method when the classes are well-separated. B) LDA is useful when n is large and for problems with more than 2 classes. C) Assuming Gaussian distributions in each class, QDA is less flexible than naive Bayes. D) Naive bayes is most useful when the number of features and samples are roughly the same.
Practice Problems MIE1626 Page 2 of 3 QUESTION 5 [18 marks]: For predicting p , the probability of credit default, you have used a logistic regression model with variables 𝑥 1 = credit score and 𝑥 2 = credit card balance. Using historical data with class labels, you have fitted the model Logit(p) 𝛽 0 ̂ + 𝛽 1 ̂ 𝑥 1 + 𝛽 2 ̂ 𝑥 2 and obtained the estimated coefficients 𝛽 0 ̂ = −50 , 𝛽 1 ̂ = −1 , and 𝛽 2 ̂ = 0.2 . Part (a). [1 mark]: Explain in plain English what the estimated intercept means and provide a numerical example for its role in the model. Part (b). [2 marks]: Explain what 𝛽 2 ̂ means and provide a numerical example for how it impacts logit(p). Part (c). [1 mark]: Provide a numerical example for how 𝛽 2 ̂ impacts the odds of default. Part (d). [1 mark]: Estimate the probability of credit default for Bob who has a credit score of 70 and a credit card balance of 610. Part (e). [1 mark]: For having a credit default risk of 50% what should Bob’s credit card balance be?
Practice Problems MIE1626 Page 3 of 3 Part (f). [4 marks]: To use the logistic model as a classifier for detecting defaulters, we use the classification threshold of 𝑝̂ = 0.5. Calculate the confusion matrix for the following test dataset and specify the values for TP, TN, FP, and FN. 𝑥 1 = credit score 𝑥 2 = credit card balance Y (class labels) 70 610 1 70 700 1 70 800 1 70 500 0 70 400 1 60 600 0 50 600 0 40 600 1 80 600 0 90 600 0 Part (g). [3 marks]: Calculate precision, recall, and accuracy for the test dataset. Part (h). [2 marks]: Draw an ROC plot and indicate the point that shows the performance of the classifier on the test dataset based on the classification threshold of 𝑝̂ = 0.5. Part (i). [3 marks]: Your colleague says s/he has developed an alternative classifier producing the following test results using the same features and that it has a better F1-score. Actual class 1 0 Predicted class 1 4 1 0 2 3 Based on the results presented to you, do you recommend using the alternative classifier instead of the logistic regression model (Yes/No)? Justify your answer.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help