Assignment-2 Immad Allawala

pdf

School

Toronto Metropolitan University *

*We aren’t endorsed by this school

Course

415

Subject

Computer Science

Date

Dec 6, 2023

Type

pdf

Pages

Uploaded by ElderExplorationHare18

ITM-618 Assignment-2 (10%): Clustering, Similarity, and confusion matrix [Mandatory: You can do the calculation on paper, but please attach the calculations for all three questions along with your submission, without the calculation, the assignment will not be graded] Please answer the following questions: Q1. (Item similarity calculation) 4 Points The following table represents the attributes and their values for three Persons (PersonA, PersonB, and PersconC) Attribute Person A Person B Person C Age 35 50 55 Credit Score 730 820 680 Income 95K 120K 75K Loan 100K 40K 100K Using Euclidean distance, calculate the similarities between the three persons (PersonA, PersonB, and PersonC) and complete the following table with their similarity scores: PersonA PersonB PersonC PersonA PersonB PersonC Now, use the table to find the most similar person to PersonA? Q2. (Classification using k-NN) 7 Points The following 7- customers have applied for credit cards to a credit card company. The column ‘Approved’ shows the credit card application status: ‘No’ means the application was not approved, and ‘Yes’ means the application was approved. Customer Age Income Loan Credit Score Approved John 35 95K 300K 730 No Rachel 22 50K 10K 830 Yes Hannah 63 180K 20K 800 Yes Tom 59 50K 80K 650 No Nellie 25 60K 250K 680 No James 45 120K 40K 850 Yes Robin 42 70K 40K 780 Yes 0 0 0 65,000.06 65,000.06 20,000.07 20,000.07 75,000.13 75,000.13 Person C is closest to person A

Now, a new customer ‘David’ applies for a credit card, and David has the following profile: Age: 37 Income: 85K Loan: 45K Credit Score: 790 Apply the k-NN algorithm to determine whether David’s application will be Approved or Not! Complete the following table with the approval status (Yes/No) for the given values of k: k Approve 1 2 3 4 5 6 7 Q3. (Confusion matrix) 14 Points Given this confusion matrix for machine learning classifier, answer the following questions: 3.1 how many classes are there in this data set? 3.2 What is the data set size? 3.3 How many samples where misclassified? 3.4 How many cases belong to class (C1)? 3.5 The accuracy of the classifier is ___________ Actual labels Predicted labels C1 C2 C3 C1 25 5 1 C2 6 24 3 C3 2 4 26 3.6 Compute the precision of the classifier and interpret it. 3.7 Compute the recall of the classifier and interpret it. Yes Yes Yes Yes Yes Yes Yes 3 classes C1+C2+C3 = (25+5+1)+(6+24+3)+(2+4+26) = 96 96 - (25+24+26) = 21 C1 = 25+5+1 = 31 (25+24+26)/96 = 78.12% C1 = 0.806, C2 = 0.96, C3 = 0.67 Since C2 has the highest accuracy it is the most accurate classifier whereas C3 is the least accurate C1 = 0.83, C2 = 0.96, C3 = 0.67 Since C2 is a higher recall it is better at distinguishing the different instances of every class, whereas C3 is a lower recall so it is not as good at determining the different instances of a class

Your preview ends here

Eager to read complete document? Join bartleby learn and gain access to the full version

Access to all documents
Unlimited textbook solutions
24/7 expert homework help

Related Documents

Project 3 Quiz.py

Case Application Assignment 2 - Sajira Nageswaran .pdf

CIS 105 Assignment Schedule - Copy.xlsx

EMPS 101 - Week 10 - Online Networking Assignment Template - 2022-12-19(1)(2)(2).docx

LabF_netid-2.xlsx

Mod 12-II-A Quiz #2.pdf

A2-Debugging_Questions_Answers[1].docx

Testing_Correlations.m

QmB3300CBE. Del. 6. Prof. Tristan Brown.pptx

Midterm.docx

Assignment 6 - Location.docx

Test1CSE130 ethan suttor 5519519.docx

Recommended textbooks for you

Database Systems: Design, Implementation, & Manag...

Computer Science

ISBN:9781305627482

Author:Carlos Coronel, Steven Morris

Publisher:Cengage Learning

Database Systems: Design, Implementation, & Manag...

Computer Science

ISBN:9781285196145

Author:Steven, Steven Morris, Carlos Coronel, Carlos, Coronel, Carlos; Morris, Carlos Coronel and Steven Morris, Carlos Coronel; Steven Morris, Steven Morris; Carlos Coronel

Publisher:Cengage Learning

Np Ms Office 365/Excel 2016 I Ntermed

Computer Science

ISBN:9781337508841

Author:Carey

Publisher:Cengage

CMPTR

Computer Science

ISBN:9781337681872

Author:PINARD

Publisher:Cengage

SEE MORE TEXTBOOKS

Recommended textbooks for you

Database Systems: Design, Implementation, & Manag...
Computer Science
ISBN:9781305627482
Author:Carlos Coronel, Steven Morris
Publisher:Cengage Learning
Database Systems: Design, Implementation, & Manag...
Computer Science
ISBN:9781285196145
Author:Steven, Steven Morris, Carlos Coronel, Carlos, Coronel, Carlos; Morris, Carlos Coronel and Steven Morris, Carlos Coronel; Steven Morris, Steven Morris; Carlos Coronel
Publisher:Cengage Learning
Np Ms Office 365/Excel 2016 I Ntermed
Computer Science
ISBN:9781337508841
Author:Carey
Publisher:Cengage
CMPTR
Computer Science
ISBN:9781337681872
Author:PINARD
Publisher:Cengage

Database Systems: Design, Implementation, & Manag...

Computer Science

ISBN:9781305627482

Author:Carlos Coronel, Steven Morris

Publisher:Cengage Learning

Database Systems: Design, Implementation, & Manag...

Computer Science

ISBN:9781285196145

Author:Steven, Steven Morris, Carlos Coronel, Carlos, Coronel, Carlos; Morris, Carlos Coronel and Steven Morris, Carlos Coronel; Steven Morris, Steven Morris; Carlos Coronel

Publisher:Cengage Learning

Np Ms Office 365/Excel 2016 I Ntermed

Computer Science

ISBN:9781337508841

Author:Carey

Publisher:Cengage

CMPTR

Computer Science

ISBN:9781337681872

Author:PINARD

Publisher:Cengage

SEE MORE TEXTBOOKS