Repeat part (c) for model M1 using the threshold t = 0.1. Which threshold do you prefer, t = 0.5 or t = 0.1? Are the results consistent with what you expect from the ROC curve?

You are asked to evaluate the performance of two classification models, M1

and M2. The test set you have chosen contains 26 binary attributes, labeled

as A through Z.

Table 5.5 shows the posterior probabilities obtained by applying the models to

the test set. (Only the posterior probabilities for the positive class are shown).

As this is a two-class problem, P(?)=1 ? P(+) and P(?|A, . . . , Z)=1 ?

P(+|A, . . . , Z). Assume that we are mostly interested in detecting instances

from the positive class


When t = 0.1, the confusion matrix for M1 is shown below.





For t = 0.5, area = 0.6 × (1 ? 0.2) = 0.6 × 0.8=0.48.

For t = 0.1, area = 1 × (1 ? 0.8) = 1 × 0.2=0.2.

Since the area for t = 0.5 is larger than the area for t = 0.1, we prefer

t = 0.5.

Computer Science & Information Technology

You might also like to view...

The application manager is a global tool that lets you navigate through storage devices, libraries, and folders to find files.

Answer the following statement true (T) or false (F)

Computer Science & Information Technology

?Which of the following allows you to start a mail merge through emails?

A. ?Using the mail merge when writing an e-mail in Microsoft Outlook B. ?Saving a completed mail merge as a Rich Text Format word file and copying it to an e-mail C. ?Selecting E-mail messages in the Start Mail Merge button menu in the Mailings tab D. ?Writing an email and choosing the mail merge option from the sending options

Computer Science & Information Technology

The 100Base-FX Fast Ethernet transmits data over two multimode ____________________ fiber cables.

Fill in the blank(s) with the appropriate word(s).

Computer Science & Information Technology

The brightness of a photo is called the ________

Fill in the blank(s) with correct word

Computer Science & Information Technology