(Hint: only the leaves of the old decision tree need to be changed.) Does the decision tree capture the “+” concept?

Following is a data set that contains two attributes, X and Y, and two class labels, “+” and “−”. Each attribute can take three different values: 0, 1, or 2.

The concept for the “+” class is Y = 1 and the concept for the “−” class is X = 0 ∨ X = 2.


Build a new decision tree with the following cost function:



The cost matrix can be summarized as follows:





The decision tree in part (a) has 7 leaf nodes: X = 1, X = 0 ∧ Y = 0, X = 0 ∧ Y = 1, X = 0 ∧ Y = 2, X = 2 ∧ Y = 0, X = 2 ∧ Y = 1, and X = 2 ∧ Y = 2. Only X = 0 ∧ Y = 1 and X = 2 ∧ Y = 1 are impure nodes. The cost of labeling these impure nodes as the positive class is:

10 × 0 + 1 × 100 = 100

while the cost of labeling them as the negative class is:

10 × 20 + 0 × 100 = 200.

These nodes are therefore labeled as +.
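The leaf-relabeling rule used above can be sketched in a few lines of Python. This is only an illustration: the class counts (10 positive, 100 negative) and the misclassification costs (20 for a + predicted as −, 1 for a − predicted as +) are assumptions read off the arithmetic in the solution, since the data table and cost matrix are not reproduced here. The names `COST`, `labeling_cost`, and `min_cost_label` are hypothetical.

```python
# Assumed cost matrix, keyed by (actual class, predicted class).
# Values are inferred from the arithmetic in the solution, not from the
# original cost matrix, which is missing from this page.
COST = {
    ("+", "+"): 0,
    ("+", "-"): 20,  # a positive instance labeled negative
    ("-", "+"): 1,   # a negative instance labeled positive
    ("-", "-"): 0,
}


def labeling_cost(counts, predicted, cost=COST):
    """Total cost of assigning `predicted` to every instance in a leaf."""
    return sum(n * cost[(actual, predicted)] for actual, n in counts.items())


def min_cost_label(counts, cost=COST):
    """Return the label whose total misclassification cost is lowest."""
    return min(("+", "-"), key=lambda label: labeling_cost(counts, label, cost))


# Assumed class counts in an impure leaf.
leaf = {"+": 10, "-": 100}

cost_pos = labeling_cost(leaf, "+")  # 10 * 0 + 100 * 1
cost_neg = labeling_cost(leaf, "-")  # 10 * 20 + 100 * 0
label = min_cost_label(leaf)

print(cost_pos, cost_neg, label)
```

Under these assumed numbers the positive label is cheaper even though negatives are the majority in the leaf, which is why only the leaf labels change and the tree structure from part (a) is kept.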


The resulting concept is


Computer Science & Information Technology
