According to the classification error rate, which attribute would be chosen as the first splitting attribute? For each attribute, show the contingency table and the gains in classification error rate.

The following table summarizes a data set with three attributes A, B, C and

two class labels +, ?. Build a two-level decision tree.




The error rate for the data without partitioning on any attribute is


image


After splitting on attribute A, the gain in error rate is:


image


After splitting on attribute B, the gain in error rate is:


image


After splitting on attribute C, the gain in error rate is:





The algorithm chooses attribute A because it has the highest gain.

Computer Science & Information Technology

You might also like to view...

A(n) ____ specifies a relationship between tables and the properties of that relationship.

A. link B. join C. hyperlink D. object

Computer Science & Information Technology

What is a MAC tag and how does it work?

What will be an ideal response?

Computer Science & Information Technology

The lines between cells in a spreadsheet are called ________ lines

Fill in the blank(s) with correct word

Computer Science & Information Technology

LuAnn has written a book report on "To Kill A Mockingbird" for her English class. Her instructor was impressed and has asked her to talk about the book at the next class. LuAnn decides to edit the report and convert it from a Word document to a PowerPoint presentation.The slides that LuAnn inserted in PowerPoint from the Word outline appear in the ____ pane.

A. Title and Content B. Notes C. Slides and Outline D. New Slide

Computer Science & Information Technology