Calculate the gain in the Gini index when splitting on A and B. Which attribute would the decision tree induction algorithm choose?

Consider the following data set for a binary class problem.


The overall gini before splitting is:
Gorig = 1 ? 0.42 ? 0.62 = 0.48
The gain in gini after splitting on A is:
image
The gain in gini after splitting on B is:

GB=T = 1?

1

4

2

?

3

4

2

= 0.3750

GB=F = 1=

1

6

2

?

5

6

2

= 0.2778

? = Gorig ? 4/10GB=T ? 6/10GB=F = 0.1633

Therefore, attribute B will be chosen to split the node.

Computer Science & Information Technology

You might also like to view...

Which strategy does a better job keeping large blocks available?

What will be an ideal response?

Computer Science & Information Technology

What command is used to display the routing protocols configured on your router?

What will be an ideal response?

Computer Science & Information Technology

Which of the following statements about email is FALSE?

A) Copies of your email may exist on several servers even after the recipient deletes it. B) Your employer may read your email at your work. C) It cannot be accessed by hackers. D) Email can be used for targeted marketing.

Computer Science & Information Technology

In SharePoint, the ________, displayed across the top of the SharePoint Online window, enables you to use the tools in Office 365

A) breadcrumb navigation trail B) global navigation bar C) contextual command D) top link bar

Computer Science & Information Technology