FOIL’s information gain.

Consider a training set that contains 100 positive examples and 400 negative
examples. For each of the following candidate rules,
R1: A ?? + (covers 4 positive and 1 negative examples),
R2: B ?? + (covers 30 positive and 10 negative examples),
R3: C ?? + (covers 100 positive and 90 negative examples),
determine which is the best and worst candidate rule according to:


Assume the initial rule is ? ?? +. This rule covers p0 = 100 positive
examples and n0 = 400 negative examples.
The rule R1 covers p1 = 4 positive examples and n1 = 1 negative
example. Therefore, the FOIL’s information gain for this rule is

The rule R2 covers p1 = 30 positive examples and n1 = 10 negative
example. Therefore, the FOIL’s information gain for this rule is

The rule R3 covers p1 = 100 positive examples and n1 = 90 negative
example. Therefore, the FOIL’s information gain for this rule is

Therefore, R3 is the best candidate and R1 is the worst candidate ac-
cording to FOIL’s information gain.

Computer Science & Information Technology

You might also like to view...

Two of the most important factors to consider when choosing an LCD monitor are its resolution and ________

A) refresh rate B) aspect ratio C) contrast D) dot pitch

Computer Science & Information Technology

The Windows XP ICF filters only inbound packets.What are the advantages of not checking outgoing packets? What are the disadvantages?

What will be an ideal response?

Computer Science & Information Technology

The mailing notation SPECIAL DELIVERY is ____.

A. keyed below the return address on the envelope at about 1" B. keyed at the right on the envelope below the stamp at about 1.2" C. not keyed on the envelope D. keyed on the letter one blank line below the salutation

Computer Science & Information Technology

What is an archetype?

What will be an ideal response?

Computer Science & Information Technology