FOIL’s information gain.
Consider a training set that contains 100 positive examples and 400 negative
examples. For each of the following candidate rules,
R1: A ?? + (covers 4 positive and 1 negative examples),
R2: B ?? + (covers 30 positive and 10 negative examples),
R3: C ?? + (covers 100 positive and 90 negative examples),
determine which is the best and worst candidate rule according to:
Assume the initial rule is ? ?? +. This rule covers p0 = 100 positive
examples and n0 = 400 negative examples.
The rule R1 covers p1 = 4 positive examples and n1 = 1 negative
example. Therefore, the FOIL’s information gain for this rule is
The rule R2 covers p1 = 30 positive examples and n1 = 10 negative
example. Therefore, the FOIL’s information gain for this rule is
The rule R3 covers p1 = 100 positive examples and n1 = 90 negative
example. Therefore, the FOIL’s information gain for this rule is
Therefore, R3 is the best candidate and R1 is the worst candidate ac-
cording to FOIL’s information gain.
You might also like to view...
Two of the most important factors to consider when choosing an LCD monitor are its resolution and ________
A) refresh rate B) aspect ratio C) contrast D) dot pitch
The Windows XP ICF filters only inbound packets.What are the advantages of not checking outgoing packets? What are the disadvantages?
What will be an ideal response?
The mailing notation SPECIAL DELIVERY is ____.
A. keyed below the return address on the envelope at about 1" B. keyed at the right on the envelope below the stamp at about 1.2" C. not keyed on the envelope D. keyed on the letter one blank line below the salutation
What is an archetype?
What will be an ideal response?