L6-8

Learning by Building Identification Trees

Name

Hair

Height

Weight

Lotion

Result

Sarah

Blonde

Average

Light

no

Sunburned

Dana

Blonde

Tall

Average

Yes

None

Alex

Brown

Short

Average

Yes

None

Annie

Blonde

Short

Average

no

Sunburned

Emily

Red

Average

Heavy

no

Sunburned

Pete

Brown

Tall

Heavy

no

None

John

Brown

Average

Heavy

no

None

Katie

Blonde

Short

Light

Yes

none












Remark: Entropy increases with disorder.

For 2 classes, let p1=0, p2=1, then H=-1´log 1 - 0 log 0 « 0

If p1 = 0.5, p2 = 0.5, H=(-0.5 ´ log 0.5) ´ 2 = 1

Test

Hair

Height

Weight

Lotion

Disorder

0.5

0.69

0.94

0.61



To generate an identification tree:



From Trees to Rules




No change

Sunburned

Blonde hair

2

0

Not blonde hair

1

0


No change

Sunburned

Use lotion

2

0

Use no lotion

0

2


No change

Sunburned

Blonde hair

0

2

Not blonde hair

2

1


No change

Sunburned

Use lotion

0

2

Use no lotion

2

0



To summarize, to generate rules from an identification tree:

Significance of the evidences


R1

R2

P1

l

m

P2

n

o