Learning by Recording Cases
Using the consistency heuristic (similar to the one used in Pattern Recognition)
The consistency heuristic:
Whenever you want to guess a property of something, given nothing else to go on but a set of reference cases, find the most similar case, as measured by known properties, for which the property is known. Guess that the unknown property is the same as that known property.
Example:
Given 8 blocks of known colour, width, and height as follows, find the colour of a new block of size 1 cm × 4 cm.
Nearest Neighbour
Calculate the distance to each block and find the minimum: O(n) time.
Using a decision tree instead reduces this to O(log n).
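The brute-force O(n) approach can be sketched as follows. This is a minimal illustration, not the original slide's data: the reference blocks below are hypothetical, since the actual 8 blocks are not reproduced here. Each case pairs a (width, height) point with its known colour, and the consistency heuristic guesses the unknown colour from the nearest case.

```python
import math

def nearest_neighbour(cases, query):
    """Return the case whose known properties are closest to the query.

    cases: list of ((width, height), colour) pairs
    query: (width, height) of the new block
    """
    # Euclidean distance in the space of known properties (O(n) scan).
    return min(cases, key=lambda case: math.dist(case[0], query))

# Hypothetical reference blocks (the slide's actual data is not shown here).
blocks = [((1, 5), "orange"), ((2, 2), "purple"),
          ((4, 1), "red"), ((5, 5), "blue")]

# Consistency heuristic: guess the colour of a 1 cm x 4 cm block
# from its nearest neighbour among the reference blocks.
point, colour = nearest_neighbour(blocks, (1, 4))
print(colour)  # nearest block is (1, 5), so the guess is "orange"
```

Here the unknown property (colour) is simply copied from the most similar known case, exactly as the heuristic prescribes.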
A decision tree is a semantic tree such that
each node is connected to a set of possible answers
Each non-leaf node is connected to a test that splits its set of possible answers into subsets corresponding to different test results.
Each branch carries a particular test result's subset to another node.
A k-d tree is a decision tree (distance measured in k-dimension)
The set of possible answers consists of points, one of which may be the nearest neighbour to a given point.
Each test specifies a coordinate, a threshold, and a neutral zone around the threshold containing no points.
Each test divides a set of points into two sets, according to which side of the threshold each point lies on.
Building of the k-d tree:
If there is only one case, stop.
If this is the first division of cases, pick the vertical axis for comparison; otherwise, pick the axis that is different from the axis at the next higher level.
Considering only the axis of comparison, find the average position of the two middle objects. Call this average position the threshold, and construct a decision-tree test that compares unknowns in the axis of comparison against the threshold. Also note the positions of the two middle objects in the axis of comparison. Call these positions the upper and lower boundaries.
Divide all the objects into two subsets, according to which side of the average position they lie on.
Divide up the objects in each subset, forming a subtree for each, using this procedure.
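The building procedure above can be sketched in Python. This is a minimal 2-D version under some assumptions: points are (x, y) tuples, the tree is a plain dict, and "pick the vertical axis first" is taken to mean splitting on axis 0 at the root, alternating at each level.

```python
def build_kdtree(points, axis=0):
    """Build a k-d tree for 2-D points per the procedure above.

    Internal nodes store the splitting axis, the threshold (average of
    the two middle coordinates), and the lower/upper boundaries of the
    neutral zone; leaves store a single point.
    """
    if len(points) == 1:                     # only one case: stop
        return {"point": points[0]}
    pts = sorted(points, key=lambda p: p[axis])
    mid = len(pts) // 2
    lower, upper = pts[mid - 1][axis], pts[mid][axis]  # the two middle objects
    threshold = (lower + upper) / 2                    # average position
    return {
        "axis": axis,
        "threshold": threshold,
        "lower": lower,                      # boundaries of the neutral zone
        "upper": upper,
        "left": build_kdtree(pts[:mid], 1 - axis),   # below the threshold
        "right": build_kdtree(pts[mid:], 1 - axis),  # above the threshold
    }

# Hypothetical points: the first split falls between x=2 and x=4.
tree = build_kdtree([(1, 5), (2, 2), (4, 1), (5, 5)])
print(tree["threshold"])  # 3.0
```

Each recursive call flips the axis of comparison, so successive levels of the tree cut the space along alternating dimensions.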
The resulting decision tree:
To find the nearest neighbour using the k-d procedure:
Determine if there is only one element in the set under consideration.
If there is only one, report it.
Otherwise, compare the unknown, in the axis of comparison, against the current node's threshold. The result determines the likely set.
Find the nearest neighbour in the likely set using this procedure.
Determine whether the distance to the nearest neighbour in the likely set is less than or equal to the distance to the other set's boundary in the axis of comparison:
If it is, then report the nearest neighbour in the likely set.
If it is not, check the unlikely set using this procedure; return the nearer of the nearest neighbours in the likely set and in the unlikely set.
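The search procedure above can be sketched as follows. This is an illustrative 2-D version, not a definitive implementation: the tree builder is repeated so the example is self-contained, and the pruning test compares the candidate distance against the threshold rather than the other set's boundary, which is slightly more conservative but still correct.

```python
import math

def build_kdtree(points, axis=0):
    # Same construction procedure as above, repeated for self-containment.
    if len(points) == 1:
        return {"point": points[0]}
    pts = sorted(points, key=lambda p: p[axis])
    mid = len(pts) // 2
    threshold = (pts[mid - 1][axis] + pts[mid][axis]) / 2
    return {"axis": axis, "threshold": threshold,
            "left": build_kdtree(pts[:mid], 1 - axis),
            "right": build_kdtree(pts[mid:], 1 - axis)}

def nearest(node, query):
    """Return the nearest neighbour to `query` in the k-d tree."""
    if "point" in node:                      # only one element: report it
        return node["point"]
    axis, t = node["axis"], node["threshold"]
    # Compare the unknown against the threshold to pick the likely set.
    likely, unlikely = ((node["left"], node["right"])
                        if query[axis] <= t
                        else (node["right"], node["left"]))
    best = nearest(likely, query)
    # If the candidate is at least as close as the dividing threshold,
    # nothing in the other set can be nearer: report it.
    if math.dist(best, query) <= abs(query[axis] - t):
        return best
    # Otherwise check the unlikely set and return the nearer of the two.
    other = nearest(unlikely, query)
    return min(best, other, key=lambda p: math.dist(p, query))

points = [(1, 5), (2, 2), (4, 1), (5, 5)]   # hypothetical reference points
tree = build_kdtree(points)
print(nearest(tree, (1, 4)))  # (1, 5)
```

When the candidate from the likely set is close enough, the unlikely branch is never visited, which is what gives the search its logarithmic average behaviour.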