HKU Research  The University of Hong Kong
Department of Computer Science and Information System
Feature
home
current research
people
publications
HKU CS

 

12 Feb 2003

Mining Decision Trees from Time-changing Data Streams
Line
Speaker: Ivy TONG

 

Abstract Many organizations today have databases that grow without limit at a rate of several million records per day. Mining these continuous data streams is a new research topic. Most statistical and machine-learning algorithms assume that training data is a random sample drawn from a stationary distribution. Unfortunately, this assumption is errorneous since the underlying concept often changes over time. Although some algorithms have been proposed for learning time-changing concepts, they generally do not scale well to the large databases or data streams.

In this talk, I will present an efficient algorithm CVFDT (Concept-adapting Very Fast Decision Tree learner) for mining decision trees from continuously-changing data streams. CVFDT is able to maintain a decision tree up-to-date with a window of examples, using a small constant amount of time for each example but with guaranteed accuracy.

Read the Presentation Slides...

Referred Papers

Back to the top

Comment?  Send to dbgroup@cs.hku.hk