HKU CS-Database Research Group

The University of Hong Kong
Department of Computer Science

home

db seminars

31 May 2002

Evaluation of Techniques for Classifying Biological Sequences

Speaker: Sarah CHAN

Abstract

In recent years there has been an exponential increase in the amount of publicly accessible biological information, such as DNA and protein sequences. This has resulted in an increased interest in developing computational techniques to automatically classify these large volumes of sequence data into various categories corresponding to either their role in the chromosomes, their structure, and/or their functions. This paper evaluates some of the widely used sequence classification algorithms and develops a framework for modeling sequences in a fashion so that traditional machine learning algorithms, such as support vector machines, can be applied easily. Experiments show that the SVM-based approaches are able to achieve higher classification accuracy compared to the more traditional sequence classification algorithms such as K-nearest neighbor based approaches and Markov model based techniques.

Read the Presentation Slides...

Referred Papers

	Back to the top
	Comment? Send to dbgroup@cs.hku.hk