Digging into acceptor splice site prediction: an iterative feature selection approach.
Feature selection techniques are often used to reduce data dimensionality, increase classification performance, and gain insight into the processes that generated the data.In this paper, we describe an iterative procedure of feature selection and feature construction steps, improving the classification of acceptor splice sites, an important subtask of gene prediction. We show that acceptor prediction can benefit from feature selection, and describe how feature selection techniques can be used to gain new insights in the classification of acceptor sites.This is illustrated by the identification of a new, biologically motivated feature: the AG-scanning feature. The results described in this paper contribute both to the domain of gene prediction, and to research in feature selection techniques, describing a new wrapper based feature weighting method that aids in knowledge discovery when dealing with complex datasets.
Saeys, Y., Degroeve, S., Van de Peer, Y. (2004) Digging into acceptor splice site prediction: an iterative feature selection approach. Proceedings of ECML/PKDD Lecture Notes in Artificial Intelligence,3202:386-397.
VIB / UGent
Bioinformatics & Evolutionary Genomics
+32 (0) 9 33 13807 (phone)
+32 (0) 9 33 13809 (fax)