High-precision bio-molecular event extraction from text using parallel binary classifiers
We have developed a machine learning framework to accurately extract complex genetic interactions from text. Employing type-specific classifiers,
this framework processes research articles to extract various biological events. Subsequently, the algorithm identifies regulation events that
take other events as arguments, allowing a nested structure of predictions. All predictions are merged into an integrated network, useful for
visualisation and for deduction of new biological knowledge. In this paper, we discuss several design choices for an event-based extraction framework.
These detailed studies help improve on existing systems, as illustrated by the 10% relative performance gain of our system compared to the
official results in the recent BioNLP'09 Shared Task. Our framework now achieves state-of-the-art performance with 37.43 recall, 54.81 precision and
44.48 F-score. We further present the first study of feature selection for bio-molecular event extraction from text. While producing more cost-effective
models, feature selection can also lead to a better insight into the complexity of the challenge. Finally, this paper tries to bridge the gap between
theoretical relation extraction from text and experimental work on bio-molecular interactions by discussing interesting opportunities to employ
event-based text mining tools for real-life tasks such as hypothesis generation, database curation and knowledge discovery.
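As a quick consistency check on the figures quoted above, the F-score can be recomputed from the reported recall and precision. The sketch below assumes the score is the balanced F1 measure (the harmonic mean of precision and recall), which the abstract does not state explicitly; the function name `f_score` is illustrative and not taken from the paper.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F-score (F1): harmonic mean of precision and recall.

    Assumption: the abstract's F-score is F1; inputs are percentages.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract (percentages).
reported_precision = 54.81
reported_recall = 37.43

print(round(f_score(reported_precision, reported_recall), 2))
```

Under that assumption, the recomputed value agrees with the 44.48 F-score given in the abstract.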
Van Landeghem, S., De Baets, B., Van de Peer, Y., Saeys, Y. (2011) High-precision bio-molecular event extraction from text using parallel binary classifiers. Comput. Intell. 27(4):645-664.