The LAILAPS search engine: relevance ranking in life science databases

Matthias Lange; Karl Spies; Joachim Bargsten; Gregor Haberhauer; Matthias Klapperstück; Michael Leps; Christian Weinel; Röbbe Wünschiers; Mandy Weissbach; Jens Stein; Uwe Scholz

doi:10.2390/biecoll-jib-2010-110

J Integr Bioinform. 2010 Jan 15;7(2):110. doi: 10.2390/biecoll-jib-2010-110.

Authors

Matthias Lange¹, Karl Spies, Joachim Bargsten, Gregor Haberhauer, Matthias Klapperstück, Michael Leps, Christian Weinel, Röbbe Wünschiers, Mandy Weissbach, Jens Stein, Uwe Scholz

Affiliation

¹ Research Group Bioinformatics and Information Technology, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany. lange@ipk-gatersleben.de

PMID: 20134080
DOI: 10.2390/biecoll-jib-2010-110

Abstract

Search engines and retrieval systems are popular tools on the life science desktop. Manually inspecting hundreds of database entries that reflect a life science concept or fact is time-intensive daily work. What matters here is not the number of query results but their relevance. In this paper, we present the LAILAPS search engine for life science databases. The concept is to combine a novel feature model for relevance ranking, a machine learning approach to model user relevance profiles, ranking improvement by user feedback tracking, and an intuitive and slim web user interface that estimates relevance rank by tracking user interactions. Queries are formulated as simple keyword lists and are expanded by synonyms. Because it supports a flexible text index and a simple data import format, LAILAPS can easily be used both as a search engine for comprehensive integrated life science databases and for small in-house project databases. From a set of features extracted from each database hit, combined with user relevance preferences, a neural network predicts user-specific relevance scores. Using either expert knowledge as training data for a predefined neural network or users' own relevance training sets, a reliable relevance ranking of database hits has been implemented. We describe the LAILAPS system, its concepts, benchmarks and use cases. LAILAPS is publicly available for SWISSPROT data at http://lailaps.ipk-gatersleben.de.
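
The pipeline outlined in the abstract (keyword query, synonym expansion, per-hit feature extraction, learned relevance score) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the synonym table, the two features, the weights, and the single logistic unit standing in for a trained neural network are hypothetical and are not taken from LAILAPS itself.

```python
import math

# Hypothetical synonym table; the real synonym source is not specified here.
SYNONYMS = {"kinase": ["phosphotransferase"]}


def expand_query(keywords):
    """Return lowercased keywords plus known synonyms, without duplicates."""
    expanded = []
    for kw in keywords:
        key = kw.lower()
        for term in [key] + SYNONYMS.get(key, []):
            if term not in expanded:
                expanded.append(term)
    return expanded


def extract_features(entry, terms):
    """Illustrative features: fraction of query terms found in the title
    and in the description of a database entry."""
    if not terms:
        return [0.0, 0.0]
    title = entry["title"].lower()
    text = entry["description"].lower()
    n = len(terms)
    return [
        sum(t in title for t in terms) / n,
        sum(t in text for t in terms) / n,
    ]


def relevance_score(features, weights, bias):
    """Single logistic unit used as a stand-in for a trained neural network."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def rank(entries, keywords, weights=(2.0, 1.0), bias=-1.0):
    """Return entry ids ordered by descending predicted relevance.
    The default weights are made up; a real system would learn them
    from expert or user-provided relevance training sets."""
    terms = expand_query(keywords)
    scored = [
        (relevance_score(extract_features(e, terms), weights, bias), e["id"])
        for e in entries
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [entry_id for _, entry_id in scored]


entries = [
    {"id": "A", "title": "Hypothetical protein", "description": "Unknown function"},
    {"id": "B", "title": "Serine kinase", "description": "A phosphotransferase acting on serine"},
    {"id": "C", "title": "Transporter", "description": "Interacts with a kinase"},
]
ranking = rank(entries, ["Kinase"])
```

In this sketch, user-specific ranking would amount to keeping a separate set of weights per user profile and refitting them as relevance feedback accumulates.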

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Databases, Factual*
Information Storage and Retrieval
Search Engine / methods*
Software*
User-Computer Interface