Strategies for annotation and curation of translational databases: the eTUMOUR project

Database (Oxford). 2012 Nov 22:2012:bas035. doi: 10.1093/database/bas035. Print 2012.

Abstract

The eTUMOUR (eT) multi-centre project gathered in vivo and ex vivo magnetic resonance (MR) data, as well as transcriptomic and clinical information from brain tumour patients, with the purpose of improving the diagnostic and prognostic evaluation of future patients. In order to carry this out, among other work, a database--the eTDB--was developed. In addition to complex permission rules and software and management quality control (QC), it was necessary to develop anonymization, processing and data visualization tools for the data uploaded. It was also necessary to develop sophisticated curation strategies that involved on one hand, dedicated fields for QC-generated meta-data and specialized queries and global permissions for senior curators and on the other, to establish a set of metrics to quantify its contents. The indispensable dataset (ID), completeness and pairedness indices were set. The database contains 1317 cases created as a result of the eT project and 304 from a previous project, INTERPRET. The number of cases fulfilling the ID was 656. Completeness and pairedness were heterogeneous, depending on the data type involved.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / methods*
  • Database Management Systems*
  • Databases as Topic*
  • Humans
  • Internet
  • Magnetic Resonance Imaging
  • Metric System
  • Neoplasms / pathology*
  • Quality Control
  • Reproducibility of Results
  • Spectrum Analysis
  • Translational Research, Biomedical*
  • User-Computer Interface