Biomarker identification using text mining

Comput Math Methods Med. 2012:2012:135780. doi: 10.1155/2012/135780. Epub 2012 Nov 11.

Abstract

Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we proposed a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we used a finite state machine to identify the biomarkers. Our method of text mining provides a highly reliable approach to discover the biomarkers in the PubMed database.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Biomarkers / metabolism*
  • Computational Biology / methods
  • Data Mining / methods*
  • Databases, Factual
  • Genetic Diseases, Inborn / metabolism
  • Genotype
  • Humans
  • Phenotype
  • Programming Languages
  • PubMed

Substances

  • Biomarkers