Gene prioritization of resistant rice gene against Xanthomas oryzae pv. oryzae by using text mining technologies

Biomed Res Int. 2013:2013:853043. doi: 10.1155/2013/853043. Epub 2013 Nov 25.

Abstract

To effectively assess the possibility of the unknown rice protein resistant to Xanthomonas oryzae pv. oryzae, a hybrid strategy is proposed to enhance gene prioritization by combining text mining technologies with a sequence-based approach. The text mining technique of term frequency inverse document frequency is used to measure the importance of distinguished terms which reflect biomedical activity in rice before candidate genes are screened and vital terms are produced. Afterwards, a built-in classifier under the chaos games representation algorithm is used to sieve the best possible candidate gene. Our experiment results show that the combination of these two methods achieves enhanced gene prioritization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Data Mining*
  • Genetic Association Studies
  • Oryza / genetics*
  • Oryza / microbiology
  • Xanthomonas / genetics
  • Xanthomonas / pathogenicity*