Discovering essential domains in essential genes

Methods Mol Biol. 2015:1279:235-45. doi: 10.1007/978-1-4939-2398-4_15.

Abstract

Genes with indispensable functions are identified as essential; however, the traditional gene-level perspective of essentiality has several limitations. We hypothesized that protein domains, the independent structural or functional units of a polypeptide chain, are responsible for gene essentiality. If the essentiality of domains is known, the essential genes could be identified. To find such essential domains, we have developed an EM algorithm-based Essential Domain Prediction (EDP) Model. With simulated datasets, the model provided convergent results given different initial values and offered accurate predictions even with noise. We then applied the EDP model to six microbes and predicted 3,450 domains to be essential in at least one species, ranging 8-24 % in each species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria / genetics
  • Computer Simulation
  • Databases, Genetic
  • Fungi / genetics
  • Genes, Bacterial
  • Genes, Essential*
  • Genes, Fungal
  • Genomics / methods*
  • Models, Theoretical