Identifying associations between amino acid changes and meta information in alignments

Bioinformatics. 2011 Oct 15;27(20):2782-9. doi: 10.1093/bioinformatics/btr476. Epub 2011 Aug 16.

Abstract

Motivation: We present a method that identifies associations between amino acid changes in potentially significant sites in an alignment (taking into account several amino acid properties) with phenotypic data, through the phylogenetic mixed model. The latter accounts for the dependency of the observations (organisms). It is known from previous studies that the pathogenic aspect of many organisms may be associated with a single or just few changes in amino acids, which have a strong structural and/or functional impact on the protein. Discovering these sites is a big step toward understanding pathogenicity. Our method is able to discover such sites in proteins responsible for the pathogenic character of a group of bacteria.

Results: We use our method to predict potentially significant sites in the RpoS protein from a set of 209 bacteria. Several sites with significant differences in biological relevant regions were found.

Availability: Our tool is publicly available on the CRAN network at http://cran.r-project.org/

Contact: naya@pasteur.edu.uy

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Genomics / methods
  • Linear Models
  • Phylogeny
  • Proteins / chemistry
  • Proteins / genetics
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein*
  • Sigma Factor / chemistry
  • Sigma Factor / genetics

Substances

  • Bacterial Proteins
  • Proteins
  • Sigma Factor
  • sigma factor KatF protein, Bacteria