Chameleon sequences in neurodegenerative diseases

Biochem Biophys Res Commun. 2016 Mar 25;472(1):209-16. doi: 10.1016/j.bbrc.2016.01.187. Epub 2016 Feb 23.

Abstract

Chameleon sequences can adopt an alpha-helix, a beta-sheet, or a coil conformation. Defining chameleon sequences in the PDB (Protein Data Bank) may yield insight into the peptides and proteins responsible for neurodegeneration. In this research, we took advantage of the large PDB and performed a sequence analysis of chameleons, for which we developed an algorithm to extract peptide segments with identical sequences but different structures. To find new chameleon sequences, we extracted from the PDB a set of 8315 non-redundant protein sequences with less than 25% sequence identity. Our data were classified into "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then identified the proteins with the largest number of chameleon sequences in our dataset and named them Chameleon Flexible Proteins (CFPs). Our data revealed that Gly, Val, Ile, Tyr and Phe are the major amino acids in chameleons. We also found that proteins such as insulin-degrading enzyme (IDE) and GTP-binding nuclear protein Ran (RAN) have the largest numbers of chameleons (640 and 405, respectively). These proteins have known roles in neurodegenerative diseases. Therefore, it can be inferred that other CFPs can serve as key proteins in neurodegeneration, and studying them can shed light on curing and preventing neurodegenerative diseases.
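The abstract does not give the details of the extraction algorithm, so the following is only a minimal Python sketch of the general idea: finding identical peptide segments that occur in different secondary structures, labeling them HE, HC or CE, and counting singlet and doublet residues. The function names, the input format (a dictionary of sequence and secondary-structure strings), the window length `k`, the requirement that a window be uniformly one state, and the reduction of DSSP-style codes to three states are all assumptions made for illustration, not the authors' published method.

```python
from collections import Counter, defaultdict

# Assumed three-state reduction of DSSP-style codes (a common convention,
# not necessarily the one used in the paper).
HELIX, STRAND = set("HGI"), set("EB")

# Pair labels as named in the abstract.
PAIR_LABELS = {
    frozenset("HE"): "HE",  # helix to strand
    frozenset("HC"): "HC",  # helix to coil
    frozenset("EC"): "CE",  # strand to coil
}


def reduce_states(ss):
    """Map a secondary-structure string to H (helix), E (strand) or C (coil)."""
    return "".join(
        "H" if c in HELIX else "E" if c in STRAND else "C" for c in ss
    )


def find_chameleons(chains, k=6):
    """Return {peptide: [labels]} for k-mers seen in more than one state.

    chains: hypothetical input, {chain_id: (sequence, ss_string)} with both
    strings of equal length. Only windows that are uniformly one state count.
    """
    states_by_peptide = defaultdict(set)
    for seq, ss in chains.values():
        ss3 = reduce_states(ss)
        for i in range(len(seq) - k + 1):
            window = ss3[i:i + k]
            if len(set(window)) == 1:
                states_by_peptide[seq[i:i + k]].add(window[0])
    result = {}
    for peptide, states in states_by_peptide.items():
        labels = [lab for pair, lab in PAIR_LABELS.items() if pair <= states]
        if labels:
            result[peptide] = labels
    return result


def composition(peptides):
    """Count single residues and adjacent residue pairs across peptides."""
    singlets, doublets = Counter(), Counter()
    for p in peptides:
        singlets.update(p)
        doublets.update(p[i:i + 2] for i in range(len(p) - 1))
    return singlets, doublets


if __name__ == "__main__":
    # Toy chains invented for this example; not real PDB entries.
    toy = {
        "a": ("AAVLKGIVE", "CHHHHHHCC"),
        "b": ("GGVLKGIVT", "CEEEEEECC"),
    }
    hits = find_chameleons(toy, k=4)
    print(hits)
    print(composition(hits))
```

In this sketch a peptide observed in all three states receives all three labels. Ranking proteins by the number of chameleon segments they contain, as done for the CFPs, would additionally require tracking which chain each window came from.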

Keywords: Chameleon sequences; Enrichment analysis; Neurodegenerative diseases; Protein secondary structure; Sequence properties.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Databases, Protein
  • Dipeptides / chemistry
  • Dipeptides / genetics
  • Humans
  • Neurodegenerative Diseases / etiology
  • Neurodegenerative Diseases / genetics*
  • Neurodegenerative Diseases / metabolism*
  • Protein Conformation
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Proteins / genetics*

Substances

  • Amino Acids
  • Dipeptides
  • Proteins