Rational identification of aggregation hotspots based on secondary structure and amino acid hydrophobicity

Sci Rep. 2017 Aug 25;7(1):9558. doi: 10.1038/s41598-017-09749-2.

Abstract

Insolubility of proteins expressed in the Escherichia coli expression system hinders the progress of both basic and applied research. Insoluble proteins contain residues that decrease their solubility (aggregation hotspots). Mutating these hotspots to optimal amino acids is expected to improve protein solubility. To date, however, the identification of these hotspots has proven difficult. In this study, using a combination of approaches involving directed evolution and primary sequence analysis, we found two rules to help inductively identify hotspots: the α-helix rule, which focuses on the hydrophobicity of amino acids in the α-helix structure, and the hydropathy contradiction rule, which focuses on the difference in hydrophobicity relative to the corresponding amino acid in the consensus protein. By properly applying these two rules, we succeeded in improving the probability that expressed proteins would be soluble. Our methods should facilitate research on various insoluble proteins that were previously difficult to study due to their low solubility.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs*
  • Amino Acids / chemistry*
  • Hydrophobic and Hydrophilic Interactions
  • Models, Molecular
  • Mutagenesis, Site-Directed
  • Protein Aggregates
  • Protein Binding
  • Protein Interaction Domains and Motifs*
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteins / metabolism
  • Solubility
  • Structure-Activity Relationship

Substances

  • Amino Acids
  • Protein Aggregates
  • Proteins