A method for WD40 repeat detection and secondary structure prediction

PLoS One. 2013 Jun 11;8(6):e65705. doi: 10.1371/journal.pone.0065705. Print 2013.

Abstract

WD40-repeat proteins (WD40s), as one of the largest protein families in eukaryotes, play vital roles in assembling protein-protein/DNA/RNA complexes. WD40s fold into similar β-propeller structures despite diversified sequences. A program WDSP (WD40 repeat protein Structure Predictor) has been developed to accurately identify WD40 repeats and predict their secondary structures. The method is designed specifically for WD40 proteins by incorporating both local residue information and non-local family-specific structural features. It overcomes the problem of highly diversified protein sequences and variable loops. In addition, WDSP achieves a better prediction in identifying multiple WD40-domain proteins by taking the global combination of repeats into consideration. In secondary structure prediction, the average Q3 accuracy of WDSP in jack-knife test reaches 93.7%. A disease related protein LRRK2 was used as a representive example to demonstrate the structure prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Databases, Protein
  • Humans
  • Membrane Proteins
  • Neoplasm Proteins
  • Nerve Tissue Proteins
  • Protein Structure, Secondary
  • Proteins / chemistry*

Substances

  • LRRN1 protein, human
  • Membrane Proteins
  • Neoplasm Proteins
  • Nerve Tissue Proteins
  • Proteins

Associated data

  • PDB/2J04
  • PDB/3EI4

Grants and funding

Financial support by the National Science Foundation of China (21133002, http://www.nsfc.gov.cn/e_nsfc/desktop/zn/0101.htm), the Shenzhen Peacock Program, and Peking University Shenzhen Graduate School is acknowledged. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.