SCMMTP: identifying and characterizing membrane transport proteins using propensity scores of dipeptides

BMC Genomics. 2015;16 Suppl 12(Suppl 12):S6. doi: 10.1186/1471-2164-16-S12-S6. Epub 2015 Dec 9.

Abstract

Background: Identifying putative membrane transport proteins (MTPs) and understanding the transport mechanisms involved remain important challenges for the advancement of structural and functional genomics. However, the transporter characters are mainly acquired from MTP crystal structures which are hard to crystalize. Therefore, it is desirable to develop bioinformatics tools for the effective large-scale analysis of available sequences to identify novel transporters and characterize such transporters.

Results: This work proposes a novel method (SCMMTP) based on the scoring card method (SCM) using dipeptide composition to identify and characterize MTPs from an existing dataset containing 900 MTPs and 660 non-MTPs which are separated into a training dataset consisting 1,380 proteins and an independent dataset consisting 180 proteins. The SCMMTP produced estimating propensity scores for amino acids and dipeptides as MTPs. The SCMMTP training and test accuracy levels respectively reached 83.81% and 76.11%. The test accuracy of support vector machine (SVM) using a complicated classification method with a low possibility for biological interpretation and position-specific substitution matrix (PSSM) as a protein feature is 80.56%, thus SCMMTP is comparable to SVM-PSSM. To identify MTPs, SCMMTP is applied to three datasets including: 1) human transmembrane proteins, 2) a photosynthetic protein dataset, and 3) a human protein database. MTPs showing α-helix rich structure is agreed with previous studies. The MTPs used residues with low hydration energy. It is hypothesized that, after filtering substrates, the hydrated water molecules need to be released from the pore regions.

Conclusions: SCMMTP yields estimating propensity scores for amino acids and dipeptides as MTPs, which can be used to identify novel MTPs and characterize transport mechanisms for use in further experiments.

Availability: http://iclab.life.nctu.edu.tw/iclab_webtools/SCMMTP/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Computational Biology / methods*
  • Computers, Molecular
  • Databases, Protein
  • Dipeptides / chemistry*
  • Humans
  • Membrane Transport Proteins / chemistry*
  • Membrane Transport Proteins / metabolism*
  • Models, Molecular
  • Propensity Score
  • Protein Structure, Secondary

Substances

  • Amino Acids
  • Dipeptides
  • Membrane Transport Proteins