Annotation of Peptide Structures Using SMILES and Other Chemical Codes-Practical Solutions

Molecules. 2017 Nov 27;22(12):2075. doi: 10.3390/molecules22122075.

Abstract

Contemporary peptide science exploits methods and tools of bioinformatics, and cheminformatics. These approaches use different languages to describe peptide structures-amino acid sequences and chemical codes (especially SMILES), respectively. The latter may be applied, e.g., in comparative studies involving structures and properties of peptides and peptidomimetics. Progress in peptide science "in silico" may be achieved via better communication between biologists and chemists, involving the translation of peptide representation from amino acid sequence into SMILES code. Recent recommendations concerning good practice in chemical information include careful verification of data and their annotation. This publication discusses the generation of SMILES representations of peptides using existing software. Construction of peptide structures containing unnatural and modified amino acids (with special attention paid on glycosylated peptides) is also included. Special attention is paid to the detection and correction of typical errors occurring in SMILES representations of peptides and their correction using molecular editors. Brief recommendations for training of staff working on peptide annotations, are discussed as well.

Keywords: SMILES code; bioinformatics; chemical information; chemical modifications; cheminformatics; glycopeptides; good practice; molecular editors; peptides.

Publication types

  • Review

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry*
  • Computational Biology / methods*
  • Glycosylation
  • Molecular Structure
  • Peptides / chemistry*
  • Peptidomimetics / chemistry*
  • Software
  • Structure-Activity Relationship

Substances

  • Amino Acids
  • Peptides
  • Peptidomimetics