Computer-Aided Chemotaxonomy and Bioprospecting Study of Diterpenes of the Lamiaceae Family

Molecules. 2019 Oct 30;24(21):3908. doi: 10.3390/molecules24213908.

Abstract

Lamiaceae is one of the largest families of angiosperms and is classified into 12 subfamilies that are composed of 295 genera and 7775 species. It presents a variety of secondary metabolites such as diterpenes that are commonly found in their species, and some of them are known to be chemotaxonomic markers. The aim of this work was to construct a database of diterpenes and to use it to perform a chemotaxonomic analysis among the subfamilies of Lamiaceae, using molecular descriptors and self-organizing maps (SOMs). The 4115 different diterpenes corresponding to 6386 botanical occurrences, which are distributed in eight subfamilies, 66 genera, 639 different species and 4880 geographical locations, were added to SistematX. Molecular descriptors of diterpenes and their respective botanical occurrences were used to generate the SOMs. In all obtained maps, a match rate higher than 80% was observed, demonstrating a separation of the Lamiaceae subfamilies, corroborating with the morphological and molecular data proposed by Li et al. Therefore, through this chemotaxonomic study, we can predict the localization of a diterpene in a subfamily and assist in the search for secondary metabolites with specific structural characteristics, such as compounds with potential biological activity.

Keywords: Lamiaceae; SOMs; chemotaxonomic; database; diterpenes.

MeSH terms

  • Algorithms
  • Bioprospecting*
  • Databases as Topic
  • Diterpenes / chemistry
  • Diterpenes / classification*
  • Lamiaceae / classification*
  • Machine Learning
  • Phylogeny

Substances

  • Diterpenes