How Open Data Shapes In Silico Transporter Modeling

Floriane Montanari; Barbara Zdrazil

doi:10.3390/molecules22030422

Molecules. 2017 Mar 7;22(3):422. doi: 10.3390/molecules22030422.

Authors

Floriane Montanari¹, Barbara Zdrazil²

Affiliations

¹ Pharmacoinformatics Research Group, Department of Pharmaceutical Chemistry, University of Vienna, A-1090 Vienna, Austria. floriane.montanari@univie.ac.at.
² Pharmacoinformatics Research Group, Department of Pharmaceutical Chemistry, University of Vienna, A-1090 Vienna, Austria. barbara.zdrazil@univie.ac.at.

Abstract

Bioactivity and related data for chemical compounds are now readily available from open data sources and the open medicinal chemistry literature for many transmembrane proteins. Computational ligand-based modeling of transporters has therefore shifted from local (quantitative) models to more global, qualitative, predictive models. As data sets grow in size and heterogeneity, careful data curation becomes even more important. This includes, for example, not only tailored cutoffs for generating binary classes, but also a proper assessment of the applicability domain. Powerful machine learning approaches (such as multi-label classification) now allow the simultaneous prediction of multiple related targets. However, the more complex these models become, the less interpretable they are. We emphasize that transmembrane transporters are a peculiar class of proteins, some of which act as off-targets rather than as genuine drug targets. Thus, careful selection of the right modeling technique is important, as is cautious interpretation of the results. We hope that, as more data become available, we will be able to improve and refine our models, coming closer to function elucidation and the development of safer medicines.

Keywords: applicability domain; computational modeling; data curation; machine learning; multi-label classification; open data; transport proteins.
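
The curation steps named in the abstract (per-target cutoffs for binary classes, a multi-label target matrix, and an applicability domain check) can be sketched roughly as follows. This is a minimal illustration only, not the authors' workflow: the target names, the cutoff values, the function names, the use of IC50 in micromolar, and the nearest-neighbor Tanimoto threshold of 0.3 are all assumptions chosen for the example.

```python
from typing import Dict, List, Optional, Set

# Hypothetical per-target activity cutoffs in micromolar (assumed values,
# not taken from the review). Each target gets its own tailored cutoff.
CUTOFFS_UM: Dict[str, float] = {"transporter_A": 10.0, "transporter_B": 20.0}


def binarize(ic50_um: Optional[float], cutoff_um: float) -> Optional[int]:
    """Return 1 (active) if IC50 <= cutoff, 0 (inactive) otherwise.

    None is kept for unmeasured compound-target pairs, so that missing
    data are not silently treated as inactive.
    """
    if ic50_um is None:
        return None
    return 1 if ic50_um <= cutoff_um else 0


def label_matrix(
    records: Dict[str, Dict[str, float]],
    cutoffs: Dict[str, float],
) -> Dict[str, List[Optional[int]]]:
    """Build a multi-label matrix: one row per compound, one column per target.

    Columns follow the alphabetical order of the target names.
    """
    targets = sorted(cutoffs)
    return {
        compound: [binarize(values.get(t), cutoffs[t]) for t in targets]
        for compound, values in records.items()
    }


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto similarity between two fingerprints given as sets of on-bits."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def in_domain(
    query: Set[int],
    training: List[Set[int]],
    min_similarity: float = 0.3,  # assumed threshold, to be tuned per data set
) -> bool:
    """Crude applicability domain check: the query counts as in-domain if at
    least one training compound reaches the similarity threshold."""
    return any(tanimoto(query, fp) >= min_similarity for fp in training)


if __name__ == "__main__":
    records = {
        "cpd1": {"transporter_A": 2.0, "transporter_B": 50.0},
        "cpd2": {"transporter_A": 30.0},  # not measured on transporter_B
    }
    print(label_matrix(records, CUTOFFS_UM))
    print(in_domain({1, 2, 3}, [{2, 3, 4}]))
```

Keeping unmeasured pairs as None rather than 0 is a design choice: a multi-label learner can then mask those entries instead of learning from assumed inactives. Set-based fingerprints are used here only to keep the sketch free of external dependencies.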

Publication types

Review

MeSH terms

Carrier Proteins / chemistry*
Carrier Proteins / metabolism
Computational Biology / methods
Computer Simulation*
Databases, Protein
Ligands
Models, Molecular*
Protein Binding
Quantitative Structure-Activity Relationship

Substances

Carrier Proteins
Ligands

Grants and funding

P 29712/FWF_/Austrian Science Fund FWF/Austria