From Protein Sequence to Structure: The Next Frontier in Cross-Species Extrapolation for Chemical Safety Evaluations

Environ Toxicol Chem. 2023 Feb;42(2):463-474. doi: 10.1002/etc.5537. Epub 2023 Jan 16.

Abstract

Computational screening for potentially bioactive molecules using advanced molecular modeling approaches including molecular docking and molecular dynamic simulation is mainstream in certain fields like drug discovery. Significant advances in computationally predicting protein structures from sequence information have also expanded the availability of structures for nonmodel species. Therefore, the objective of the present study was to develop an analysis pipeline to harness the power of these bioinformatics approaches for cross-species extrapolation for evaluating chemical safety. The Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) tool compares protein-sequence similarity across species for conservation of known chemical targets, providing an initial line of evidence for extrapolation of toxicity knowledge. However, with the development of structural models from tools like the Iterative Threading ASSEmbly Refinement (ITASSER), analyses of protein structural conservation can be included to add further lines of evidence and generate protein models across species. Models generated through such a pipeline could then be used for advanced molecular modeling approaches in the context of species extrapolation. Two case examples illustrating this pipeline from SeqAPASS sequences to I-TASSER-generated protein structures were created for human liver fatty acid-binding protein (LFABP) and androgen receptor (AR). Ninety-nine LFABP and 268 AR protein models representing diverse species were generated and analyzed for conservation using template modeling (TM)-align. The results from the structural comparisons were in line with the sequence-based SeqAPASS workflow, adding further evidence of LFABL and AR conservation across vertebrate species. The present study lays the foundation for expanding the capabilities of the web-based SeqAPASS tool to include structural comparisons for species extrapolation, facilitating more rapid and efficient toxicological assessments among species with limited or no existing toxicity data. Environ Toxicol Chem 2023;42:463-474. © 2022 SETAC. This article has been contributed to by U.S. Government employees and their work is in the public domain in the USA.

Keywords: Predictive toxicology; SeqAPASS; bioinformatics; computational toxicology; protein structure; species extrapolation.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Chemical Safety*
  • Humans
  • Molecular Docking Simulation
  • Molecular Dynamics Simulation
  • Proteins / chemistry

Substances

  • Proteins