Protein sequence analysis in the context of drug repurposing

BMC Med Inform Decis Mak. 2024 May 13;24(1):122. doi: 10.1186/s12911-024-02531-1.

Abstract

Motivation: Drug repurposing speeds up the development of new treatments, being less costly, risky, and time consuming than de novo drug discovery. There are numerous biological elements that contribute to the development of diseases and, as a result, to the repurposing of drugs.

Methods: In this article, we analysed the potential role of protein sequences in drug repurposing scenarios. For this purpose, we embedded the protein sequences by performing four state of the art methods and validated their capacity to encapsulate essential biological information through visualization. Then, we compared the differences in sequence distance between protein-drug target pairs of drug repurposing and non - drug repurposing data. Thus, we were able to uncover patterns that define protein sequences in repurposing cases.

Results: We found statistically significant sequence distance differences between protein pairs in the repurposing data and the rest of protein pairs in non-repurposing data. In this manner, we verified the potential of using numerical representations of sequences to generate repurposing hypotheses in the future.

Keywords: Drug repurposing; Embedding vectors; Protein sequences; Sequence analysis.

MeSH terms

  • Drug Repositioning*
  • Humans
  • Sequence Analysis, Protein