Federated Learning in Computational Toxicology: An Industrial Perspective on the Effiris Hackathon

Davide Bassani; Alessandro Brigo; Andrea Andrews-Morger

doi:10.1021/acs.chemrestox.3c00137

Federated Learning in Computational Toxicology: An Industrial Perspective on the Effiris Hackathon

Chem Res Toxicol. 2023 Sep 18;36(9):1503-1517. doi: 10.1021/acs.chemrestox.3c00137. Epub 2023 Aug 16.

Authors

Davide Bassani¹, Alessandro Brigo¹, Andrea Andrews-Morger¹

Affiliation

¹ Pharmaceutical Research & Early Development, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., 4070 Basel, Switzerland.

Abstract

In silico approaches have acquired a towering role in pharmaceutical research and development, allowing laboratories all around the world to design, create, and optimize novel molecular entities with unprecedented efficiency. From a toxicological perspective, computational methods have guided the choices of medicinal chemists toward compounds displaying improved safety profiles. Even if the recent advances in the field are significant, many challenges remain active in the on-target and off-target prediction fields. Machine learning methods have shown their ability to identify molecules with safety concerns. However, they strongly depend on the abundance and diversity of data used for their training. Sharing such information among pharmaceutical companies remains extremely limited due to confidentiality reasons, but in this scenario, a recent concept named "federated learning" can help overcome such concerns. Within this framework, it is possible for companies to contribute to the training of common machine learning algorithms, using, but not sharing, their proprietary data. Very recently, Lhasa Limited organized a hackathon involving several industrial partners in order to assess the performance of their federated learning platform, called "Effiris". In this paper, we share our experience as Roche in participating in such an event, evaluating the performance of the federated algorithms and comparing them with those coming from our in-house-only machine learning models. Our aim is to highlight the advantages of federated learning and its intrinsic limitations and also suggest some points for potential improvements in the method.

MeSH terms

Algorithms*
Animals
Laboratories
Machine Learning
Pharmaceutical Preparations
Spiders*

Substances

Pharmaceutical Preparations