Features in Backgrounds of Microscopy Images Introduce Biases in Machine Learning Analyses

David N Greenblott; Florian Johann; Jared R Snell; Henning Gieseler; Christopher P Calderon; Theodore W Randolph

doi:10.1016/j.xphs.2024.03.003

Features in Backgrounds of Microscopy Images Introduce Biases in Machine Learning Analyses

J Pharm Sci. 2024 May;113(5):1177-1189. doi: 10.1016/j.xphs.2024.03.003. Epub 2024 Mar 12.

Authors

David N Greenblott¹, Florian Johann², Jared R Snell³, Henning Gieseler⁴, Christopher P Calderon⁵, Theodore W Randolph⁶

Affiliations

¹ Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO 80303, United States.
² Department of Pharmaceutics, Friedrich Alexander University Erlangen-Nürnberg, Erlangen 91058, Germany; Merck KGaA, Darmstadt 64293, Germany.
³ EMD Serono, Billerica, MA 01821, United States.
⁴ Department of Pharmaceutics, Friedrich Alexander University Erlangen-Nürnberg, Erlangen 91058, Germany; GILYOS GmbH, Würzburg 97076, Germany.
⁵ Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO 80303, United States; Ursa Analytics, Denver, CO 80212, United States.
⁶ Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO 80303, United States. Electronic address: Theodore.Randolph@colorado.edu.

PMID: 38484874
DOI: 10.1016/j.xphs.2024.03.003

Abstract

Subvisible particles may be encountered throughout the processing of therapeutic protein formulations. Flow imaging microscopy (FIM) and backgrounded membrane imaging (BMI) are techniques commonly used to record digital images of these particles, which may be analyzed to provide particle size distributions, concentrations, and identities. Although both techniques record digital images of particles within a sample, FIM analyzes particles suspended in flowing liquids, whereas BMI records images of dry particles after collection by filtration onto a membrane. This study compared the performance of convolutional neural networks (CNNs) in classifying images of subvisible particles recorded by both imaging techniques. Initially, CNNs trained on BMI images appeared to provide higher classification accuracies than those trained on FIM images. However, attribution analyses showed that classification predictions from CNNs trained on BMI images relied on features contributed by the membrane background, whereas predictions from CNNs trained on FIM features were based largely on features of the particles. Segmenting images to minimize the contributions from image backgrounds reduced the apparent accuracy of CNNs trained on BMI images but caused minimal reduction in the accuracy of CNNs trained on FIM images. Thus, the seemingly superior classification accuracy of CNNs trained on BMI images compared to FIM images was an artifact caused by subtle features in the backgrounds of BMI images. Our findings emphasize the importance of examining machine learning algorithms for image analysis with attribution methods to ensure the robustness of trained models and to mitigate potential influence of artifacts within training data sets.

Keywords: Image analysis; Machine learning; Monoclonal antibodies; Neural networks; Protein aggregation.

MeSH terms

Algorithms
Bias
Machine Learning*
Microscopy*
Neural Networks, Computer