Generalization of vision pre-trained models for histopathology

Sci Rep. 2023 Apr 13;13(1):6065. doi: 10.1038/s41598-023-33348-z.

Abstract

Out-of-distribution (OOD) generalization, especially in medical settings, is a key challenge in modern machine learning that has only recently received much attention. We investigate how different convolutional pre-trained models perform on OOD test data, that is, data from domains not seen during training, using histopathology repositories attributed to different trial sites. Trial-site repositories, pre-trained models, and image transformations are examined as specific aspects of pre-training. We also compare models trained entirely from scratch (i.e., without pre-training) against pre-trained models. The OOD performance of models pre-trained on natural images is examined, namely (1) a vanilla ImageNet pre-trained model and (2) semi-supervised learning (SSL) and (3) semi-weakly supervised learning (SWSL) models pre-trained on IG-1B-Targeted. In addition, the performance of a histopathology model (KimiaNet) trained on the most comprehensive histopathology dataset, TCGA, is studied. Although SSL and SWSL pre-training are conducive to better OOD performance than vanilla ImageNet pre-training, the histopathology pre-trained model is still the best overall. In terms of top-1 accuracy, we demonstrate that diversifying the training images with reasonable image transformations (see the augmentation sketch below) is effective in avoiding shortcut learning when the distribution shift is significant. In addition, explainable AI (XAI) techniques, which aim to produce high-quality, human-understandable explanations of AI decisions, are leveraged for further investigation.
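As a minimal sketch of the model comparison, the snippet below loads the natural-image pre-trained backbones in PyTorch: a vanilla ImageNet model from torchvision, plus the SSL and SWSL variants that Facebook Research publishes through torch.hub. The ResNeXt-50 architecture, the nine-class head, and the from-scratch baseline are illustrative assumptions, not the paper's exact configuration; KimiaNet weights are distributed separately and are not shown here.

    import torch
    import torch.nn as nn
    import torchvision.models as models

    NUM_CLASSES = 9  # hypothetical number of tissue classes

    # (1) Vanilla ImageNet pre-trained backbone from torchvision.
    vanilla = models.resnext50_32x4d(
        weights=models.ResNeXt50_32X4D_Weights.IMAGENET1K_V1)

    # (2) SSL and (3) SWSL backbones, loadable through torch.hub from
    # the facebookresearch/semi-supervised-ImageNet1K-models repository.
    ssl_model = torch.hub.load(
        'facebookresearch/semi-supervised-ImageNet1K-models',
        'resnext50_32x4d_ssl')
    swsl_model = torch.hub.load(
        'facebookresearch/semi-supervised-ImageNet1K-models',
        'resnext50_32x4d_swsl')

    # From-scratch baseline: same architecture, no pre-trained weights.
    scratch = models.resnext50_32x4d(weights=None)

    # Swap the 1000-way ImageNet head for a histopathology head before
    # fine-tuning; OOD evaluation then uses held-out trial sites.
    for net in (vanilla, ssl_model, swsl_model, scratch):
        net.fc = nn.Linear(net.fc.in_features, NUM_CLASSES)

Fine-tuning all four backbones under an identical protocol isolates the effect of the pre-training source on OOD accuracy.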
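The augmentation pipeline below is a hedged example of "reasonable" image transformations for histopathology tiles; the specific transforms and their parameters are assumptions, not those reported in the paper.

    from torchvision import transforms

    # Stain color and tissue orientation vary across trial sites, so
    # perturbing both during training discourages site-specific shortcuts.
    train_transform = transforms.Compose([
        transforms.RandomHorizontalFlip(),
        transforms.RandomVerticalFlip(),
        transforms.RandomRotation(degrees=90),
        transforms.ColorJitter(brightness=0.2, contrast=0.2,
                               saturation=0.2, hue=0.05),
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])

Color jitter loosely mimics inter-site stain variability, while flips and rotations exploit the fact that tissue has no canonical orientation, so a model cannot rely on color casts or orientation as shortcuts.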

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Machine Learning*
  • Neural Networks, Computer*
  • Supervised Machine Learning