Histology segmentation using active learning on regions of interest in oral cavity squamous cell carcinoma

Jonathan Folmsbee; Lei Zhang; Xulei Lu; Jawaria Rahman; John Gentry; Brendan Conn; Marilena Vered; Paromita Roy; Ruta Gupta; Diana Lin; Shabnam Samankan; Pooja Dhorajiva; Anu Peter; Minhua Wang; Anna Israel; Margaret Brandwein-Weber; Scott Doyle

doi:10.1016/j.jpi.2022.100146

Histology segmentation using active learning on regions of interest in oral cavity squamous cell carcinoma

J Pathol Inform. 2022 Sep 27:13:100146. doi: 10.1016/j.jpi.2022.100146. eCollection 2022.

Authors

Jonathan Folmsbee^{1

2}, Lei Zhang¹, Xulei Lu³, Jawaria Rahman⁴, John Gentry⁵, Brendan Conn⁶, Marilena Vered^{7

8}, Paromita Roy⁹, Ruta Gupta¹⁰, Diana Lin¹¹, Shabnam Samankan¹², Pooja Dhorajiva¹³, Anu Peter¹⁴, Minhua Wang¹⁵, Anna Israel¹⁶, Margaret Brandwein-Weber³, Scott Doyle^{1

2}

Affiliations

¹ Department of Pathology & Anatomical Sciences, University at Buffalo SUNY, Buffalo, NY, USA.
² Department of Biomedical Engineering, University at Buffalo SUNY, Buffalo, NY, USA.
³ Icahn School of Medicine, The Mount Sinai Hospital, New York, NY, USA.
⁴ Department of Pathology, Case Western University, Cleveland, OH, USA.
⁵ Department of Pathology, Nebraska Medical Health System, Omaha, NE, USA.
⁶ Department of Pathology, University of Edinburgh, Edinburgh, UK.
⁷ Department of Oral Pathology, Oral Medicine and Maxillofacial Imaging, School of Dental Medicine, Tel Aviv University, Tel Aviv, IL, USA.
⁸ Institute of Pathology, Sheba Medical Center, Tel Hashomer, Ramat Gan, IL, USA.
⁹ Department of Pathology, Tata Memorial Cancer Center, Mumbai, IN, USA.
¹⁰ Department of Tissue Pathology and Diagnostic Oncology, NSW Health Pathology, Royal Prince Alfred Hospital and University of Sydney, Sydney, AU, USA.
¹¹ Department of Pathology, The University of Alabama at Birmingham, Birmingham, AL, USA.
¹² Department of Pathology, George Washington University Hospital, Washington, DC, USA.
¹³ Department of Oncologic Surgical Pathology, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
¹⁴ Department of Pathology, University of Pennsylvania, Philadelphia, PA, USA.
¹⁵ Department of Pathology, Yale University School of Medicine, New Haven, CT, USA.
¹⁶ Department of Anatomic Pathology, Robert J. Tomsich Pathology and Laboratory Medicine Institute, Cleveland Clinic, Cleveland, OH, USA.

Abstract

In digital pathology, deep learning has been shown to have a wide range of applications, from cancer grading to segmenting structures like glomeruli. One of the main hurdles for digital pathology to be truly effective is the size of the dataset needed for generalization to address the spectrum of possible morphologies. Small datasets limit classifiers' ability to generalize. Yet, when we move to larger datasets of whole slide images (WSIs) of tissue, these datasets may cause network bottlenecks as each WSI at its original magnification can be upwards of 100 000 by 100 000 pixels, and over a gigabyte in file size. Compounding this problem, high quality pathologist annotations are difficult to obtain, as the volume of necessary annotations to create a classifier that can generalize would be extremely costly in terms of pathologist-hours. In this work, we use Active Learning (AL), a process for iterative interactive training, to create a modified U-net classifier on the region of interest (ROI) scale. We then compare this to Random Learning (RL), where images for addition to the dataset for retraining are randomly selected. Our hypothesis is that AL shows benefits for generating segmentation results versus randomly selecting images to annotate. We show that after 3 iterations, that AL, with an average Dice coefficient of 0.461, outperforms RL, with an average Dice Coefficient of 0.375, by 0.086.

Keywords: Active learning; Computational pathology; Digital pathology; Oral cavity cancer; Region of interest; Semantic segmentation; U-net; Whole slide imaging.

Abstract

Grants and funding