Development and validation of a deep learning model for detection of breast cancers in mammography from multi-institutional datasets

Daiju Ueda; Akira Yamamoto; Naoyoshi Onoda; Tsutomu Takashima; Satoru Noda; Shinichiro Kashiwagi; Tamami Morisaki; Shinya Fukumoto; Masatsugu Shiba; Mina Morimura; Taro Shimono; Ken Kageyama; Hiroyuki Tatekawa; Kazuki Murai; Takashi Honjo; Akitoshi Shimazaki; Daijiro Kabata; Yukio Miki

doi:10.1371/journal.pone.0265751

Development and validation of a deep learning model for detection of breast cancers in mammography from multi-institutional datasets

PLoS One. 2022 Mar 24;17(3):e0265751. doi: 10.1371/journal.pone.0265751. eCollection 2022.

Authors

Daiju Ueda¹, Akira Yamamoto¹, Naoyoshi Onoda², Tsutomu Takashima², Satoru Noda², Shinichiro Kashiwagi², Tamami Morisaki^{2

3}, Shinya Fukumoto³, Masatsugu Shiba⁴, Mina Morimura⁵, Taro Shimono¹, Ken Kageyama¹, Hiroyuki Tatekawa¹, Kazuki Murai¹, Takashi Honjo¹, Akitoshi Shimazaki¹, Daijiro Kabata⁶, Yukio Miki¹

Affiliations

¹ Department of Diagnostic and Interventional Radiology, Graduate School of Medicine, Osaka City University, Osaka, Japan.
² Department of Breast and Endocrine Surgery, Graduate School of Medicine, Osaka City University, Osaka, Japan.
³ Department of Premier Preventive Medicine, Graduate School of Medicine, Osaka City University, Osaka, Japan.
⁴ Department of Gastroenterology, Graduate School of Medicine, Osaka City University, Osaka, Japan.
⁵ Department of General Practice, Osaka City University Hospital, Osaka, Japan.
⁶ Department of Medical Statistics, Graduate School of Medicine, Osaka City University, Osaka, Japan.

Abstract

Objectives: The objective of this study was to develop and validate a state-of-the-art, deep learning (DL)-based model for detecting breast cancers on mammography.

Methods: Mammograms in a hospital development dataset, a hospital test dataset, and a clinic test dataset were retrospectively collected from January 2006 through December 2017 in Osaka City University Hospital and Medcity21 Clinic. The hospital development dataset and a publicly available digital database for screening mammography (DDSM) dataset were used to train and to validate the RetinaNet, one type of DL-based model, with five-fold cross-validation. The model's sensitivity and mean false positive indications per image (mFPI) and partial area under the curve (AUC) with 1.0 mFPI for both test datasets were externally assessed with the test datasets.

Results: The hospital development dataset, hospital test dataset, clinic test dataset, and DDSM development dataset included a total of 3179 images (1448 malignant images), 491 images (225 malignant images), 2821 images (37 malignant images), and 1457 malignant images, respectively. The proposed model detected all cancers with a 0.45-0.47 mFPI and had partial AUCs of 0.93 in both test datasets.

Conclusions: The DL-based model developed for this study was able to detect all breast cancers with a very low mFPI. Our DL-based model achieved the highest performance to date, which might lead to improved diagnosis for breast cancer.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Breast Neoplasms* / diagnostic imaging
Deep Learning*
Early Detection of Cancer
Female
Humans
Mammography / methods
Retrospective Studies

Grants and funding

This work was supported by Wellness Open Living Labs, LLC. No grant number was provided. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.