Development and Validation of an AI-driven Mammographic Breast Density Classification Tool Based on Radiologist Consensus

Veronica Magni; Matteo Interlenghi; Andrea Cozzi; Marco Alì; Christian Salvatore; Alcide A Azzena; Davide Capra; Serena Carriero; Gianmarco Della Pepa; Deborah Fazzini; Giuseppe Granata; Caterina B Monti; Giulia Muscogiuri; Giuseppe Pellegrino; Simone Schiaffino; Isabella Castiglioni; Sergio Papa; Francesco Sardanelli

doi:10.1148/ryai.210199

Development and Validation of an AI-driven Mammographic Breast Density Classification Tool Based on Radiologist Consensus

Radiol Artif Intell. 2022 Mar 16;4(2):e210199. doi: 10.1148/ryai.210199. eCollection 2022 Mar.

Authors

Veronica Magni^#¹, Matteo Interlenghi^#¹, Andrea Cozzi¹, Marco Alì¹, Christian Salvatore¹, Alcide A Azzena¹, Davide Capra¹, Serena Carriero¹, Gianmarco Della Pepa¹, Deborah Fazzini¹, Giuseppe Granata¹, Caterina B Monti¹, Giulia Muscogiuri¹, Giuseppe Pellegrino¹, Simone Schiaffino¹, Isabella Castiglioni¹, Sergio Papa¹, Francesco Sardanelli¹

Affiliation

¹ Department of Biomedical Sciences for Health (V.M., A.C., D.C., C.B.M., F.S.) and Postgraduate School in Radiodiagnostics (A.A.A., S.C., G.D.P., G.G., G.M., G.P.), Università degli Studi di Milano, Milan, Italy; DeepTrace Technologies, Milan, Italy (M.I., C.S.); Unit of Diagnostic Imaging and Stereotactic Radiosurgery, C.D.I. Centro Diagnostico Italiano, Milan, Italy (M.A., D.F., S.P.); Bracco Imaging, Milan, Italy (M.A.); Department of Science, Technology and Society, University School for Advanced Studies IUSS Pavia, Palazzo del Broletto, Piazza della Vittoria 15, 27100 Pavia, Italy (C.S.); Unit of Radiology, IRCCS Policlinico San Donato, San Donato Milanese, Italy (S.S., F.S.); Institute of Biomedical Imaging and Physiology, Consiglio Nazionale delle Ricerche, Segrate, Italy (I.C.); and Department of Physics, Università degli Studi di Milano-Bicocca, Milan, Italy (I.C.).

^# Contributed equally.

Abstract

Mammographic breast density (BD) is commonly visually assessed using the Breast Imaging Reporting and Data System (BI-RADS) four-category scale. To overcome inter- and intraobserver variability of visual assessment, the authors retrospectively developed and externally validated a software for BD classification based on convolutional neural networks from mammograms obtained between 2017 and 2020. The tool was trained using the majority BD category determined by seven board-certified radiologists who independently visually assessed 760 mediolateral oblique (MLO) images in 380 women (mean age, 57 years ± 6 [SD]) from center 1; this process mimicked training from a consensus of several human readers. External validation of the model was performed by the three radiologists whose BD assessment was closest to the majority (consensus) of the initial seven on a dataset of 384 MLO images in 197 women (mean age, 56 years ± 13) obtained from center 2. The model achieved an accuracy of 89.3% in distinguishing BI-RADS a or b (nondense breasts) versus c or d (dense breasts) categories, with an agreement of 90.4% (178 of 197 mammograms) and a reliability of 0.807 (Cohen κ) compared with the mode of the three readers. This study demonstrates accuracy and reliability of a fully automated software for BD classification. Keywords: Mammography, Breast, Convolutional Neural Network (CNN), Deep Learning Algorithms, Machine Learning Algorithms Supplemental material is available for this article. © RSNA, 2022.

Keywords: Breast; Convolutional Neural Network (CNN); Deep Learning Algorithms; Machine Learning Algorithms; Mammography.

2022 by the Radiological Society of North America, Inc.