Traditional Machine Learning Methods versus Deep Learning for Meningioma Classification, Grading, Outcome Prediction, and Segmentation: A Systematic Review and Meta-Analysis

Krish M Maniar; Philipp Lassarén; Aakanksha Rana; Yuxin Yao; Ishaan A Tewarie; Jakob V E Gerstl; Camila M Recio Blanco; Liam H Power; Marco Mammi; Heather Mattie; Timothy R Smith; Rania A Mekary

doi:10.1016/j.wneu.2023.08.023

Traditional Machine Learning Methods versus Deep Learning for Meningioma Classification, Grading, Outcome Prediction, and Segmentation: A Systematic Review and Meta-Analysis

World Neurosurg. 2023 Nov:179:e119-e134. doi: 10.1016/j.wneu.2023.08.023. Epub 2023 Aug 12.

Authors

Affiliations

¹ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States.
² Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden.
³ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; McGovern Institute for Brain Research, Massachusetts Institute of Technology, Boston, Massachusetts, United States.
⁴ Department of Pharmaceutical Business and Administrative Sciences, School of Pharmacy, Massachusetts College of Pharmacy and Health Sciences University, Boston, Massachusetts, United States.
⁵ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; Department of Neurosurgery, Haaglanden Medical Center, The Hague, The Netherlands; Faculty of Medicine, Erasmus University Rotterdam/Erasmus Medical Center Rotterdam, Rotterdam, The Netherlands.
⁶ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; Northeast National University, Corrientes, Argentina; Prisma Salud, Puerto San Julian, Santa Cruz, Argentina.
⁷ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; School of Medicine, Tufts University, Boston, Massachusetts, United States.
⁸ Neurosurgery Unit, S. Croce e Carle Hospital, Cuneo, Italy.
⁹ Department of Biostatistics, Harvard TH Chan School of Public Health, Boston, Massachusetts, United States.
¹⁰ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; Department of Neurosurgery, Brigham and Women's Hospital, Harvard University, Boston, Massachusetts, United States.
¹¹ Department of Neurosurgery, Computational Neurosciences Outcomes Center (CNOC), Harvard Medical School, Brigham and Women's Hospital, Boston, Massachusetts, United States; Department of Pharmaceutical Business and Administrative Sciences, School of Pharmacy, Massachusetts College of Pharmacy and Health Sciences University, Boston, Massachusetts, United States. Electronic address: rania.mekary@channing.harvard.edu.

PMID: 37574189
DOI: 10.1016/j.wneu.2023.08.023

Abstract

Background: Meningiomas are common intracranial tumors. Machine learning (ML) algorithms are emerging to improve accuracy in 4 primary domains: classification, grading, outcome prediction, and segmentation. Such algorithms include both traditional approaches that rely on hand-crafted features and deep learning (DL) techniques that utilize automatic feature extraction. The aim of this study was to evaluate the performance of published traditional ML versus DL algorithms in classification, grading, outcome prediction, and segmentation of meningiomas.

Methods: A systematic review and meta-analysis were conducted. Major databases were searched through September 2021 for publications evaluating traditional ML versus DL models on meningioma management. Performance measures including pooled sensitivity, specificity, F1-score, area under the receiver-operating characteristic curve, positive and negative likelihood ratios (LR+, LR-) along with their respective 95% confidence intervals (95% CIs) were derived using random-effects models.

Results: Five hundred thirty-four records were screened, and 43 articles were included, regarding classification (3 articles), grading (29), outcome prediction (7), and segmentation (6) of meningiomas. Of the 29 studies that reported on grading, 10 could be meta-analyzed with 2 DL models (sensitivity 0.89, 95% CI: 0.74-0.96; specificity 0.91, 95% CI: 0.45-0.99; LR+ 10.1, 95% CI: 1.33-137; LR- 0.12, 95% CI: 0.04-0.59) and 8 traditional ML (sensitivity 0.74, 95% CI: 0.62-0.83; specificity 0.93, 95% CI: 0.79-0.98; LR+ 10.5, 95% CI: 2.91-39.5; and LR- 0.28, 95% CI: 0.17-0.49). The insufficient performance metrics reported precluded further statistical analysis of other performance metrics.

Conclusions: ML on meningiomas is mostly carried out with traditional methods. For meningioma grading, traditional ML methods generally had a higher LR+, while DL models a lower LR-.

Keywords: Deep learning; Grading; Machine learning; Meningioma; Segmentation; Traditional statistics.

Publication types

Meta-Analysis
Systematic Review

MeSH terms

Deep Learning*
Humans
Machine Learning
Meningeal Neoplasms* / diagnostic imaging
Meningeal Neoplasms* / pathology
Meningioma* / diagnostic imaging
Meningioma* / pathology
Prognosis