Value of handcrafted and deep radiomic features towards training robust machine learning classifiers for prediction of prostate cancer disease aggressiveness

Ana Rodrigues; Nuno Rodrigues; João Santinha; Maria V Lisitskaya; Aycan Uysal; Celso Matos; Inês Domingues; Nickolas Papanikolaou

doi:10.1038/s41598-023-33339-0

Value of handcrafted and deep radiomic features towards training robust machine learning classifiers for prediction of prostate cancer disease aggressiveness

Sci Rep. 2023 Apr 17;13(1):6206. doi: 10.1038/s41598-023-33339-0.

Authors

Ana Rodrigues^{1

2}, Nuno Rodrigues^{3

4}, João Santinha^{3

5}, Maria V Lisitskaya⁶, Aycan Uysal⁷, Celso Matos³, Inês Domingues^{8

9}, Nickolas Papanikolaou³

Affiliations

¹ Champalimaud Research, Champalimaud Foundation, Lisbon, Portugal. anacarolina.rodrigues@research.fchampalimaud.org.
² Faculty of Medicine, University of Porto, Porto, Portugal. anacarolina.rodrigues@research.fchampalimaud.org.
³ Champalimaud Research, Champalimaud Foundation, Lisbon, Portugal.
⁴ LASIGE, Faculty of Sciences, University of Lisbon, Lisbon, Portugal.
⁵ Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal.
⁶ Cand. of Sci. (Med.), Radiologist at Radiology Department with CT and MRI, Medical Research and Educational Center, Lomonosov Moscow State University, Moscow, Russia.
⁷ Gulhane Medical School, University of Health Sciences, Ankara, Turkey.
⁸ Instituto Politécnico de Coimbra, Instituto Superior de Engenharia, Rua Pedro Nunes-Quinta da Nora, 3030-199, Coimbra, Portugal.
⁹ Centro de Investigação do Instituto Português de Oncologia do Porto (CI-IPOP): Grupo de Física Médica, Radiobiologia e Protecção Radiológica, Porto, Portugal.

Abstract

There is a growing piece of evidence that artificial intelligence may be helpful in the entire prostate cancer disease continuum. However, building machine learning algorithms robust to inter- and intra-radiologist segmentation variability is still a challenge. With this goal in mind, several model training approaches were compared: removing unstable features according to the intraclass correlation coefficient (ICC); training independently with features extracted from each radiologist's mask; training with the feature average between both radiologists; extracting radiomic features from the intersection or union of masks; and creating a heterogeneous dataset by randomly selecting one of the radiologists' masks for each patient. The classifier trained with this last resampled dataset presented with the lowest generalization error, suggesting that training with heterogeneous data leads to the development of the most robust classifiers. On the contrary, removing features with low ICC resulted in the highest generalization error. The selected radiomics dataset, with the randomly chosen radiologists, was concatenated with deep features extracted from neural networks trained to segment the whole prostate. This new hybrid dataset was then used to train a classifier. The results revealed that, even though the hybrid classifier was less overfitted than the one trained with deep features, it still was unable to outperform the radiomics model.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Artificial Intelligence*
Humans
Machine Learning
Male
Prostatic Neoplasms* / diagnostic imaging

Supplementary concepts

Prostate Cancer, Hereditary, 7