Artificial intelligence-based models enabling accurate diagnosis of ovarian cancer using laboratory tests in China: a multicentre, retrospective cohort study

Lancet Digit Health. 2024 Mar;6(3):e176-e186. doi: 10.1016/S2589-7500(23)00245-5. Epub 2024 Jan 11.

Abstract

Background: Ovarian cancer is the most lethal gynecological malignancy. Timely diagnosis of ovarian cancer is difficult due to the lack of effective biomarkers. Laboratory tests are widely applied in clinical practice, and some have shown diagnostic and prognostic relevance to ovarian cancer. We aimed to systematically evaluate the value of routine laboratory tests on the prediction of ovarian cancer, and develop a robust and generalisable ensemble artificial intelligence (AI) model to assist in identifying patients with ovarian cancer.

Methods: In this multicentre, retrospective cohort study, we collected 98 laboratory tests and clinical features of women with or without ovarian cancer admitted to three hospitals in China during Jan 1, 2012 and April 4, 2021. A multi-criteria decision making-based classification fusion (MCF) risk prediction framework was used to make a model that combined estimations from 20 AI classification models to reach an integrated prediction tool developed for ovarian cancer diagnosis. It was evaluated on an internal validation set (3007 individuals) and two external validation sets (5641 and 2344 individuals). The primary outcome was the prediction accuracy of the model in identifying ovarian cancer.

Findings: Based on 52 features (51 laboratory tests and age), the MCF achieved an area under the receiver-operating characteristic curve (AUC) of 0·949 (95% CI 0·948-0·950) in the internal validation set, and AUCs of 0·882 (0·880-0·885) and 0·884 (0·882-0·887) in the two external validation sets. The model showed higher AUC and sensitivity compared with CA125 and HE4 in identifying ovarian cancer, especially in patients with early-stage ovarian cancer. The MCF also yielded acceptable prediction accuracy with the exclusion of highly ranked laboratory tests that indicate ovarian cancer, such as CA125 and other tumour markers, and outperformed state-of-the-art models in ovarian cancer prediction. The MCF was wrapped as an ovarian cancer prediction tool, and made publicly available to provide estimated probability of ovarian cancer with input laboratory test values.

Interpretation: The MCF model consistently achieved satisfactory performance in ovarian cancer prediction when using laboratory tests from the three validation sets. This model offers a low-cost, easily accessible, and accurate diagnostic tool for ovarian cancer. The included laboratory tests, not only CA125 which was the highest ranked laboratory test in importance of diagnostic assistance, contributed to the characterisation of patients with ovarian cancer.

Funding: Ministry of Science and Technology of China; National Natural Science Foundation of China; Natural Science Foundation of Guangdong Province, China; and Science and Technology Project of Guangzhou, China.

Translation: For the Chinese translation of the abstract see Supplementary Materials section.

Publication types

  • Multicenter Study

MeSH terms

  • Artificial Intelligence*
  • Female
  • Humans
  • Ovarian Neoplasms* / diagnosis
  • Prognosis
  • ROC Curve
  • Retrospective Studies