Joint global and local interpretation method for CIN status classification in breast cancer

Heliyon. 2024 Feb 28;10(5):e27054. doi: 10.1016/j.heliyon.2024.e27054. eCollection 2024 Mar 15.

Abstract

Breast cancer is among the cancer types with the highest numbers of new cases, and the study of this disease from a microscopic perspective has been a prominent research topic. Previous studies have shown that microRNAs (miRNAs) are closely linked to chromosomal instability (CIN). Correctly predicting CIN status from miRNAs can help to improve the survival of breast cancer patients. In this study, a joint global and local interpretation method called GL_XGBoost is proposed for predicting CIN status in breast cancer. GL_XGBoost integrates the eXtreme Gradient Boosting (XGBoost) and SHapley Additive exPlanation (SHAP) methods. XGBoost is used to predict CIN status from miRNA data, whereas SHAP is used to select miRNA features that have strong relationships with CIN. Furthermore, SHAP's rich visualization strategies enhance the interpretability of the entire model at both the global level (across all samples) and the local level (for individual samples). The performance of GL_XGBoost is validated on the TCGA-BRCA dataset, where it achieves an accuracy of 78.57% and an area under the curve (AUC) of 0.87. Rich visual analysis is used to explain the relationships between miRNAs and CIN status from different perspectives. Our study demonstrates an intuitive way of exploring the relationship between CIN and cancer from a microscopic perspective.

Keywords: Breast cancer; Chromosomal instability; SHAP; XGBoost; miRNAs.
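
To make the distinction between local and global interpretation concrete, the following is a minimal sketch of the Shapley-value idea that SHAP builds on. It is not the authors' pipeline: the scoring function, the feature names (miR_a, miR_b, miR_c) and the data are hypothetical stand-ins for a trained classifier and real miRNA expression values, and the brute-force enumeration below is only practical for a handful of features, so it should be read as an illustration rather than as the method used in the paper.

```python
from itertools import combinations
from math import factorial

# Hypothetical feature names; placeholders, not miRNAs from the study.
FEATURES = ["miR_a", "miR_b", "miR_c"]

# Hypothetical background samples used to "switch off" a feature.
BACKGROUND = [
    [0.0, 0.0, 0.0],
    [1.0, 1.0, 1.0],
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 0.0],
]


def toy_model(x):
    """Toy stand-in for a trained classifier's raw score (assumption)."""
    a, b, c = x
    return 2.0 * a + b * c


def value(subset, x, background):
    """Average model output when features in `subset` keep the sample's
    values and all other features are taken from the background rows."""
    total = 0.0
    for ref in background:
        mixed = [x[i] if i in subset else ref[i] for i in range(len(x))]
        total += toy_model(mixed)
    return total / len(background)


def shapley_values(x, background):
    """Exact Shapley values for one sample by enumerating all coalitions."""
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            # Standard Shapley weight for a coalition of size k.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                gain = value(s | {i}, x, background) - value(s, x, background)
                phi[i] += weight * gain
    return phi


# Local interpretation: one vector of contributions per sample.
samples = [[1.0, 1.0, 1.0], [0.0, 1.0, 1.0], [1.0, 0.0, 0.0]]
local = [shapley_values(x, BACKGROUND) for x in samples]

# Global interpretation: mean absolute contribution per feature,
# which can also serve as a ranking for feature selection.
global_importance = [
    sum(abs(phi[i]) for phi in local) / len(local)
    for i in range(len(FEATURES))
]
ranking = sorted(
    zip(FEATURES, global_importance), key=lambda pair: pair[1], reverse=True
)

for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Each row of `local` explains a single prediction, and its entries sum to the difference between that sample's score and the average background score. Averaging absolute contributions over samples gives the global ranking, and keeping only the top-ranked features is one simple way to perform the kind of feature selection the abstract describes.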