MV-CVIB: a microbiome-based multi-view convolutional variational information bottleneck for predicting metastatic colorectal cancer

Front Microbiol. 2023 Aug 22:14:1238199. doi: 10.3389/fmicb.2023.1238199. eCollection 2023.

Abstract

Introduction: Imbalances in gut microbes have been implied in many human diseases, including colorectal cancer (CRC), inflammatory bowel disease, type 2 diabetes, obesity, autism, and Alzheimer's disease. Compared with other human diseases, CRC is a gastrointestinal malignancy with high mortality and a high probability of metastasis. However, current studies mainly focus on the prediction of colorectal cancer while neglecting the more serious malignancy of metastatic colorectal cancer (mCRC). In addition, high dimensionality and small samples lead to the complexity of gut microbial data, which increases the difficulty of traditional machine learning models.

Methods: To address these challenges, we collected and processed 16S rRNA data and calculated abundance data from patients with non-metastatic colorectal cancer (non-mCRC) and mCRC. Different from the traditional health-disease classification strategy, we adopted a novel disease-disease classification strategy and proposed a microbiome-based multi-view convolutional variational information bottleneck (MV-CVIB).

Results: The experimental results show that MV-CVIB can effectively predict mCRC. This model can achieve AUC values above 0.9 compared to other state-of-the-art models. Not only that, MV-CVIB also achieved satisfactory predictive performance on multiple published CRC gut microbiome datasets.

Discussion: Finally, multiple gut microbiota analyses were used to elucidate communities and differences between mCRC and non-mCRC, and the metastatic properties of CRC were assessed by patient age and microbiota expression.

Keywords: information bottleneck; metastatic colorectal cancer; microbiome; multi-view; risk assessment.

Grants and funding

This study was supported by the grant of the National Key R&D Program of China (Nos. 2018YFA0902600 and 2018AAA0100100) and partly supported by the National Natural Science Foundation of China (Grant Nos. 62002266, 61932008, and 62073231), respectively. This study was also supported by the Key Project of Science and Technology of Guangxi (Grant No. 2021AB20147), the Guangxi Natural Science Foundation (Grant Nos. 2021JJA170204 and 2021JJA170199), and the Guangxi Science and Technology Base and Talents Special Project (Grant No. 2021AC19354).