Pattern recognition of hematological profiles of tumors of the digestive tract: an exploratory study

Front Med (Lausanne). 2023 Aug 16:10:1208022. doi: 10.3389/fmed.2023.1208022. eCollection 2023.

Abstract

Aims: In this study, we aimed to apply laboratory blood analysis to identify the hematological (based on hemoglobin concentration, erythrocytes, hematocrit, and RDW count) profiles associated with the most prevalent forms of digestive tract malignancies. Furthermore, we aimed to evaluate how these profiles contributed to distinguishing these tumors at diagnosis.

Methods: We collected data from the date of ICD-10 diagnostic coding for C15 esophagus, C16 stomach, C18 colon, and C19 rectum tumors of 184 individuals. The statistical analysis and data visualization approaches, notably the heat map and principal component analysis (PCA), allowed for creating a summary hematological profile and identifying the most associated parameters for each pathologic state. Univariate and multivariate data modeling and ROC analysis were performed in both SPSS and Python.

Results: Our data reveal unique patterns based on tumor development anatomical location, clustering the C18 colon and C19 rectum from the C15 esophagus and C16 stomach. We found a significant difference between C16 stomach carcinoma and the other tumors, which substantially correlated with raised RDW in conjunction with low hemoglobin concentration, erythrocytes, and hematocrit counts. In contrast, C18 colon carcinoma had the higher red blood cell count, allowing for the best classification metrics in the test set of the binary logistic regression (LR) model, accounting for an AUC of 0.77 with 94% sensitivity and 52% specificity.

Conclusion: This study emphasizes the significance of adding hematological patterns in diagnosing these malignancies, which could path further investigations regarding profiling and monitoring at the point of care.

Keywords: anatomical location; cancer; digestive tract tumor; hematological profile; pattern recognition.

Grants and funding

This study has been funded by National Funds, through the Foundation for Science and Technology (FCT)—Projects UIDB/50026/2020 and UIDP/50026/2020.