Accurate estimation of biological age and its application in disease prediction using a multimodal image Transformer system

Proc Natl Acad Sci U S A. 2024 Jan 16;121(3):e2308812120. doi: 10.1073/pnas.2308812120. Epub 2024 Jan 8.

Abstract

Aging in an individual refers to the temporal change, mostly decline, in the body's ability to meet physiological demands. Biological age (BA) is a biomarker of chronological aging and can be used to stratify populations to predict certain age-related chronic diseases. BA can be predicted from biomedical features such as brain MRI, retinal, or facial images, but the inherent heterogeneity in the aging process limits the usefulness of BA predicted from individual body systems. In this paper, we developed a multimodal Transformer-based architecture with cross-attention which was able to combine facial, tongue, and retinal images to estimate BA. We trained our model using facial, tongue, and retinal images from 11,223 healthy subjects and demonstrated that using a fusion of the three image modalities achieved the most accurate BA predictions. We validated our approach on a test population of 2,840 individuals with six chronic diseases and obtained significant difference between chronological age and BA (AgeDiff) than that of healthy subjects. We showed that AgeDiff has the potential to be utilized as a standalone biomarker or conjunctively alongside other known factors for risk stratification and progression prediction of chronic diseases. Our results therefore highlight the feasibility of using multimodal images to estimate and interrogate the aging process.

Keywords: biological age prediction; biomarker discovery; chronic disease diagnosis and prognosis; multimodal fusion; transformer with cross-attention.

MeSH terms

  • Aging*
  • Biomarkers
  • Chronic Disease
  • Electric Power Supplies*
  • Face
  • Humans

Substances

  • Biomarkers