HALD, a human aging and longevity knowledge graph for precision gerontology and geroscience analyses

Sci Data. 2023 Dec 1;10(1):851. doi: 10.1038/s41597-023-02781-0.

Abstract

Human aging is a natural and inevitable biological process that leads to an increased risk of aging-related diseases. Developing anti-aging therapies for aging-related diseases requires a comprehensive understanding of the mechanisms and effects of aging and longevity from a multi-modal and multi-faceted perspective. However, most of the relevant knowledge is scattered in the biomedical literature, the volume of which reached 36 million in PubMed. Here, we presented HALD, a text mining-based human aging and longevity dataset of the biomedical knowledge graph from all published literature related to human aging and longevity in PubMed. HALD integrated multiple state-of-the-art natural language processing (NLP) techniques to improve the accuracy and coverage of the knowledge graph for precision gerontology and geroscience analyses. Up to September 2023, HALD had contained 12,227 entities in 10 types (gene, RNA, protein, carbohydrate, lipid, peptide, pharmaceutical preparations, toxin, mutation, and disease), 115,522 relations, 1,855 aging biomarkers, and 525 longevity biomarkers from 339,918 biomedical articles in PubMed. HALD is available at https://bis.zju.edu.cn/hald .

Publication types

  • Dataset

MeSH terms

  • Aging*
  • Biomarkers
  • Geriatrics*
  • Geroscience
  • Humans
  • Longevity*
  • Pattern Recognition, Automated

Substances

  • Biomarkers