ALPHLARD-NT: Bayesian Method for Human Leukocyte Antigen Genotyping and Mutation Calling through Simultaneous Analysis of Normal and Tumor Whole-Genome Sequence Data

J Comput Biol. 2019 Sep;26(9):923-937. doi: 10.1089/cmb.2018.0224. Epub 2019 Apr 3.

Abstract

Human leukocyte antigen (HLA) genes provide useful information on the relationship between cancer and the immune system. Despite the ease of obtaining these data through next-generation sequencing methods, interpretation of these relationships remains challenging owing to the complexity of HLA genes. To resolve this issue, we developed a Bayesian method, ALPHLARD-NT, to identify HLA germline and somatic mutations as well as HLA genotypes from whole-exome sequencing (WES) and whole-genome sequencing (WGS) data. ALPHLARD-NT showed 99.2% accuracy for WGS-based HLA genotyping and detected five HLA somatic mutations in 25 colon cancer cases. In addition, ALPHLARD-NT identified 88 HLA somatic mutations, including recurrent mutations and a novel HLA-B type, from WES data of 343 colon adenocarcinoma cases. These results demonstrate the potential of ALPHLARD-NT for conducting an accurate analysis of HLA genes even from low-coverage data sets. This method can become an essential tool for comprehensive analyses of HLA genes from WES and WGS data, helping to advance understanding of immune regulation in cancer as well as providing guidance for novel immunotherapy strategies.

Keywords: Bayesian model; HLA genotyping; HLA mutation calling; whole-exome sequencing; whole-genome sequencing.

MeSH terms

  • Bayes Theorem
  • Computational Biology / methods*
  • Genotyping Techniques / methods*
  • HLA Antigens / genetics*
  • Humans
  • Mutation Rate
  • Neoplasms / genetics*
  • Software*
  • Whole Genome Sequencing / methods*

Substances

  • HLA Antigens