A high-resolution HLA imputation system for the Taiwanese population: a study of the Taiwan Biobank

Pharmacogenomics J. 2020 Oct;20(5):695-704. doi: 10.1038/s41397-020-0156-3. Epub 2020 Feb 11.

Abstract

Imputation algorithms for human leukocyte antigen (HLA) alleles are helpful for exploring novel disease associations. However, population-specific HLA imputation references are essential for achieving high imputation accuracy. In this study, a subset of 1012 individuals from the Taiwan Biobank (TWB) who underwent both whole-genome SNP array genotyping and NGS-based HLA typing was used to establish Taiwanese HLA imputation references. The HIBAG package was used to generate the imputation references for eight HLA loci at two- and three-field resolution. Internal validation was carried out to evaluate the call threshold and accuracy for each HLA gene. Reported associations between HLA class II genes and rheumatoid arthritis (RA) were then validated using the imputed HLA alleles. Our Taiwanese population-specific references achieved average HLA imputation accuracies of 98.11% at two-field and 98.08% at three-field resolution. The frequency distribution of imputed HLA alleles among 23,972 TWB subjects was comparable with that of PCR-based HLA alleles in the general Taiwanese population, as reported in the Allele Frequency Net Database. We replicated four common HLA alleles (HLA-DRB1*03:01, DRB1*04:05, DQA1*03:03, and DQB1*04:01) significantly associated with RA. The population-specific references provide an informative tool for investigating associations between HLA variants and human diseases in large-scale population-based studies.
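
The internal validation step, which evaluates a call threshold and an accuracy per HLA gene, can be illustrated with a minimal sketch. This is not the authors' code and does not use HIBAG (an R package). It assumes a common convention: each imputed genotype carries a posterior probability, genotypes below the call threshold are left uncalled, and accuracy is the fraction of correctly imputed alleles among called samples. The function names, the 0.5 default threshold, and the toy genotypes are all hypothetical.

```python
import math
from typing import List, Tuple

Genotype = Tuple[str, str]  # two alleles at one HLA locus, unphased


def allele_matches(truth: Genotype, imputed: Genotype) -> int:
    """Count how many imputed alleles (0, 1, or 2) match the typed genotype, ignoring phase."""
    remaining = list(truth)
    matches = 0
    for allele in imputed:
        if allele in remaining:
            remaining.remove(allele)  # each typed allele can be matched only once
            matches += 1
    return matches


def evaluate(
    truth: List[Genotype],
    imputed: List[Genotype],
    posterior: List[float],
    call_threshold: float = 0.5,  # hypothetical default, not taken from the study
) -> Tuple[float, float]:
    """Return (call rate, allele-level accuracy among called samples)."""
    kept = [i for i, p in enumerate(posterior) if p >= call_threshold]
    call_rate = len(kept) / len(truth) if truth else 0.0
    if not kept:
        return call_rate, math.nan  # accuracy is undefined when nothing is called
    matched = sum(allele_matches(truth[i], imputed[i]) for i in kept)
    return call_rate, matched / (2 * len(kept))


# Toy data for illustration only (not results from the study).
typed = [("03:01", "04:05"), ("04:05", "09:01"), ("15:01", "15:01"), ("08:03", "12:02")]
guess = [("04:05", "03:01"), ("04:05", "04:05"), ("15:01", "15:01"), ("08:03", "14:01")]
probs = [0.95, 0.40, 0.99, 0.80]

call_rate, accuracy = evaluate(typed, guess, probs, call_threshold=0.5)
```

Raising the threshold in this sketch trades call rate for accuracy: low-confidence genotypes are dropped, so fewer samples are called but a larger share of the called alleles is correct.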

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Arthritis, Rheumatoid / diagnosis
  • Arthritis, Rheumatoid / genetics*
  • Arthritis, Rheumatoid / immunology
  • Databases, Genetic
  • Genetics, Population*
  • Genotype
  • HLA Antigens / genetics*
  • HLA-DQ alpha-Chains / genetics
  • HLA-DQ beta-Chains / genetics
  • HLA-DRB1 Chains / genetics
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide*
  • Reproducibility of Results
  • Taiwan
  • Whole Genome Sequencing

Substances

  • HLA Antigens
  • HLA-DQ alpha-Chains
  • HLA-DQ beta-Chains
  • HLA-DQA1 antigen
  • HLA-DQB1 antigen
  • HLA-DRB1 Chains
  • HLA-DRB1*03:01 antigen
  • HLA-DRB1*04:05 antigen