Unzipping haplotypes in diploid and polyploid genomes

Comput Struct Biotechnol J. 2019 Dec 9:18:66-72. doi: 10.1016/j.csbj.2019.11.011. eCollection 2020.

Abstract

Diploid genomes consist of two homologous copies of chromosomes with one from each parent while polyploid genomes contain more than two homologous sets of chromosomes. Most of the reference genome assemblies collapsed haplotypes that represent 'mosaic' sequences, ignoring allelic variants that may be involved in important cellular and biological functions. Unzipping haplotypes into distinct sets of sequences has been a growing trend in recent genome studies, as it is an essential tool towards resolving important clinical and biological questions, such as compound heterozygotes, heterosis, and evolution. Herein, we review existing methods for alignment-based and assembly-based haplotype phasing for heterozygous diploid and polyploid genomes, as well as recent advances of experimental approaches for improved genome phasing. We anticipate that full haplotype phasing could become a routine procedure in genome studies in the near future.

Keywords: Genome assembly; Haplotype phasing; Heterozygosity; Ploidy; Reference genome.

Publication types

  • Review