Estimation of long terminal repeat element content in the Helicoverpa zea genome from high-throughput sequencing of bacterial artificial chromosome pools

Genome. 2017 Apr;60(4):310-324. doi: 10.1139/gen-2016-0067. Epub 2016 Nov 15.

Abstract

The lepidopteran pest insect Helicoverpa zea feeds on cultivated corn and cotton across the Americas, where control remains challenging owing to the evolution of resistance to chemical and transgenic insecticidal toxins, yet genomic resources remain scarce for this species. A bacterial artificial chromosome (BAC) library with a mean genomic insert size of 145 ± 20 kbp was created from a laboratory strain of H. zea and provides ∼12.9-fold coverage of the haploid genome, whose size was estimated by flow cytometry at 362.8 ± 8.8 Mbp (0.37 ± 0.09 pg). Assembly of Illumina HiSeq 2000 reads generated from 14 pools that encompassed all BAC clones resulted in 165 485 genomic contigs (N50 = 3262 bp; 324.6 Mbp total). Long terminal repeat (LTR) protein-coding regions annotated from 181 contigs included 30 Ty1/copia, 78 Ty3/gypsy, and 73 BEL/Pao elements, of which 60 (33.1%) encoded all five functional polyprotein (pol) domains. Approximately 14% of LTR elements are distributed non-randomly across pools of BAC clones.
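The summary figures in the abstract can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative only and is not the authors' pipeline. It assumes the commonly used conversion of 1 pg of DNA ≈ 978 Mbp, which the abstract does not state, and it back-calculates a clone count from the reported coverage, a number the abstract does not report. All function names are hypothetical, and the N50 function is run on toy contig lengths rather than the real assembly.

```python
# Assumption (not stated in the abstract): 1 pg of DNA is about 978 Mbp.
MBP_PER_PG = 978.0


def pg_to_mbp(picograms):
    """Convert a DNA mass in picograms to megabase pairs."""
    return picograms * MBP_PER_PG


def implied_clone_count(fold_coverage, genome_mbp, mean_insert_kbp):
    """Back-calculate the clone count implied by library coverage.

    coverage = clones * mean_insert / genome_size, solved for clones.
    """
    return fold_coverage * genome_mbp * 1000.0 / mean_insert_kbp


def n50(contig_lengths):
    """Return the length L such that contigs of length >= L hold
    at least half of the total assembled bases."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("contig_lengths must not be empty")


# LTR element counts as reported in the abstract.
ltr_counts = {"Ty1/copia": 30, "Ty3/gypsy": 78, "BEL/Pao": 73}
total_ltr = sum(ltr_counts.values())
full_pol_percent = 100.0 * 60 / total_ltr

if __name__ == "__main__":
    print(f"0.37 pg is about {pg_to_mbp(0.37):.1f} Mbp")
    clones = implied_clone_count(12.9, 362.8, 145)
    print(f"Implied clone count: about {clones:.0f}")
    print(f"LTR elements: {total_ltr}; with all five pol domains: "
          f"{full_pol_percent:.1f}%")
    print(f"N50 of toy contigs: {n50([2, 3, 4, 5, 6])}")
```

Under these assumptions, the mass-based and base-pair genome size estimates agree to within about 1 Mbp, and the three superfamily counts sum to the 181 elements from which the 33.1% figure is derived.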

Keywords: Helicoverpa; bacterial artificial chromosome; evolutionary genetics; retrotransposon.

MeSH terms

  • Animals
  • Chromosomes, Artificial, Bacterial / genetics*
  • Evolution, Molecular
  • Genome Size
  • Genome, Insect
  • Genomic Library
  • Haploidy
  • High-Throughput Nucleotide Sequencing / methods*
  • Lepidoptera / genetics*
  • Male
  • Molecular Sequence Annotation
  • Phylogeny
  • Sequence Analysis, DNA / methods*
  • Terminal Repeat Sequences*