The million mutation project: a new approach to genetics in Caenorhabditis elegans

Genome Res. 2013 Oct;23(10):1749-62. doi: 10.1101/gr.157651.113. Epub 2013 Jun 25.

Abstract

We have created a library of 2007 mutagenized Caenorhabditis elegans strains, each sequenced to a target depth of 15-fold coverage, to provide the research community with mutant alleles for each of the worm's more than 20,000 genes. The library contains over 800,000 unique single nucleotide variants (SNVs) with an average of eight nonsynonymous changes per gene and more than 16,000 insertion/deletion (indel) and copy number changes, providing an unprecedented genetic resource for this multicellular organism. To supplement this collection, we also sequenced 40 wild isolates, identifying more than 630,000 unique SNVs and 220,000 indels. Comparison of the two sets demonstrates that the mutant collection has a much richer array of both nonsense and missense mutations than the wild isolate set. We also find a wide range of rDNA and telomere repeat copy number in both sets. Scanning the mutant collection for molecular phenotypes reveals a nonsense suppressor as well as strains with higher levels of indels that harbor mutations in DNA repair genes and strains with abundant males associated with him mutations. All the strains are available through the Caenorhabditis Genetics Center and all the sequence changes have been deposited in WormBase and are available through an interactive website.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Animals
  • Caenorhabditis elegans / classification
  • Caenorhabditis elegans / genetics*
  • Codon, Nonsense
  • DNA Copy Number Variations
  • DNA, Ribosomal
  • Databases, Nucleic Acid
  • Genes, Essential
  • Genes, Helminth*
  • Genes, Suppressor
  • Genetic Variation
  • Genome, Helminth
  • Genome, Mitochondrial
  • Heterozygote
  • INDEL Mutation
  • Male
  • Mutation*
  • Mutation, Missense
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Tandem Repeat Sequences

Substances

  • Codon, Nonsense
  • DNA, Ribosomal

Associated data

  • SRA/SRPO18046