Genome-wide discovery and information resource development of DNA polymorphisms in cassava

PLoS One. 2013 Sep 11;8(9):e74056. doi: 10.1371/journal.pone.0074056. eCollection 2013.

Abstract

Cassava (Manihot esculenta Crantz) is an important crop that provides food security and income generation in many tropical countries, and is known for its adaptability to various environmental conditions. Its draft genome sequence and many expressed sequence tags are now publicly available, allowing the development of cassava polymorphism information. Here, we describe the genome-wide discovery of cassava DNA polymorphisms. Using the alignment of predicted transcribed sequences from the cassava draft genome sequence and ESTs from GenBank, we discovered 10,546 single-nucleotide polymorphisms and 647 insertions and deletions. To facilitate molecular marker development for cassava, we designed 9,316 PCR primer pairs to amplify the genomic region around each DNA polymorphism. Of the discovered SNPs, 62.7% occurred in protein-coding regions. Disease-resistance genes were found to have a significantly higher ratio of nonsynonymous-to-synonymous substitutions. We identified 24 read-through (changes of a stop codon to a coding codon) and 38 premature stop (changes of a coding codon to a stop codon) single-nucleotide polymorphisms, and found that the 5 gene ontology terms in biological process were significantly different in genes with read-through single-nucleotide polymorphisms compared with all cassava genes. All data on the discovered DNA polymorphisms were organized into the Cassava Online Archive database, which is available at http://cassava.psc.riken.jp/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Genetic*
  • Genome, Plant
  • Genome-Wide Association Study*
  • INDEL Mutation
  • Manihot / genetics*
  • Manihot / metabolism
  • Molecular Sequence Annotation
  • Online Systems
  • Plant Proteins / genetics
  • Plant Proteins / metabolism
  • Polymorphism, Genetic*
  • Polymorphism, Single Nucleotide
  • Reproducibility of Results

Substances

  • Plant Proteins

Grants and funding

This work was supported by Strategic Funds for the Promotion of Science and Technology of the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan and the Grant-in-Aid for Scientific Research for Young Scientists (B) (21710205) to T.S. from the Japan Society for the Promotion of Science. This work was also partially supported by the Grant-in-Aid for Scientific Research for Scientific Research on Innovative Areas (23119524) to K.M. from the Japan Society for the Promotion of Science. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.