Untangling the transcriptome from fungus-infected plant tissues

Gene. 2013 May 1;519(2):238-44. doi: 10.1016/j.gene.2013.02.023. Epub 2013 Mar 1.

Abstract

The development of sequencing technology allows low-cost generation of sequence data. The huge amount of raw sequence data now available has introduced many challenges associated with analysis of these large-scale data banks. For example, it is very important to distinguish materials of plant and fungal origin in fungus-infected plant tissue. The origin of transcripts that were sequenced from Library 895-M6 (poplar tissue infected by Marssonina brunnea) on Illumina/Solexa GA IIx was determined by combining three methods: (1) based on the taxonomic information of homologous sequences; (2) based on the reference genome sequence; (3) based on the transcriptome sequence of the host and its pathogen obtained from Library 895 (poplar) and Library M6 (M. brunnea) as well as Library 895-M6 (mixture of poplar and M. brunnea). We idenified accurately the origin of 80,978 (99.5%) contigs in the mixed poplar and M. brunnea sample (Library 895-M6) by integrating the results from the three methods. The results of this study demonstrate that a combination of these three approaches described here is an effective strategy for determining the origin of sequences in a mixed pool, and provides a basis for further transcriptome analysis of the mixed sample.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Ascomycota / genetics
  • Ascomycota / pathogenicity*
  • Expressed Sequence Tags
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Plant*
  • Gene Library
  • Host-Pathogen Interactions*
  • Plant Diseases / microbiology*
  • Plant Leaves / genetics*
  • Plant Leaves / microbiology
  • Plant Proteins / genetics
  • Plant Proteins / metabolism
  • Sequence Analysis, RNA

Substances

  • Plant Proteins