Characterization and analysis of full-length transcriptomes from two grasshoppers, Gomphocerus licenti and Mongolotettix japonicus

Sci Rep. 2020 Aug 26;10(1):14228. doi: 10.1038/s41598-020-71178-5.

Abstract

Acrididae are diverse in size, body shape, behavior, ecology and life history; widely distributed; easy to collect; and important to agriculture. They represent promising model candidates for functional genomics, but their extremely large genomes have hindered this research; establishing a reference transcriptome for a species is the primary means of obtaining genetic information. Here, two Acrididae species, Gomphocerus licenti and Mongolotettix japonicus, were selected for full-length (FL) PacBio transcriptome sequencing. For G. licenti and M. japonicus, respectively, 590,112 and 566,165 circular consensus sequences (CCS) were generated, from which 458,131 and 428,979 full-length nonchimeric (FLNC) reads were identified. After isoform-level clustering, error correction with next-generation sequencing (NGS) short reads, and removal of redundant sequences with CD-HIT, 17,970 and 16,766 unigenes were generated for G. licenti and M. japonicus, respectively. In addition, analysis of the G. licenti and M. japonicus transcriptomes yielded 17,495 and 16,373 coding sequences, 1,082 and 813 transcription factors, 11,840 and 10,814 simple sequence repeats, and 905 and 706 long noncoding RNAs, respectively, and 15,803 and 14,846 unigenes were annotated in eight functional databases. This is the first study to sequence FL transcriptomes of G. licenti and M. japonicus, providing valuable genetic resources for further functional genomics research.
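
The counts reported above can be put on a common scale by expressing them as proportions. The following is a minimal sketch, not part of the original study: it derives the share of CCS reads classified as FLNC and the share of unigenes annotated in at least one database, using only the figures quoted in the abstract. The names `COUNTS`, `summarize` and `SUMMARY` are hypothetical and chosen for illustration.

```python
# Counts copied from the abstract; the dictionary layout and all names
# here are illustrative assumptions, not the authors' code.
COUNTS = {
    "G. licenti": {
        "ccs": 590_112, "flnc": 458_131,
        "unigenes": 17_970, "annotated": 15_803,
    },
    "M. japonicus": {
        "ccs": 566_165, "flnc": 428_979,
        "unigenes": 16_766, "annotated": 14_846,
    },
}


def summarize(counts):
    """Return FLNC/CCS and annotated/unigene proportions for one species."""
    return {
        "flnc_per_ccs": counts["flnc"] / counts["ccs"],
        "annotated_per_unigene": counts["annotated"] / counts["unigenes"],
    }


SUMMARY = {species: summarize(c) for species, c in COUNTS.items()}

if __name__ == "__main__":
    for species, s in SUMMARY.items():
        print(
            f"{species}: FLNC/CCS = {s['flnc_per_ccs']:.1%}, "
            f"annotated unigenes = {s['annotated_per_unigene']:.1%}"
        )
```

Under these figures, roughly three quarters of CCS reads in each species were classified as FLNC, and close to nine in ten unigenes received an annotation.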

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Animals
  • Female
  • Gene Expression Profiling / methods
  • Grasshoppers / genetics
  • Grasshoppers / metabolism*
  • Male
  • Microsatellite Repeats
  • RNA, Long Noncoding
  • Transcriptome*

Substances

  • RNA, Long Noncoding