Identification of probable genomic packaging signal sequence from SARS-CoV genome by bioinformatics analysis

Acta Pharmacol Sin. 2003 Jun;24(6):489-96.

Abstract

Aim: To predict the probable genomic packaging signal of SARS-CoV by bioinformatics analysis. The derived packaging signal may be used to design antisense RNA and RNA interfere (RNAi) drugs treating SARS.

Methods: Based on the studies about the genomic packaging signals of MHV and BCoV, especially the information about primary and secondary structures, the putative genomic packaging signal of SARS-CoV were analyzed by using bioinformatic tools. Multi-alignment for the genomic sequences was performed among SARS-CoV, MHV, BCoV, PEDV and HCoV 229E. Secondary structures of RNA sequences were also predicted for the identification of the possible genomic packaging signals. Meanwhile, the N and M proteins of all five viruses were analyzed to study the evolutionary relationship with genomic packaging signals.

Results: The putative genomic packaging signal of SARS-CoV locates at the 3' end of ORF1b near that of MHV and BCoV, where is the most variable region of this gene. The RNA secondary structure of SARS-CoV genomic packaging signal is very similar to that of MHV and BCoV. The same result was also obtained in studying the genomic packaging signals of PEDV and HCoV 229E. Further more, the genomic sequence multi-alignment indicated that the locations of packaging signals of SARS-CoV, PEDV, and HCoV overlaped each other. It seems that the mutation rate of packaging signal sequences is much higher than the N protein, while only subtle variations for the M protein.

Conclusions: The probable genomic packaging signal of SARS-CoV is analogous to that of MHV and BCoV, with the corresponding secondary RNA structure locating at the similar region of ORF1b. The positions where genomic packaging signals exist have suffered rounds of mutations, which may influence the primary structures of the N and M proteins consequently.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Computational Biology
  • Coronavirus 229E, Human / genetics
  • Coronavirus, Bovine / genetics
  • Genome, Viral
  • Humans
  • Molecular Sequence Data
  • Murine hepatitis virus / genetics
  • Nucleocapsid Proteins / genetics*
  • Protein Sorting Signals / genetics*
  • Protein Structure, Secondary
  • RNA Interference
  • RNA, Antisense / genetics
  • RNA, Viral / genetics
  • Sequence Alignment
  • Sequence Homology, Amino Acid
  • Severe Acute Respiratory Syndrome / virology*
  • Severe acute respiratory syndrome-related coronavirus / genetics*
  • Severe acute respiratory syndrome-related coronavirus / isolation & purification
  • Viral Matrix Proteins / genetics*

Substances

  • Nucleocapsid Proteins
  • Protein Sorting Signals
  • RNA, Antisense
  • RNA, Viral
  • Viral Matrix Proteins