The P5 multicopy gene family in the MHC is related in sequence to human endogenous retroviruses HERV-L and HERV-16

Immunogenetics. 1999 May;49(5):404-12. doi: 10.1007/s002510050513.

Abstract

P5 is believed to be a multicopy gene family with at least eight members restricted to the major histocompatibility complex (MHC). Although the function of P5 genes is not known, one of the family members, P5-1, was found previously to be specifically transcribed in lymphoid cells and tissue. In this study, we used computer programs Censor and RepeatMasker, and dot plot analysis to show that the major P5 family members are related in sequence to human endogenous retroviruses, HERV-L and HERV-16. The P5-HERV sequences have at least 60% sequence identity with HERV-L within the pol region but differ significantly within the gag and LTR regions. The LTRs flanking the P5-HERV sequences share about 70% identity with the repeat element LTR16B. Structural analysis of open reading frames (ORFs) confirmed that the P5-1 cDNA is characterized by many stop codons and short putative coding regions resembling the patterns found in the HERV-L nucleotide sequence rather than those found in an mRNA sequence such as expressed by HLA class I genes. A 159 base pair (bp) ORF at the 5' end of the 2535 bp P5-1 mRNA may code for a peptide of 52 amino acids with a domain identical in sequence to the signal peptide of HLA molecules. Furthermore, the P5-1 mRNA is complementary in sequence to retroviral pol mRNA. Therefore, the P5-1 genomic sequence appears to be an example of an HERV within the MHC that expresses an antisense transcript with a possible role in immunity to retrovirus infection.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Codon, Terminator
  • Endogenous Retroviruses / genetics*
  • Evolution, Molecular
  • Humans
  • Major Histocompatibility Complex*
  • Molecular Sequence Data
  • Multigene Family*
  • Open Reading Frames
  • Promoter Regions, Genetic
  • Protein Sorting Signals / genetics
  • Proteins / genetics*
  • RNA, Long Noncoding
  • RNA, Messenger / genetics
  • RNA, Untranslated
  • Recombinant Fusion Proteins / genetics
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Software

Substances

  • Codon, Terminator
  • HCP5 long noncoding RNA, human
  • Protein Sorting Signals
  • Proteins
  • RNA, Long Noncoding
  • RNA, Messenger
  • RNA, Untranslated
  • Recombinant Fusion Proteins