A comparative study of convicilin storage protein gene sequences in species of the tribe Vicieae

Genome. 2008 Jul;51(7):511-23. doi: 10.1139/G08-036.

Abstract

Convicilins, a set of seed storage proteins, differ from vicilins, a related group of seed storage proteins, mainly because of the presence of the N-terminal extension, an additional sequence of amino acids in the sequence corresponding to the first exon. Convicilins have been described only in species of the legume tribe Vicieae. One or two genes for convicilins have been identified in most species of this tribe. The genus Pisum is the main exception, since two genes have been identified in most of its species. Thirty-four new convicilin gene sequences from 29 different species (Lathyrus, Lens, Pisum, and Vicia spp.) have been analyzed here. Convicilin gene sequences are generally organized in 6 exons, but in some instances one of the internal introns (2nd or 4th) is lost. In these 29 species, the N-terminal extension is formed by a stretch of 99 to 196 amino acids particularly rich in polar and charged amino acids (on average, it contains 29.43% glutamic acid and 15.38% arginine residues). This N-terminal extension has the characteristics of an intrinsically unstructured region (IUR), one of the categories of protein "degenerate sequences". A comparative analysis indicates that the N-terminal extension sequence has evolved faster than the surrounding sequence, which is common to all vicilins, and it evolved mainly through a series of duplications of short internal sequences and triplet expansions, the predominant one being GAA. This agrees with the evolution of IURs, which is faster than the evolution of surrounding sequences and is mainly due to replication slippage and unequal crossover recombination. Alternative maximum-likelihood trees of phylogenetic relationships among the 29 Vicieae species based on the convicilin exon sequences are presented and discussed.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Fabaceae / genetics*
  • Genes, Plant
  • Lathyrus / genetics
  • Lens Plant / genetics
  • Phylogeny
  • Pisum sativum / genetics
  • Plant Proteins / classification
  • Plant Proteins / genetics*
  • Sequence Analysis, DNA
  • Vicia / genetics

Substances

  • Plant Proteins