Chromosome Gene Orientation Inversion Networks (GOINs) of Plasmodium Proteome

J Proteome Res. 2018 Mar 2;17(3):1258-1268. doi: 10.1021/acs.jproteome.7b00861. Epub 2018 Feb 5.

Abstract

The spatial distribution of genes in chromosomes seems not to be random. For instance, only 10% of genes are transcribed from bidirectional promoters in humans, and many more are organized into larger clusters. This raises intriguing questions previously asked by different authors. We would like to add a few more questions in this context, related to gene orientation inversions. Does gene orientation (inversion) follow a random pattern? Is it relevant to biological activity somehow? We define a new kind of network coined as the gene orientation inversion network (GOIN). GOIN's complex network encodes short- and long-range patterns of inversion of the orientation of pairs of gene in the chromosome. We selected Plasmodium falciparum as a case of study due to the high relevance of this parasite to public health (causal agent of malaria). We constructed here for the first time all of the GOINs for the genome of this parasite. These networks have an average of 383 nodes (genes in one chromosome) and 1314 links (pairs of gene with inverse orientation). We calculated node centralities and other parameters of these networks. These numerical parameters were used to study different properties of gene inversion patterns, for example, distribution, local communities, similarity to Erdös-Rényi random networks, randomness, and so on. We find clues that seem to indicate that gene orientation inversion does not follow a random pattern. We noted that some gene communities in the GOINs tend to group genes encoding for RIFIN-related proteins in the proteome of the parasite. RIFIN-like proteins are a second family of clonally variant proteins expressed on the surface of red cells infected with Plasmodium falciparum. Consequently, we used these centralities as input of machine learning (ML) models to predict the RIFIN-like activity of 5365 proteins in the proteome of Plasmodium sp. The best linear ML model found discriminates RIFIN-like from other proteins with sensitivity and specificity 70-80% in training and external validation series. All of these results may point to a possible biological relevance of gene orientation inversion not directly dependent on genetic sequence information. This work opens the gate to the use of GOINs as a tool for the study of the structure of chromosomes and the study of protein function in proteome research.

Keywords: Plasmodium sp. proteome; chromosome microstructure; complex networks; gene orientation; machine learning; malaria.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosomes / chemistry*
  • Erythrocytes / parasitology
  • Gene Expression Regulation
  • Gene Regulatory Networks*
  • Genes, Protozoan*
  • Humans
  • Machine Learning
  • Membrane Proteins / genetics*
  • Membrane Proteins / metabolism
  • Multigene Family
  • Plasmodium falciparum / genetics*
  • Plasmodium falciparum / metabolism
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • Proteome / genetics*
  • Proteome / metabolism
  • Protozoan Proteins / genetics*
  • Protozoan Proteins / metabolism
  • Sequence Inversion*
  • Software

Substances

  • Membrane Proteins
  • Protein Isoforms
  • Proteome
  • Protozoan Proteins
  • RIFIN protein, Plasmodium falciparum