The Methodology Behind Network Thinking: Graphs to Analyze Microbial Complexity and Evolution

Methods Mol Biol. 2019:1910:271-308. doi: 10.1007/978-1-4939-9074-0_9.

Abstract

In the post genomic era, large and complex molecular datasets from genome and metagenome sequencing projects expand the limits of what is possible for bioinformatic analyses. Network-based methods are increasingly used to complement phylogenetic analysis in studies in molecular evolution, including comparative genomics, classification, and ecological studies. Using network methods, the vertical and horizontal relationships between all genes or genomes, whether they are from cellular chromosomes or mobile genetic elements, can be explored in a single expandable graph. In recent years, development of new methods for the construction and analysis of networks has helped to broaden the availability of these approaches from programmers to a diversity of users. This chapter introduces the different kinds of networks based on sequence similarity that are already available to tackle a wide range of biological questions, including sequence similarity networks, gene-sharing networks and bipartite graphs, and a guide for their construction and analyses.

Keywords: Ecology; Evolution; Gene remodeling; Lateral gene transfer (LGT); Metagenomics; Sequence similarity network.

MeSH terms

  • Biodiversity
  • Biological Evolution
  • Computational Biology / methods
  • Ecosystem
  • Evolution, Molecular
  • Gene Ontology
  • Gene Regulatory Networks
  • High-Throughput Nucleotide Sequencing
  • Metagenome*
  • Metagenomics* / methods
  • Microbiota
  • Molecular Sequence Annotation
  • Multigene Family