Duplication models for biological networks

Fan Chung; Linyuan Lu; T Gregory Dewey; David J Galas

doi:10.1089/106652703322539024

Duplication models for biological networks

J Comput Biol. 2003;10(5):677-87. doi: 10.1089/106652703322539024.

Authors

Fan Chung¹, Linyuan Lu, T Gregory Dewey, David J Galas

Affiliation

¹ Department of Mathematics, University of California at San Diego, La Jolla, CA 92093, USA.

PMID: 14633392
DOI: 10.1089/106652703322539024

Abstract

Are biological networks different from other large complex networks? Both large biological and nonbiological networks exhibit power-law graphs (number of nodes with degree k, N(k) approximately k(-beta)), yet the exponents, beta, fall into different ranges. This may be because duplication of the information in the genome is a dominant evolutionary force in shaping biological networks (like gene regulatory networks and protein-protein interaction networks) and is fundamentally different from the mechanisms thought to dominate the growth of most nonbiological networks (such as the Internet). The preferential choice models used for nonbiological networks like web graphs can only produce power-law graphs with exponents greater than 2. We use combinatorial probabilistic methods to examine the evolution of graphs by node duplication processes and derive exact analytical relationships between the exponent of the power law and the parameters of the model. Both full duplication of nodes (with all their connections) as well as partial duplication (with only some connections) are analyzed. We demonstrate that partial duplication can produce power-law graphs with exponents less than 2, consistent with current data on biological networks. The power-law exponent for large graphs depends only on the growth process, not on the starting graph.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Internet
Models, Biological*
Neural Networks, Computer*
Probability
Proteins / chemistry
Reproducibility of Results

Substances

Proteins