Increasing the number of discrete character states for continuous characters generates well-resolved trees that do not reflect phylogeny

Integr Zool. 2014 Aug;9(4):531-41. doi: 10.1111/1749-4877.12076.

Abstract

Since the introduction of the cladistic method in systematics, continuous characters have been integrated into analyses but no methods for their treatment have received unanimous support. Some methods require a large number of character states to discretise continuous characters in order to keep the maximum level of information about taxa differences within the coding scheme. Our objective was to assess the impact of increasing the character state number on the outcomes of phylogenetic analyses. Analysis of a variety of simulated datasets shows that these methods for coding continuous characters can lead to the generation of well-resolved trees that do not reflect a phylogenetic signal. We call this phenomenon the flattening of the tree-length distribution; it is influenced by both the relative quantity of continuous characters in relation to discrete characters, and the number of characters in relation to the number of taxa. Bootstrap tests provide a method to avoid this potential bias.

Keywords: cladistics; continuous character; gap weighting; phylogeny.

MeSH terms

  • Animals
  • Classification / methods*
  • Computational Biology
  • Models, Biological
  • Phylogeny*