Fish species lifespan prediction from promoter cytosine-phosphate-guanine density

Mol Ecol Resour. 2023 Feb 24. doi: 10.1111/1755-0998.13774. Online ahead of print.

Abstract

Lifespan is a key attribute of a species' life cycle and varies extensively among major lineages of animals. In fish, lifespan varies by several orders of magnitude, with reported values ranging from less than 1 year to approximately 400 years. Lifespan information is particularly useful for species management, as it can be used to estimate invasion potential, extinction risk and sustainable harvest rates. Despite its utility, lifespan is unknown for most fish species. This is due to the difficulties associated with accurately identifying the oldest individual(s) of a given species, and/or deriving lifespan estimates that are representative for an entire species. Recently, it has been shown that CpG density in gene promoter regions can be used to predict lifespan in mammals and other vertebrates, with variable accuracy across taxa. To improve accuracy of lifespan prediction in a non-mammalian vertebrate group, here we develop a fish-specific genomic lifespan predictor. Our new model includes more than eight times the number of fish species included in the previous vertebrate model (n = 442) and uses fish-specific gene promoters as reference sequences. The model predicts fish lifespan from genomic CpG density alone (measured as CpG observed/expected ratio), explaining 64% of the variance between known and predicted lifespans. The predictions are highly robust to variation in genome quality and are applicable to all classes of fish, a taxonomically diverse and speciose group. The results demonstrate the value of promoter CpG density as a universal predictor of fish lifespan that can be applied where empirical data are unavailable or impracticable to obtain.
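
For readers unfamiliar with the CpG observed/expected ratio mentioned above, the following is a minimal sketch of how such a ratio is commonly computed for a single sequence. It assumes the widely used definition (number of CpG dinucleotides multiplied by sequence length, divided by the product of the C and G counts); the abstract does not state which exact variant the model uses, and the function name `cpg_obs_exp` is hypothetical.

```python
def cpg_obs_exp(seq: str) -> float:
    """Return the CpG observed/expected ratio of a DNA sequence.

    Assumed definition (not confirmed by the abstract):
        O/E = (count of "CG" * sequence length) / (count of C * count of G)
    """
    s = seq.upper()
    n = len(s)
    c = s.count("C")
    g = s.count("G")
    cpg = s.count("CG")  # "CG" cannot overlap itself, so count() is exact

    # With no C or no G, the expected CpG count is zero; report 0.0
    # rather than dividing by zero.
    if c == 0 or g == 0:
        return 0.0
    return cpg * n / (c * g)


if __name__ == "__main__":
    # Toy promoter-like fragment, purely illustrative.
    print(cpg_obs_exp("ACGTCGGGCATCGA"))
```

Under this definition, a ratio near 1 means CpG dinucleotides occur about as often as the base composition alone would predict, while lower values indicate CpG depletion. In a predictor of the kind described, one such value per promoter would serve as an input feature.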

Keywords: t max; CpG; elastic net; fish; longevity; prediction.