Comparison of genomic selection models for exploring predictive ability of complex traits in breeding programs

Plant Genome. 2021 Nov;14(3):e20158. doi: 10.1002/tpg2.20158. Epub 2021 Nov 1.

Abstract

Traits with a complex unknown genetic architecture are common in breeding programs. However, they pose a challenge for selection due to a combination of complex environmental and pleiotropic effects that impede the ability to create mapping populations to characterize the trait's genetic basis. One such trait, seedling emergence of wheat (Triticum aestivum L.) from deep planting, presents a unique opportunity to explore the best method to use and implement genetic selection (GS) models to predict a complex trait. Seventeen GS models were compared using two training populations, consisting of 473 genotypes from a diverse association mapping panel phenotyped from 2015 to 2019 and the other training population consisting of 643 breeding lines phenotyped in 2015 and 2020 in Lind, WA, with 40,368 markers. There were only a few significant differences between GS models, with support vector machines reaching the highest accuracy of 0.56 in a single breeding line trial using cross-validations. However, the consistent moderate accuracy of the parametric models indicates little advantage of using nonparametric models within individual years, but the nonparametric models show a slight increase in accuracy when combing years for complex traits. There was an increase in accuracy using cross-validations from 0.40 to 0.41 using diversity panels lines to breeding lines. Overall, our study showed that breeders can accurately predict and implement GS for a complex trait by using nonparametric machine learning models within their own breeding programs with increased accuracy as they combine training populations over the years.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Genomics
  • Models, Genetic
  • Multifactorial Inheritance*
  • Plant Breeding*
  • Triticum / genetics