Efficient genetic value prediction using incomplete omics data

Matthias Westhues; Claas Heuer; Georg Thaller; Rohan Fernando; Albrecht E Melchinger

doi:10.1007/s00122-018-03273-1

Theor Appl Genet. 2019 Apr;132(4):1211-1222. doi: 10.1007/s00122-018-03273-1. Epub 2019 Jan 17.

Authors

Matthias Westhues¹, Claas Heuer²,³, Georg Thaller², Rohan Fernando⁴, Albrecht E Melchinger⁵

Affiliations

¹ Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany.
² Institute of Animal Breeding and Husbandry, Christian-Albrechts-University Kiel, 24098, Kiel, Germany.
³ Inguran, LLC dba STGenetics, 22575 SH6 South, Navasota, TX, 77868, USA.
⁴ Department of Animal Science, Iowa State University, Ames, IA, 50011, USA.
⁵ Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany. melchinger@uni-hohenheim.de.

PMID: 30656353
DOI: 10.1007/s00122-018-03273-1

Abstract

Covering a subset of individuals with a quantitative predictor, while imputing records for all others using pedigree or genomic data, could improve the precision of predictions while controlling for costs.

Predicting genetic values with high accuracy is pivotal for effective candidate selection in animal and plant breeding. Novel 'omics'-based predictors have been shown to improve upon established genome-based predictions of important complex traits but require laborious and expensive assays. As a consequence, there are various datasets with full genetic marker coverage of all studied individuals but incomplete coverage with other 'omics' data. In animal breeding, single-step prediction was introduced to efficiently combine pedigree information, collected on a large number of animals, with genomic information, collected on a smaller subset of animals, for breeding value estimation without bias. Using two maize datasets of inbred lines and hybrids, we show that the single-step framework facilitates imputing transcriptomic data, boosting forecasts when the predictive ability of the transcriptomic data exceeds that of pedigree or genomic data. Our results suggest that covering only a subset of inbred lines with 'omics' predictors and imputing all others using pedigree or genomic data could enable breeders to improve trait predictions while keeping costs under control. Employing 'omics' predictors could particularly improve candidate selection in hybrid breeding because the success of forecasts is a strongly convex function of predictive ability.
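The abstract gives no formulas, so the following is only a minimal sketch of how a single-step relationship matrix is commonly assembled in the animal breeding literature, with the roles adapted to the setting described above: a kernel `A` available for all individuals (pedigree or genomic) and a kernel `G22` available only for the subset with 'omics' records (for example, transcriptomic). The function name `single_step_kernel`, its arguments, and the assumption that both kernels are symmetric positive definite and on a comparable scale are illustrative choices, not details taken from the paper.

```python
import numpy as np


def single_step_kernel(A, G22, idx_obs):
    """Blend a complete kernel A with a kernel G22 known only for idx_obs.

    A       : (n, n) symmetric positive definite kernel for all individuals.
    G22     : (m, m) kernel for the m individuals that have 'omics' records.
    idx_obs : positions (in A) of those m individuals, in the order of G22.

    Returns the (n, n) combined matrix H in the original ordering of A.
    Hypothetical helper for illustration only.
    """
    n = A.shape[0]
    idx_obs = np.asarray(idx_obs)
    idx_mis = np.setdiff1d(np.arange(n), idx_obs)  # individuals without records

    A11 = A[np.ix_(idx_mis, idx_mis)]
    A12 = A[np.ix_(idx_mis, idx_obs)]
    A22 = A[np.ix_(idx_obs, idx_obs)]

    # B = A12 @ inv(A22), computed with a linear solve (A22 is symmetric).
    B = np.linalg.solve(A22, A12.T).T

    H = np.empty_like(A, dtype=float)
    # Unobserved block: base relationships plus the imputed deviation of G22 from A22.
    H[np.ix_(idx_mis, idx_mis)] = A11 + B @ (G22 - A22) @ B.T
    # Cross blocks: records of unobserved individuals are regressed on observed ones.
    H[np.ix_(idx_mis, idx_obs)] = B @ G22
    H[np.ix_(idx_obs, idx_mis)] = G22 @ B.T
    # Observed block: the 'omics' kernel is used as is.
    H[np.ix_(idx_obs, idx_obs)] = G22
    return H
```

Under these assumptions, the returned matrix could replace a purely pedigree- or genome-based kernel in a standard mixed-model (BLUP-type) prediction. When `G22` equals the corresponding block of `A`, the function returns `A` unchanged, which is a convenient sanity check.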

MeSH terms

Genomics / methods*
Genotype
Hybridization, Genetic
Inbreeding
Quantitative Trait Loci / genetics
Zea mays / genetics*
