Categorization of birth weight phenotypes for inclusion in genetic evaluations using a deep neural network

J Anim Sci. 2021 Mar 1;99(3):skab053. doi: 10.1093/jas/skab053.

Abstract

Birth weight (BW) serves as a valuable indicator of the economically relevant trait of calving ease (CE), and erroneous data collection for BW could impact genetic evaluations for CE. The objective of the current study was to evaluate the use of deep neural networks (DNNs) for classifying contemporary groups (CGs) based on the method used to generate BW phenotypes. CGs (n = 120,000,000) ranging between 10 and 250 animals were simulated assuming 12 data collection and CG formation scenarios that could impact CG phenotypic variance, including weights recorded with a digital scale (REAL), hoof tape (TAPE), erroneous data collection (DIRTY), and those that were fabricated (FAB). The performance of eight activation functions (AFs; ReLu, Sigmoid, Exponential, ReLu6, Softmax, Softplus, Leaky ReLu, and Tanh) was evaluated. Four hidden layers were used with seven different scenarios relative to the number of neurons. Simulations were replicated 10 times. In general, accuracy (proportion of correct predictions) across AF and numbers of neurons were similar, with mean correlations ranging between 0.91 and 0.99. The AF ReLu, Sigmoid, Exponential, and ReLu6 had the greatest consistency (mean pair-wise correlation among replicates) with an average correlation of greater than 0.85. Independent of the number of neurons used, the sigmoid function produced the highest accuracy (0.99) and consistency (0.93). The model with the greatest accuracy and consistency was then applied to real BW data supplied by the American Hereford Association. In the real data, the lowest phenotypic variance was for FAB CG (2.65 kg2), REAL CG had the largest (15.84 kg2), and TAPE CG was intermediate (6.84 kg2). To investigate the potential impact of FAB data on routine genetic evaluations, CGs classified as FAB in 90% or more of the replicates were removed from the evaluation for CE, and the rank of resulting genetic predictions were compared with the case where records were not removed. The removal of FAB CG had a moderate impact on the prediction of CE expected progeny differences, primarily for animals with intermediate to high accuracy. The results suggest that a well-trained DNN can be effectively used to classify data based on quality metrics prior to the inclusion in routine genetic evaluation.

Keywords: beef cattle; birth weight; deep neural network; genetic prediction.

MeSH terms

  • Animals
  • Birth Weight
  • Data Collection
  • Goals*
  • Models, Genetic
  • Neural Networks, Computer*
  • Phenotype