Genome-Enabled Prediction Methods Based on Machine Learning

Methods Mol Biol. 2022:2467:189-218. doi: 10.1007/978-1-0716-2205-6_7.

Abstract

Growth of artificial intelligence and machine learning (ML) methodology has been explosive in recent years. In this class of procedures, computers get knowledge from sets of experiences and provide forecasts or classification. In genome-wide based prediction (GWP), many ML studies have been carried out. This chapter provides a description of main semiparametric and nonparametric algorithms used in GWP in animals and plants. Thirty-four ML comparative studies conducted in the last decade were used to develop a meta-analysis through a Thurstonian model, to evaluate algorithms with the best predictive qualities. It was found that some kernel, Bayesian, and ensemble methods displayed greater robustness and predictive ability. However, the type of study and data distribution must be considered in order to choose the most appropriate model for a given problem.

Keywords: Bayesian methods; Complex traits; Ensemble methods; GWP; Kernel methods; Machine learning; Meta-analysis; Neural networks.

Publication types

  • Meta-Analysis

MeSH terms

  • Algorithms
  • Animals
  • Artificial Intelligence*
  • Bayes Theorem
  • Genome
  • Machine Learning*