Applications and Trends of Machine Learning in Genomics and Phenomics for Next-Generation Breeding

Plants (Basel). 2019 Dec 25;9(1):34. doi: 10.3390/plants9010034.

Abstract

Crops are the major source of food supply and raw materials for the processing industry. A balance between crop production and food consumption is continually threatened by plant diseases and adverse environmental conditions. This leads to serious losses every year and results in food shortages, particularly in developing countries. Presently, cutting-edge technologies for genome sequencing and phenotyping of crops combined with progress in computational sciences are leading a revolution in plant breeding, boosting the identification of the genetic basis of traits at a precision never reached before. In this frame, machine learning (ML) plays a pivotal role in data-mining and analysis, providing relevant information for decision-making towards achieving breeding targets. To this end, we summarize the recent progress in next-generation sequencing and the role of phenotyping technologies in genomics-assisted breeding toward the exploitation of the natural variation and the identification of target genes. We also explore the application of ML in managing big data and predictive models, reporting a case study using microRNAs (miRNAs) to identify genes related to stress conditions.

Keywords: PacBio; QTLs dissection; genome-wide association studies; genomics; genotyping by sequencing; machine learning; microRNA; nanopore; phenomics.

Publication types

  • Review