ASAS-NANP symposium: mathematical modeling in animal nutrition-Making sense of big data and machine learning: how open-source code can advance training of animal scientists

J Anim Sci. 2023 Jan 3:101:skad317. doi: 10.1093/jas/skad317.

Abstract

Advancements in precision livestock technology have resulted in an unprecedented amount of data being collected on individual animals. Throughout the data analysis chain, many bottlenecks occur, including processing raw sensor data, integrating multiple streams of information, incorporating data into animal growth and nutrition models, developing decision support tools for producers, and training animal science students as data scientists. To realize the promise of precision livestock management technologies, open-source tools and tutorials must be developed to reduce these bottlenecks, which are a direct result of the tremendous time and effort required to create data pipelines from scratch. Open-source programming languages (e.g., R or Python) can provide users with tools to automate many data processing steps for cleaning, aggregating, and integrating data. However, the steps from data collection to training artificial intelligence models and integrating predictions into mathematical models can be tedious for those new to statistical programming, with few examples pertaining to animal science. To address this issue, we outline how open-source code can help overcome many of the bottlenecks that occur in the era of big data and precision livestock technology, with an emphasis on how routine use and publication of open-source code can help facilitate training the next generation of animal scientists. In addition, two case studies are presented with publicly available data and code to demonstrate how open-source tutorials can be utilized to streamline data processing, train machine learning models, integrate with animal nutrition models, and facilitate learning. The National Animal Nutrition Program focuses on providing research-based data on animal performance and feeding strategies. Open-source data and code repositories with examples specific to animal science can help create a reinforcing mechanism aimed at advancing animal science research.

Keywords: Open-source; big data; livestock management systems.

Plain language summary

Livestock production is undergoing a new revolution of incorporating advanced technology to inform animal management. As more and more technologies come to market, new challenges arise with developing a workforce trained to handle big datasets generated from these technologies and turning datasets into insight for livestock producers. This can be especially challenging as multiple data streams ranging from climate and weather information to real-time metrics on animal performance need to be efficiently processed and incorporated into animal production models. Open-source code is one possible solution to these challenges because it is designed to be made publicly available so any user can view, alter, and improve upon existing code. This paper aims to highlight how open-source code can help address many of the challenges of precision livestock technology, including efficient data processing, data integration, development of decision tools, and training of future animal scientists. In addition, the need for open-source tutorials and datasets specific to animal science are included to help facilitate greater adoption of open science.

MeSH terms

  • Animals
  • Artificial Intelligence*
  • Big Data*
  • Humans
  • Machine Learning
  • Models, Theoretical
  • Software