Biotechnology Data Analysis Training with Jupyter Notebooks

J Microbiol Biol Educ. 2023 Jan 16;24(1):e00113-22. doi: 10.1128/jmbe.00113-22. eCollection 2023 Apr.

Abstract

Biotechnology has experienced innovations in analytics and data processing. As the volume of data and its complexity grow, new computational procedures for extracting information are being developed. However, the rate of change outpaces the adaptation of biotechnology curricula, necessitating new teaching methodologies to equip biotechnologists with data analysis abilities. To simulate experimental data, we created a virtual organism simulator (silvio) by combining diverse cellular and subcellular microbial models. With the silvio Python package, we constructed a computer-based instructional workflow to teach growth curve data analysis, promoter sequence design, and expression rate measurement. The instructional workflow is a Jupyter Notebook with background explanations and Python-based experiment simulations combined. The data analysis is conducted either within the Notebook in Python or externally with Excel. This instructional workflow was separately implemented in two distance courses for Master's students in biology and biotechnology with assessment of the pedagogic efficiency. The concept of using virtual organism simulations that generate coherent results across different experiments can be used to construct consistent and motivating case studies for biotechnological data literacy.

Keywords: Jupyter Notebooks; Python; biotechnology; cloning; data analysis; growth modeling; mathematical model; recombinant expression; recombinant protein production; systems biology.