Analysis of Variability of Functionals of Recombinant Protein Production Trajectories Based on Limited Data

Int J Mol Sci. 2022 Jul 10;23(14):7628. doi: 10.3390/ijms23147628.

Abstract

Making statistical inference on quantities that characterize a temporally measured biochemical process, and analyzing their variability across experimental conditions, is a core challenge in many branches of science. The problem is particularly difficult when the amount of data that can be collected is limited in both the number of replicates and the number of time points per process trajectory. We propose a method for analyzing the variability of smooth functionals of the growth or production trajectories associated with such processes across different experimental conditions. Our modeling approach is based on a spline representation of the mean trajectories. We also develop a bootstrap-based inference procedure for the parameters that accounts for possible multiple comparisons. This methodology is applied to study two types of quantities, the "time to harvest" and the "maximal productivity", in the context of an experiment on the production of recombinant proteins. We complement the findings with extensive numerical experiments comparing the effectiveness of different types of bootstrap procedures for various tests of hypotheses. These numerical experiments demonstrate that the proposed method yields reliable inference on complex characteristics of the processes even in a data-limited environment, where more traditional methods for statistical inference are typically not reliable.
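
The general idea of combining a spline fit of the mean trajectory with bootstrap inference on a functional of that fit can be illustrated with a minimal sketch. It is not the authors' procedure. It rests on several assumptions that do not come from the abstract: the data are simulated from a logistic curve, the spline is a cubic truncated-power regression spline with two hand-picked knots, the functional is the largest slope of the fitted curve (a stand-in for a quantity such as "maximal productivity", whose exact definition is not given here), and the resampling scheme is a plain residual bootstrap. All function names are hypothetical.

```python
import numpy as np

def spline_basis(t, knots):
    """Cubic truncated-power basis: 1, t, t^2, t^3 and (t - k)_+^3 per knot."""
    t = np.asarray(t, dtype=float)
    cols = [np.ones_like(t), t, t**2, t**3]
    cols += [np.clip(t - k, 0.0, None) ** 3 for k in knots]
    return np.column_stack(cols)

def fit_mean(t, y, knots):
    """Least-squares spline coefficients for the mean trajectory."""
    coef, *_ = np.linalg.lstsq(spline_basis(t, knots), y, rcond=None)
    return coef

def max_slope(coef, knots, grid):
    """Assumed functional: largest slope of the fitted curve on a fine grid."""
    curve = spline_basis(grid, knots) @ coef
    return float(np.max(np.gradient(curve, grid)))

def bootstrap_functional(t, y, knots, grid, n_boot=200, seed=0):
    """Residual bootstrap: resample centered residuals, refit, recompute."""
    rng = np.random.default_rng(seed)
    coef = fit_mean(t, y, knots)
    fitted = spline_basis(t, knots) @ coef
    resid = y - fitted
    resid = resid - resid.mean()
    stats = np.empty(n_boot)
    for b in range(n_boot):
        y_star = fitted + rng.choice(resid, size=resid.size, replace=True)
        stats[b] = max_slope(fit_mean(t, y_star, knots), knots, grid)
    return max_slope(coef, knots, grid), stats

# Simulated limited data (an assumption): 3 replicates at 8 time points.
rng = np.random.default_rng(1)
times = np.tile(np.linspace(0.0, 10.0, 8), 3)
y = 10.0 / (1.0 + np.exp(-(times - 5.0))) + rng.normal(0.0, 0.3, times.size)

knots = [3.3, 6.7]                    # hand-picked interior knots (assumption)
grid = np.linspace(0.0, 10.0, 201)    # evaluation grid for the functional

estimate, boot = bootstrap_functional(times, y, knots, grid)
lo, hi = np.percentile(boot, [2.5, 97.5])   # percentile bootstrap interval
print(f"max slope estimate: {estimate:.3f}, 95% interval: [{lo:.3f}, {hi:.3f}]")
```

Comparing such a functional across several experimental conditions would additionally require a multiple-comparison adjustment, which this sketch leaves out.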

Keywords: ANOVA; limited data; linear constraints; production trajectories; resampling techniques; simultaneous hypothesis tests.

MeSH terms

  • Recombinant Proteins / genetics
  • Research Design*

Substances

  • Recombinant Proteins