Are batch effects still relevant in the age of big data?

Trends Biotechnol. 2022 Sep;40(9):1029-1040. doi: 10.1016/j.tibtech.2022.02.005. Epub 2022 Mar 10.

Abstract

Batch effects (BEs) are technical biases that may confound analysis of high-throughput biotechnological data. BEs are complex and effective mitigation is highly context-dependent. In particular, the advent of high-resolution technologies such as single-cell RNA sequencing presents new challenges. We first cover how BE modeling differs between traditional datasets and the new data landscape. We also discuss new approaches for measuring and mitigating BEs, including whether a BE is significant enough to warrant correction. Even with the advent of machine learning and artificial intelligence, the increased complexity of next-generation biotechnological data means increased complexities in BE management. We forecast that BEs will not only remain relevant in the age of big data but will become even more important.

Keywords: RNA sequencing; artificial intelligence; batch effect; machine learning; single cell.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence*
  • Big Data*
  • Machine Learning