Specified Certainty Classification, with Application to Read Classification for Reference-Guided Metagenomic Assembly

ArXiv [Preprint]. 2021 Sep 13:arXiv:2109.06677v2.

Abstract

Specified Certainty Classification (SCC) classifiers whose outputs carry uncertainties, typically in the form of Bayesian posterior probabilities. By allowing the classifier output to be less precise than one of a set of atomic decisions, SCC allows all decisions to achieve a specified level of certainty, as well as provides insights into classifier behavior by examining all decisions that are possible. Our primary illustration is read classification for reference-guided genome assembly, but we demonstrate the breadth of SCC by also analyzing COVID-19 vaccination data.

Keywords: Bayesian analysis; classifier; metagenomics; posterior probabilities; read classification; uncertainty quantification.

Publication types

  • Preprint