F*: an interpretable transformation of the F-measure

Mach Learn. 2021;110(3):451-456. doi: 10.1007/s10994-021-05964-1. Epub 2021 Mar 15.

Abstract

The F-measure, also known as the F1-score, is widely used to assess the performance of classification algorithms. However, some researchers find it lacking in intuitive interpretation, questioning the appropriateness of combining two aspects of performance as conceptually distinct as precision and recall, and also questioning whether the harmonic mean is the best way to combine them. To ease this concern, we describe a simple transformation of the F-measure, which we call F* (F-star), which has an immediate practical interpretation.
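The abstract does not state the transformation itself. As a minimal sketch, assuming the definition F* = F / (2 - F) given in the body of the paper, the Python below computes F and F* from confusion-matrix counts and checks that F* matches TP / (TP + FP + FN), the share of correct classifications among objects that are either truly positive or predicted positive. The function names and the example counts are hypothetical and chosen only for illustration.

```python
import math


def f_measure(tp: int, fp: int, fn: int) -> float:
    """F-measure (F1): harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def f_star(f: float) -> float:
    """Assumed transformation F* = F / (2 - F); monotonic in F on [0, 1]."""
    return f / (2 - f)


# Hypothetical counts: true positives, false positives, false negatives.
tp, fp, fn = 6, 2, 2

f = f_measure(tp, fp, fn)
fs = f_star(f)

# Direct form: correct positives over all objects that are actually
# positive or classified as positive.
direct = tp / (tp + fp + fn)

print(f"F = {f:.3f}, F* = {fs:.3f}, TP/(TP+FP+FN) = {direct:.3f}")
assert math.isclose(fs, direct)
```

Because the transformation is monotonic, ranking classifiers by F* gives the same order as ranking them by F; only the scale and its interpretation change.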

Keywords: Classification; Error rate; F1-score; Interpretability; Performance; Precision; Recall.