Rethink reporting of evaluation results in AI

Science. 2023 Apr 14;380(6641):136-138. doi: 10.1126/science.adf6369. Epub 2023 Apr 13.

Abstract

Aggregate metrics and lack of access to results limit understanding.