We Should Evaluate Real-World Impact

Authors

  • Ehud Reiter Department of Computing Science, University of Aberdeen

Keywords:

evaluation, real-world impact, ACL Anthology, structured survey, A/B test, clinical trial, before-and-after study

Abstract

The ACL community has very little interest in evaluating the real-world impact of NLP systems.  A structured survey of the ACL Anthology shows that perhaps 0.1% of its papers contain such evaluations; furthermore most papers which include impact evaluations present them very sketchily and instead focus on metric evaluations.  NLP technology would be more useful and more quickly adopted if we seriously tried to understand and evaluate its real-world impact.

Published

2026-01-22