We Should Evaluate Real-World Impact
Keywords:
evaluation, real-world impact, ACL Anthology, structured survey, A/B test, clinical trial, before-and-after study
Abstract
The ACL community has very little interest in evaluating the real-world impact of NLP systems. A structured survey of the ACL Anthology shows that perhaps 0.1% of its papers contain such evaluations; furthermore, most papers that include impact evaluations present them only sketchily and focus instead on metric evaluations. NLP technology would be more useful and more quickly adopted if we seriously tried to understand and evaluate its real-world impact.
Published
2026-01-22
Section
Last Words