HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

Ivan VuliÄ‡; Daniela Gerz; Douwe Kiela; Felix Hill; Anna Korhonen

Authors

Ivan VuliÄ‡ University of Cambridge
Daniela Gerz University of Cambridge
Douwe Kiela Facebook AI Research
Felix Hill Google DeepMind
Anna Korhonen University of Cambridge

Abstract

We introduce HyperLex - a dataset and evaluation resource that quantifies the extent of of the semantic category membership, that is, type-of relation also known as hyponymy-hypernymy or lexical entailment (LE) relation between 2,616 concept pairs. Cognitive psychology research has established that typicality and category/class membership are computed in human semantic memory as a gradual rather than binary relation. Nevertheless, most NLP research, and existing large-scale invetories of concept category membership (WordNet, DBPedia, etc.) treat category membership and LE as binary. To address this, we asked hundreds of native English speakers to indicate typicality and strength of category membership between a diverse range of concept pairs on a crowdsourcing platform. Our results confirm that category membership and LE are indeed more gradual than binary. We then compare these human judgements with the predictions of automatic systems, which reveals a huge gap between human performance and state-of-the-art LE, distributional and representation learning models, and substantial differences Â between the models themselves. We discuss a pathway for improving semantic models to overcome this discrepancy, and indicate future application areas for improved graded LE systems.

Author Biographies

Ivan VuliÄ‡, University of Cambridge

Language Technology Lab, Department of Theoretical and Applied Linguistics
Postdoctoral Research Associate
Daniela Gerz, University of Cambridge

Language Technology Lab, Department of Theoretical and Applied Linguistics
PhD Student
Douwe Kiela, Facebook AI Research

Facebook AI Research
Postdoctoral Researcher
Felix Hill, Google DeepMind

Language Technology Lab, Department of Theoretical and Applied Linguistics
PhD Student
Anna Korhonen, University of Cambridge

Reader in Computational Linguistics, Department of Theoretical and Applied Linguistics
Co-director of Language Technology Lab
Â

HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment

Authors

Abstract

Author Biographies

Published

Issue

Section

Make a Submission

Information

Announcements

EACL 2027 - CL deadlines for Qualifying Papers

Special Issue on the Ethics of NLP and CL in Computational Linguistics