LESSLEX: Linking multilingual Embeddings to SenSe representations of Lexical items

Authors

  • Davide Colla Computer Science Department, University of Turin, ITALY
  • Enrico Mensa Computer Science Department, University of Turin, ITALY
  • Daniele P. Radicioni Computer Science Department, University of Turin, ITALY

Abstract

We present LESSLEX, a novel multilingual lexical resource. Different from the vast majority of existing approaches, we ground our embeddings on a sense inventory made available from the BabelNet semantic network. In this setting, multilingual access is governed by the mapping of terms onto their underlying sense descriptions, such that all vectors co-exist in the same semantic space. For each term we have thus the 'blended' terminological vector along with those describing all senses associated to that term. LESSLEX has been tested on the conceptual similarity task: we experimented over the principal data sets for this task in their multilingual and cross-lingual variants, improving on or closely approaching state-of-the-art results. We conclude by arguing that LESSLEX vectors may be relevant for practical applications and for interdisciplinary research on conceptual and lexical access and competence.

Published

2024-12-05

Issue

Section

Special Issue: Multilingual and Interlingual Semantic Representations for Natural Language Processing