AutoExtend: Combining Word Embeddings with Semantic Resources

Authors

  • Sascha Rothe University of Munich
  • Hinrich Schütze University of Munich

Abstract

We present AutoExtend, a system that combines word embeddings with semantic resources by learning embeddings for non-word objects like synsets and entities and learning improved word embeddings which incorporate the semantic information from the resource. It is flexible in that it can take any word embeddings as input and does not need an additional training corpus. The obtained embeddings live in the same vector space as the input word embeddings. A sparse tensor formalization guarantees efficiency and parallelizability. We use WordNet, GermaNet and Freebase as semantic resources. AutoExtend achieves state-of-the-art performance on Word-in-Context Similarity and Word Sense Disambiguation tasks.

Published

2024-12-05

Issue

Section

Short paper