Morphological and Syntactic Case in Statistical Dependency Parsing

Authors

  • Wolfgang Seeker Institut für Maschinelle Sprachverarbeitung University of Stuttgart
  • Jonas Kuhn Institut für Maschinelle Sprachverarbeitung University of Stuttgart

Abstract

Morphologically rich languages that exhibit free word order frequently use case systems to mark the function of nominal elements, especially for the core argument functions of a verb. The standard pipeline approach in syntactic dependency parsing assumes a complete disambiguation of morphological (case) information prior to automatic syntactic analysis. Parsing experiments on Czech, German, and Hungarian show that this approach is susceptible to propagating morphological annotation errors when parsing languages that show syncretism in their morphological case paradigms. This article demonstrates how by interpreting (underspecified) case as a filtering device that restricts the options for syntactic analysis, carefully designed morphosyntactic constraints can restrict the search space of a statistical dependency parser, and prevent solutions that would violate the syntactic restrictions overtly marked in the morphology of the words in a given sentence.

Published

2024-12-05

Issue

Section

Special Issue on Parsing Morphologically Rich Languages