Survey of Low-Resource Machine Translation

Authors

  • Barry Haddow University of Edinburgh
  • Rachel Bawden Inria
  • Antonio Valerio Miceli Barone University of Edinburgh
  • Jindrich Helcl University of Edinburgh
  • Alexandra Birch University of Edinburgh

Abstract

We present a a survey covering the state of the art in low-resource machine translation. There are currently around 7000 languages spoken in the world and almost all language pairs lack significant resources for training machine translation models. There has been increasing interest in research addressing the challenge of producing useful translation models when very little translated training data is available. We present a high level summary of this topical field and provide an overview of best practices.

Author Biographies

  • Barry Haddow, University of Edinburgh
    School of Informatics, Senior Research Fellow
  • Alexandra Birch, University of Edinburgh
    School of Informatics, Reader in Natural Language Processing

Published

2024-11-21