Position Information in Transformers: An Overview

Philipp Dufter; Martin Schmitt; Hinrich SchÃ¼tze

Authors

Philipp Dufter Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen
Martin Schmitt Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen
Hinrich SchÃ¼tze Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen

Abstract

Transformers are arguably the main workhorse in recent Natural Language Processing research. By definition a Transformer is invariant with respect to reordering of the input. However, language is inherently sequential and word order is essential to the semantics and syntax of an utterance. In this article, we provide an overview and theoretical comparison of existing methods to incorporate position information into Transformer models. The objectives of this survey are to (1) showcase that position information in Transformer is a vibrant and extensive research area; (2) enable the reader to compare existing methods by providing a unified notation and systematization of different approaches along important model dimensions; (3) indicate what characteristics of an application should be taken into account when selecting a position encoding; (4) provide stimuli for future research.

Author Biographies

Philipp Dufter, Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen

PostDoc at Center for Information- and Language Processing atÂ Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen
Martin Schmitt, Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen

PhD Student atÂ Center for Information- and Language Processing atÂ Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen
Hinrich SchÃ¼tze, Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen

Professor and Director atÂ Center for Information- and Language Processing atÂ Ludwig-Maximilians-UniversitÃ¤t MÃ¼nchen

Position Information in Transformers: An Overview

Authors

Abstract

Author Biographies

Downloads

Published

Issue

Section

Make a Submission

Information

Announcements

EACL 2026 – CL deadlines for Qualifying Papers

2026 *ACL Conference Dates