On the Derivational Entropy of Left-to-Right Probabilistic Finite-State Automata and Hidden Markov Models
Abstract
Probabilistic Finite-State Automata are a formalism that is widely
used in many problems of Automatic Speech Recognition and Natural
Language Processing. Probabilistic Finite-State Automata are closely
related to other finite-state models such as Weighted Finite-State
Automata, Word Lattices, and Hidden Markov Models; therefore, they
share many similar properties and problems. Entropy measures of
finite-state models have been researched in the past to study the
information capacity of these models. The derivational
study the information capacity of these models. The derivational
entropy quantifies the uncertainty that the model has about the
probability distribution it represents. The derivational entropy in
a finite-state automaton is computed from the probability
accumulated in all of its individual paths. The computation of the
entropy from a weighted finite-state automaton requires a normalized
model. This paper studies an efficient computation of the
derivational entropy of left-to-right probabilistic finite-state
automata and introduces an efficient algorithm for normalizing
weighted finite-state automata. The computation of the derivational
entropy is also extended to continuous Hidden Markov Models.