A Unifying Theory of Transition-based and Sequence Labeling Parsing

Carlos Gómez-Rodríguez, Michalina Strzyz, David Vilares


Abstract
We define a mapping from transition-based parsing algorithms that read sentences from left to right to sequence labeling encodings of syntactic trees. This not only establishes a theoretical relation between transition-based parsing and sequence-labeling parsing, but also provides a method to obtain new encodings for fast and simple sequence labeling parsing from the many existing transition-based parsers for different formalisms. Applying it to dependency parsing, we implement sequence labeling versions of four algorithms, showing that they are learnable and obtain comparable performance to existing encodings.
Anthology ID:
2020.coling-main.336
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
3776–3793
Language:
URL:
https://aclanthology.org/2020.coling-main.336
DOI:
10.18653/v1/2020.coling-main.336
Bibkey:
Cite (ACL):
Carlos Gómez-Rodríguez, Michalina Strzyz, and David Vilares. 2020. A Unifying Theory of Transition-based and Sequence Labeling Parsing. In Proceedings of the 28th International Conference on Computational Linguistics, pages 3776–3793, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
A Unifying Theory of Transition-based and Sequence Labeling Parsing (Gómez-Rodríguez et al., COLING 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.coling-main.336.pdf
Code
 mstrise/dep2label