Modeling syntactic dependencies in southern Dutch dialects

Publication type
C1
Publication status
Published
Authors
De Langhe, L., Degraeuwe, JRD, Farasyn, M., & Hoste, V.
Editor
Eva Hajičová and Sylvain Kahane
Series
Proceedings of the Eighth International Conference on Dependency Linguistics
Pagination
24-35
Publisher
Association for Computational Linguistics (ACL)
Conference
SyntaxFest 2025 (Ljubljana, Slovenia)
Download
(.pdf)
View in Biblio
(externe link)

Abstract

Dependency parsing of non-normative language varieties remains a challenge for modern NLP. While contemporary parsers excel at standardized languages, dialectal variation – especially in function words, conjunctives, and verb clustering – introduces syntactic ambiguity that disrupts traditional parsing approaches. In this paper, we conduct a quantitative evaluation of syntactic dependencies in Southern Dutch dialects, leveraging a standardized dialect corpus to isolate syntactic effects from lexical variation. Using a neural biaffine dependency parser with various mono- and multilingual transformer-based encoders, we benchmark parsing performance on standard Dutch, dialectal data, and mixed training sets. Our results demonstrate that incorporating dialect-specific data significantly enhances parsing accuracy, yet certain syntactic structures remain difficult to resolve, even with dedicated adaptation. These findings highlight the need for more nuanced parsing strategies and improved syntactic modeling for non-normative language varieties.