Semantic and Syntactic features for Anaphora Resolution for Dutch

Publication type: P1
Publication status: Published
Authors: Hendrickx, I., Hoste, V., & Daelemans, W.
Editor: A. Gelbukh
Journal: Lecture Notes in Computer Science
Series: Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics
Volume: 4919
Pagination: 351-361
Publisher: Springer - Verlag (Haifa, Israel)
Download

Abstract

Weinvestigatetheeffectofencodingadditionalsemanticand syntactic information sources in a classification-based machine learning approach to the task of coreference resolution for Dutch. We experiment both with a memory-based learning approach and a maximum entropy modeling method. As an alternative to using external lexical resources, such as the low-coverage Dutch EuroWordNet, we evaluate the effect of automatically generated semantic clusters as information source. We compare these clusters, which group together semantically similar nouns, to two semantic features based on EuroWordNet encoding synonym and hypernym relations between nouns. The syntactic function of the anaphor and antecedent in the sentence can be an important clue for resolving coreferential relations. As baseline approach, we encode syntactic information as predicted by a memory- based shallow parser in a set of features. We contrast these shallow parse based features with features encoding richer syntactic information from a dependency parser. We show that using both the additional semantic information and syntactic information lead to small but significant performance improvement of our coreference resolution approach.

April 8, 2024	Vacancy post-doctoral assistant at LT3
March 27, 2024	LT3 members involved in the organization of various shared tasks and workshops
Jan. 20, 2024	Veronique appointed as Francqui chair 2023-2024 at ULB
Nov. 7, 2023	Gilles-Maurice shows how ChatGPT can compile excellent dictionaries (for English)
Oct. 25, 2023	Meet the expert: Prof. Lynne Bowker