A Coreference Corpus and Resolution System for Dutch

Publication type: C1
Publication status: Published
Authors: Hendrickx, I., Bouma, G., Coppens, F., Daelemans, W., Hoste, V., Kloosterman, G., Mineur, A., Van Der Vloet, J., & Verschelde, J.
Editor: N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odjik, S. Piperidis, and D. Tapias
Series: Proceedings of the Sixth Conference on International Language Resources and Evaluation (LREC'08)
Pagination: 144-149
Publisher: European Language Resources Association (Marrakech, Morocco)
Download

Abstract

We present the main outcomes of the COREA project: a corpus annotated with coreferential relations and a coreference resolution system for Dutch. We discuss the annotation of the corpus: the type of annotated relations, the guidelines, the annotation tool and inter-annotator agreement. We also show a visualization of the annotated relations. The standard approach to evaluate a coreference resolution system is to compare the predictions of the system to a hand-annotated gold standard test set (cross-validation). A more practically oriented evaluation is to test the usefulness of coreference relation information in an NLP application. We present results of both types of evalutation. We run experiments with an Information Extraction module for the medical domain, and measure the performance of this module with and without coreference relation information. In a separate experiment we also evaluate the effect of coreference information produced by a simple rule-based coreference module in a Question Answering application

June 8, 2026	20 years of LT3
May 31, 2026	PhD Defense Quanqi Du
May 20, 2026	📢 PhD Position
Dec. 17, 2025	On how GPT-4o, Gemini-2.5 and DeepSeek-R1 have been used in lexicography
Oct. 31, 2025	PhD Defense Sofie