Evaluating Hybrid versus Data-Driven Coreference Resolution

Publication type
P1
Publication status
Published
Authors
Hendrickx, I., Hoste, V., & Daelemans, W.
Journal
Lecture Notes in Artificial Intelligence
Series
Anaphora: Analysis, Algorithms and Applications
Volume
4410
Pagination
137-150
Publisher
Springer - Verlag (Lagos, Portugal)
Download
(.pdf)

Abstract

In this paper, we present a systematic evaluation of a hybrid approach of combined rule-based filtering and machine learning to Dutch coreference resolution. Through the application of a selection of linguistically-motivated negative and positive filters, which we apply in isolation and combined, we study the effect of these filters on precision and recall using two different learning techniques: memory-based learning and maximum entropy modeling. Our results show that by using the hybrid approach, we can reduce up to 92 % of the training material without performance loss. We also show that the filters improve the overall precision of the classifiers leading to higher F-scores on the test set.