Coreference in Wikipedia: Main Concept Resolution

From Wikipedia Quality
Revision as of 23:21, 27 September 2019 by Sadie (talk | contribs) (+ links)
Jump to: navigation, search

Coreference in Wikipedia: Main Concept Resolution - scientific work related to Wikipedia quality published in 2016, written by Abbas Ghaddar and Phillippe Langlais.

Overview

Wikipedia is a resource of choice exploited in many NLP applications, yet authors are not aware of recent attempts to adapt coreference resolution to this resource. In this work, authors revisit a seldom studied task which consists in identifying in a Wikipedia article all the mentions of the main concept being described. Authors show that by exploiting the Wikipedia markup of a document, as well as links to external knowledge bases such as Freebase, authors can acquire useful information on entities that helps to classify mentions as coreferent or not. Authors designed a classifier which drastically outperforms fair baselines built on top of state-of-the-art coreference resolution systems. Authors also measure the benefits of this classifier in a full coreference resolution pipeline applied to Wikipedia texts.