A Two-Stage Framework for Computing Entity Relatedness in Wikipedia

From Wikipedia Quality
Revision as of 19:47, 25 May 2019 by Olivia (talk | contribs) (Creating a new page - A Two-Stage Framework for Computing Entity Relatedness in Wikipedia)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

A Two-Stage Framework for Computing Entity Relatedness in Wikipedia - scientific work related to Wikipedia quality published in 2017, written by Marco Ponza, Paolo Ferragina and Soumen Chakrabarti.

Overview

Introducing a new dataset with human judgments of entity relatedness, authors present a thorough study of all entity relatedness measures in recent literature based on Wikipedia as the knowledge graph. No clear dominance is seen between measures based on textual similarity and graph proximity. Some of the better measures involve expensive global graph computations. Authors then propose a new, space-efficient, computationally lightweight, two-stage framework for relatedness computation. In the first stage, a small weighted subgraph is dynamically grown around the two query entities; in the second stage, relatedness is derived based on computations on this subgraph. Authors system shows better agreement with human judgment than existing proposals both on the new dataset and on an established one. Authors also plug relatedness algorithm into a state-of-the-art entity linker and observe an increase in its accuracy and robustness.