A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity


A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity is a scientific work related to Wikipedia quality, published in 2009 and written by Antonio Toral, Óscar Ferrández, Eneko Agirre and Rafael Muñoz.

Overview

This paper studies the application of text similarity methods to disambiguate ambiguous links between WordNet nouns and Wikipedia categories. The methods include word overlap between glosses, random projections, WordNet-based similarity, and a full-fledged textual entailment system. Both unsupervised and supervised combinations of these methods have been tried. The gold standard with disambiguated links is publicly available. The results range from 64.7% for the first-sense heuristic, through 68% for an unsupervised combination, up to 77.74% for a supervised combination.
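
To make the simplest of these methods concrete, the sketch below illustrates word overlap between glosses combined with a first-sense fallback. It is not the authors' implementation: the function names, the stopword list, the Jaccard-style score, and the example glosses (paraphrased for illustration, not quoted from WordNet) are all assumptions.

```python
import re

# Hypothetical minimal stopword list; a real system would use a fuller one.
STOPWORDS = frozenset({"a", "an", "the", "of", "in", "on", "for",
                       "and", "or", "to", "is", "that", "with", "by"})


def tokens(text):
    """Lowercase the text, keep alphabetic words, and drop stopwords."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def overlap_score(text_a, text_b):
    """Jaccard overlap between the token sets of two texts (an assumed measure)."""
    ta, tb = tokens(text_a), tokens(text_b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def link_category(category_text, glosses):
    """Pick the candidate sense whose gloss overlaps most with the category text.

    `glosses` must be a non-empty list ordered by sense number. On ties,
    including the case where no gloss overlaps at all, the lowest index wins,
    which amounts to falling back to the first-sense heuristic.
    """
    scores = [overlap_score(category_text, g) for g in glosses]
    best = max(range(len(glosses)), key=lambda i: scores[i])
    return best, scores


# Illustrative candidate senses for the noun "Java" (glosses are paraphrased).
java_glosses = [
    "an island in Indonesia south of Borneo",
    "a beverage consisting of an infusion of ground coffee beans",
    "a platform-independent object-oriented programming language",
]
category = "Java programming language: articles about software written in the Java language"
best_sense, sense_scores = link_category(category, java_glosses)
```

Here `best_sense` is the index of the programming-language gloss, since it is the only one sharing content words with the category text. A combination of methods, as studied in the paper, would merge scores like these with those of the other similarity measures rather than rely on overlap alone.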