Difference between revisions of "A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity"

From Wikipedia Quality
(A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity - basic info)
 
(Wikilinks)
Line 1: Line 1:
'''A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity''' - scientific work related to Wikipedia quality published in 2009, written by Antonio Toral, Óscar Ferrández, Eneko Agirre and Rafael Muñoz.
+
'''A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Antonio Toral]], [[Óscar Ferrández]], [[Eneko Agirre]] and [[Rafael Muñoz]].
  
 
== Overview ==
 
== Overview ==
This paper studies the application of text similarity methods to disambiguate ambiguous links between WordNet nouns and Wikipedia categories. The methods include word overlap between glosses, random projections, WordNet-based similarity, and a full-fledged textual entailment system. Both unsupervised and supervised combinations have been tried. The gold standard with disambiguated links is publicly available. The results range from 64.7% for the first-sense heuristic, through 68% for an unsupervised combination, to 77.74% for a supervised combination.
+
This paper studies the application of text similarity methods to disambiguate ambiguous links between [[WordNet]] nouns and [[Wikipedia categories]]. The methods include word overlap between glosses, random projections, WordNet-based similarity, and a full-fledged textual entailment system. Both unsupervised and supervised combinations have been tried. The gold standard with disambiguated links is publicly available. The results range from 64.7% for the first-sense heuristic, through 68% for an unsupervised combination, to 77.74% for a supervised combination.

Revision as of 00:04, 8 August 2019

A Study on Linking Wikipedia Categories to Wordnet Synsets Using Text Similarity is a scientific work related to Wikipedia quality, published in 2009 and written by Antonio Toral, Óscar Ferrández, Eneko Agirre and Rafael Muñoz.

Overview

This paper studies the application of text similarity methods to disambiguate ambiguous links between WordNet nouns and Wikipedia categories. The methods include word overlap between glosses, random projections, WordNet-based similarity, and a full-fledged textual entailment system. Both unsupervised and supervised combinations have been tried. The gold standard with disambiguated links is publicly available. The results range from 64.7% for the first-sense heuristic, through 68% for an unsupervised combination, to 77.74% for a supervised combination.
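
To make the simplest of the listed methods concrete, the following is a minimal sketch of word overlap between glosses used to pick one WordNet sense for a Wikipedia category. It is an illustration only, not the authors' implementation: the function names, the tiny stop-word list, the tokenization, and the example category text and glosses are all assumptions made for this example.

```python
import re

# Small illustrative stop-word list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "the", "of", "in", "on", "for", "and", "or",
             "to", "is", "that", "with", "by"}


def tokenize(text):
    """Lowercase the text and return its set of non-stop-word tokens."""
    return {t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS}


def overlap_score(category_text, gloss):
    """Count the word types shared by the category text and a synset gloss."""
    return len(tokenize(category_text) & tokenize(gloss))


def pick_sense(category_text, glosses):
    """Return the index of the gloss with the highest overlap.

    Ties go to the earliest sense, so when nothing overlaps this
    falls back to the first-sense heuristic.
    """
    scores = [overlap_score(category_text, g) for g in glosses]
    return scores.index(max(scores))


# Hypothetical data: a category description and two candidate glosses
# for the noun "bank", listed in sense order.
category_text = "Banks: financial institutions that accept deposits and lend money"
glosses = [
    "sloping land beside a body of water",
    "a financial institution that accepts deposits and channels the money into lending",
]

best = pick_sense(category_text, glosses)
print(f"chosen sense index: {best}")
```

The sketch matches exact word forms only, so "institutions" and "institution" do not count as shared. Stemming or lemmatization would be a natural refinement, and the other methods named above (random projections, WordNet-based similarity, textual entailment) would replace `overlap_score` with a different similarity function.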