Automatising the Learning of Lexical Patterns: an Application to the Enrichment of Wordnet by Extracting Semantic Relationships from Wikipedia

Automatising the Learning of Lexical Patterns: an Application to the Enrichment of Wordnet by Extracting Semantic Relationships from Wikipedia is a scientific work related to Wikipedia quality, published in 2007 and written by Maria Ruiz-Casado, Enrique Alfonseca and Pablo Castells.

Overview

This paper describes an automatic approach to identifying lexical patterns that represent semantic relationships between concepts in an on-line encyclopedia. These patterns can then be applied to extend existing ontologies or semantic networks with new relations. The experiments were performed with the Simple English Wikipedia and WordNet 1.7. A new algorithm was devised for automatically generalising the lexical patterns found in the encyclopedia entries. The authors found general patterns for the hyperonymy, hyponymy, holonymy and meronymy relations and, using them, extracted more than 2600 new relationships that did not originally appear in WordNet. The precision of these relationships depends on the degree of generality chosen for the patterns and on the type of relation, and is around 60-70% for the best combinations proposed.
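
The overview does not spell out the generalisation algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes patterns are token lists with hypothetical `<hook>` and `<target>` placeholders for the two related concepts, and it uses Python's `difflib` alignment as a stand-in: tokens shared by two patterns are kept, and each differing span is collapsed into a `*` wildcard. The function names, the placeholder names and the limit of two words per wildcard are all illustrative assumptions.

```python
import re
from difflib import SequenceMatcher

# Hypothetical placeholder tokens; not taken from the paper.
HOOK, TARGET, WILDCARD = "<hook>", "<target>", "*"


def generalise(p1, p2):
    """Merge two token-list patterns: shared tokens are kept,
    each differing span becomes a single wildcard."""
    out = []
    matcher = SequenceMatcher(a=p1, b=p2, autojunk=False)
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag == "equal":
            out.extend(p1[i1:i2])
        elif not out or out[-1] != WILDCARD:
            out.append(WILDCARD)  # replace / insert / delete
    return out


def to_regex(pattern):
    """Compile a pattern into a regex. Assumes the pattern starts and
    ends with a placeholder; a wildcard matches zero to two words."""
    parts = [r"\b"]
    for pos, tok in enumerate(pattern):
        if tok == WILDCARD:
            # The wildcard consumes its own trailing whitespace.
            parts.append(r"(?:\w+\s+){0,2}")
            continue
        if tok == HOOK:
            parts.append(r"(?P<hook>\w+)")
        elif tok == TARGET:
            parts.append(r"(?P<target>\w+)")
        else:
            parts.append(re.escape(tok))
        if pos < len(pattern) - 1:
            parts.append(r"\s+")
    return re.compile("".join(parts))


def extract(pattern, sentence):
    """Return a (hook, target) candidate pair, or None if no match."""
    match = to_regex(pattern).search(sentence)
    if match is None:
        return None
    return match.group("hook"), match.group("target")
```

Under these assumptions, generalising "<hook> is a kind of <target>" with "<hook> is a type of <target>" gives "<hook> is a * of <target>", and applying that pattern to "A sparrow is a small kind of bird" yields the candidate pair (sparrow, bird). A looser wildcard matches more sentences but admits more wrong pairs, which is the trade-off between pattern generality and precision that the overview mentions.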