Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web

From Wikipedia Quality
Revision as of 10:34, 2 August 2019 by Brianna (talk | contribs) (+ wikilinks)
Jump to: navigation, search

Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web - scientific work related to Wikipedia quality published in 2009, written by Yulan Yan, Naoaki Okazaki, Yutaka Matsuo, Zhenglu Yang and Mitsuru Ishizuka.

Overview

This paper presents an unsupervised relation extraction method for discovering and enhancing relations in which a specified concept in Wikipedia participates. Using respective characteristics of Wikipedia articles and Web corpus, authors develop a clustering approach based on combinations of patterns: dependency patterns from dependency analysis of texts in Wikipedia, and surface patterns generated from highly redundant information related to the Web. Evaluations of the proposed approach on two different domains demonstrate the superiority of the pattern combination over existing approaches. Fundamentally, method demonstrates how deep linguistic patterns contribute complementarily with Web surface patterns to the generation of various relations.