Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web

From Wikipedia Quality
Revision as of 21:44, 28 July 2019 by Sadie (talk | contribs) (Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web - creating a new article)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web - scientific work related to Wikipedia quality published in 2009, written by Yulan Yan, Naoaki Okazaki, Yutaka Matsuo, Zhenglu Yang and Mitsuru Ishizuka.

Overview

This paper presents an unsupervised relation extraction method for discovering and enhancing relations in which a specified concept in Wikipedia participates. Using respective characteristics of Wikipedia articles and Web corpus, authors develop a clustering approach based on combinations of patterns: dependency patterns from dependency analysis of texts in Wikipedia, and surface patterns generated from highly redundant information related to the Web. Evaluations of the proposed approach on two different domains demonstrate the superiority of the pattern combination over existing approaches. Fundamentally, method demonstrates how deep linguistic patterns contribute complementarily with Web surface patterns to the generation of various relations.