Extracting Semantic Concept Relations from Wikipedia

From Wikipedia Quality
Revision as of 22:14, 5 October 2019 by Audrey (Adding wikilinks)

Extracting Semantic Concept Relations from Wikipedia is a scientific work related to Wikipedia quality, published in 2014 and written by Patrick Arnold and Erhard Rahm.

Overview

Background knowledge, as provided by repositories such as WordNet, is of critical importance for linking or mapping ontologies and for related tasks. Since current repositories are quite limited in scope and currentness, the authors investigate how to automatically build improved repositories by extracting semantic relations (e.g., is-a and part-of relations) from Wikipedia articles. Their approach uses a comprehensive set of semantic patterns, finite state machines and NLP techniques to process Wikipedia definitions and to identify semantic relations between concepts. It is able to extract multiple relations from a single Wikipedia article. An evaluation on different domains shows the high quality and effectiveness of the proposed approach.
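
To make the idea of pattern-based extraction from a definition sentence more concrete, the following is a minimal sketch in Python. It is not the authors' implementation: it swaps the finite state machines and NLP techniques described above for two simple regular expressions, and the names `PATTERNS`, `_clean` and `extract_relations` are hypothetical, chosen only for illustration. It shows how one definition sentence can yield one or several (subject, relation, object) triples.

```python
import re
from typing import List, Tuple

# Hypothetical, deliberately tiny pattern list (an assumption for this sketch).
# Order matters: the more specific part-of pattern is tried before is-a.
PATTERNS = [
    (re.compile(r"\bis (?:a |an )?part of\b", re.IGNORECASE), "part-of"),
    (re.compile(r"\bis (?:a|an|the)\b", re.IGNORECASE), "is-a"),
]

# Matches a leading article so that "a car" is normalised to "car".
ARTICLE = re.compile(r"^(?:a|an|the)\s+", re.IGNORECASE)


def _clean(fragment: str) -> str:
    """Trim whitespace and surrounding punctuation, then drop a leading article."""
    return ARTICLE.sub("", fragment.strip(" .,;"))


def extract_relations(definition: str) -> List[Tuple[str, str, str]]:
    """Return (subject, relation, object) triples found in one definition sentence.

    Only the first matching pattern is used. Coordinated objects
    ("X is a Y and a Z") produce one triple per object.
    """
    for pattern, relation in PATTERNS:
        match = pattern.search(definition)
        if match is None:
            continue
        subject = _clean(definition[:match.start()])
        rest = definition[match.end():]
        # Split the remainder on "and", "or" and commas to get each object.
        parts = re.split(r"\s+(?:and|or)\s+|,\s*", rest)
        objects = [_clean(part) for part in parts]
        return [(subject, relation, obj) for obj in objects if obj]
    return []


if __name__ == "__main__":
    print(extract_relations("A netbook is a laptop and a portable computer."))
    print(extract_relations("A steering wheel is part of a car."))
```

A real system would need far more patterns, handling of relative clauses and modifiers, and linguistic preprocessing; this sketch only illustrates the general shape of mapping a definition sentence to several relations.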