Decoding Wikipedia Categories for Knowledge Acquisition

From Wikipedia Quality
Revision as of 08:31, 15 May 2020 by Aria (talk | contribs) (+ wikilinks)

Decoding Wikipedia Categories for Knowledge Acquisition is a scientific work related to Wikipedia quality, published in 2008 and written by Vivi Nastase and Michael Strube.

Overview

This paper presents an approach to acquiring knowledge from Wikipedia categories and the category network. Many Wikipedia categories have complex names that reflect how humans classify and organize instances, and thus encode knowledge about class attributes, taxonomic relations and other semantic relations. The authors decode these names and refer back to the network to induce relations between Wikipedia concepts, which are represented by pages or categories. The category structure makes it possible to propagate a relation detected between the constituents of a category name to numerous links between concepts. The results of the process are evaluated against ResearchCyc, and a subset is also evaluated by human judges. The results support the idea that Wikipedia category names are a rich source of useful and accurate knowledge.
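
The idea of decoding a category name and propagating the detected relation to the category's members can be illustrated with a minimal sketch. It is not the authors' implementation: the single "class, preposition, entity" pattern, the relation labels, and the names `decode_category` and `propagate` are assumptions made for illustration, and the page titles in the usage note are placeholders.

```python
import re
from typing import List, NamedTuple, Optional, Tuple


class Relation(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Hypothetical pattern: "<class> <preposition> <entity>",
# e.g. "Villages in Brandenburg". Real category names are far more varied.
CATEGORY_PATTERN = re.compile(
    r"^(?P<cls>.+?) (?P<prep>in|of|by|from) (?P<entity>.+)$"
)


def decode_category(name: str) -> Optional[Tuple[str, str, str]]:
    """Split a category name into (class, preposition, entity), if it matches."""
    match = CATEGORY_PATTERN.match(name)
    if match is None:
        return None
    return match.group("cls"), match.group("prep"), match.group("entity")


def propagate(category_name: str, member_pages: List[str]) -> List[Relation]:
    """Apply the relation found in the category name to every member page."""
    decoded = decode_category(category_name)
    if decoded is None:
        return []
    cls, prep, entity = decoded
    relations = []
    for page in member_pages:
        # Taxonomic link: the page is an instance of the class in the name.
        relations.append(Relation(page, "isa", cls))
        # Semantic link: the preposition is used as a stand-in relation label.
        relations.append(Relation(page, prep, entity))
    return relations
```

Calling `propagate("Villages in Brandenburg", ["Page A", "Page B"])` yields two relations per member page, one taxonomic and one linking the page to "Brandenburg". A name such as "Physics" does not match the assumed pattern and yields no relations.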