Acquiring a Taxonomy from the German Wikipedia

From Wikipedia Quality
Revision as of 10:58, 30 November 2019 by Hanna (talk | contribs) (+ wikilinks)
Jump to: navigation, search

Acquiring a Taxonomy from the German Wikipedia - scientific work related to Wikipedia quality published in 2008, written by Laura Kassner, Vivi Nastase and Michael Strube.


This paper presents the process of acquiring a large, domain independent, taxonomy from the German Wikipedia. Authors build upon a previously implemented platform that extracts a semantic network and taxonomy from the English version of the Wikipedia. Authors describe two accomplishments of work: the semantic network for the German language in which isa links are identified and annotated, and an expansion of the platform for easy adaptation for a new language. Authors identify the platform’s strengths and shortcomings, which stem from the scarcity of free processing resources for languages other than English. Authors show that the taxonomy induction process is highly reliable – evaluated against the German version of WordNet, GermaNet, the resource obtained shows an accuracy of 83.34%.