Hacking Wikipedia for Hyponymy Relation Acquisition



Authors: Asuka Sumida, Kentaro Torisawa
Publication date: 2008
Links: Original

Hacking Wikipedia for Hyponymy Relation Acquisition is a scientific work related to Wikipedia quality, published in 2008 and written by Asuka Sumida and Kentaro Torisawa.

Overview

This paper describes a method for extracting a large set of hyponymy relations from Wikipedia. Wikipedia is much more consistently structured than generic HTML documents, so a large number of hyponymy relations can be extracted with simple methods. In this work, the authors extracted more than 1.4 × 10^6 hyponymy relations with 75.3% precision from the Japanese version of Wikipedia. To the best of the authors' knowledge, this is the largest machine-readable thesaurus for Japanese. The main contribution of the paper is a method for hyponymy acquisition from hierarchical layouts in Wikipedia. Using a machine learning technique and pattern matching, the authors extracted more than 6.3 × 10^5 relations from hierarchical layouts in the Japanese Wikipedia, with a precision of 76.4%. The remaining hyponymy relations were acquired by existing methods for extracting relations from definition sentences and category pages. Extraction from the hierarchical layouts therefore almost doubled the number of relations extracted.
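To make the idea of acquiring relations from hierarchical layouts more concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes that a "hierarchical layout" means the nesting of section headings and bullet lists in wiki markup, and it only generates candidate (hypernym, hyponym) pairs by pairing each item with its nearest ancestor. The function name candidate_pairs, the regular expressions, and the sample article are all hypothetical. The filtering step that the overview attributes to machine learning and pattern matching is not shown, and many raw candidates produced this way would not be true hyponymy relations.

```python
import re

# Assumed markup conventions: "== Title ==" for headings, "* item" for bullets.
HEADING = re.compile(r"^(=+)\s*(.+?)\s*=+\s*$")
BULLET = re.compile(r"^(\*+)\s*(.+?)\s*$")

# Bullets always nest below headings, so their depth is shifted past
# the deepest heading level (an arbitrary offset chosen for this sketch).
BULLET_OFFSET = 10


def candidate_pairs(title, wikitext):
    """Return (hypernym_candidate, hyponym_candidate) pairs from the layout."""
    stack = [(0, title)]  # (depth, text); the article title is the root
    pairs = []
    for line in wikitext.splitlines():
        match = HEADING.match(line)
        if match:
            depth = len(match.group(1))
        else:
            match = BULLET.match(line)
            if not match:
                continue  # plain prose lines carry no layout information
            depth = BULLET_OFFSET + len(match.group(1))
        text = match.group(2)
        # Drop siblings and deeper items so the top of the stack is the parent.
        while stack[-1][0] >= depth:
            stack.pop()
        pairs.append((stack[-1][1], text))
        stack.append((depth, text))
    return pairs


sample = """== Black tea ==
* Darjeeling
* Assam
== Green tea ==
* Sencha
** Fukamushi sencha"""

for hypernym, hyponym in candidate_pairs("Tea", sample):
    print(f"{hyponym} -> {hypernym}")
```

In this sketch every candidate is kept. A real pipeline would need a classifier or patterns to reject pairs such as an article title paired with a "References" heading.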

Embed

Wikipedia Quality

Sumida, Asuka; Torisawa, Kentaro. (2008). "[[Hacking Wikipedia for Hyponymy Relation Acquisition]]".

English Wikipedia

{{cite journal |last1=Sumida |first1=Asuka |last2=Torisawa |first2=Kentaro |title=Hacking Wikipedia for Hyponymy Relation Acquisition |date=2008 |url=https://wikipediaquality.com/wiki/Hacking_Wikipedia_for_Hyponymy_Relation_Acquisition}}

HTML

Sumida, Asuka; Torisawa, Kentaro. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Hacking_Wikipedia_for_Hyponymy_Relation_Acquisition">Hacking Wikipedia for Hyponymy Relation Acquisition</a>&quot;.