Populating Conceptnet Knowledge Base with Information Acquired from Japanese Wikipedia

From Wikipedia Quality
Revision as of 22:09, 15 June 2019 by Everly (talk | contribs) (Overview - Populating Conceptnet Knowledge Base with Information Acquired from Japanese Wikipedia)

Populating Conceptnet Knowledge Base with Information Acquired from Japanese Wikipedia is a scientific work related to Wikipedia quality, published in 2015 and written by Marek Krawczyk, Rafal Rzepka and Kenji Araki.

Overview

This paper presents a method for automatically acquiring IsA assertions (hyponymy relations), AtLocation assertions (giving the location of objects) and LocatedNear assertions (giving neighboring locations) from Japanese Wikipedia XML dump files. To extract IsA assertions, the authors use the Hyponymy extraction tool v1.0, which analyses the definition, category and hierarchy structures of Wikipedia articles. The tool also produces an information-rich taxonomy from which, using an original method, the authors extract additional information, in this case AtLocation and LocatedNear assertions. Experiments showed that both methods produce positive results: the authors acquired 5,866,680 IsA assertions with 99.0% reliability, 131,760 AtLocation assertion pairs with 93.0% reliability and 6,217 LocatedNear assertion pairs with 99.0% reliability. The authors' method exceeded the baseline system in both precision and the number of acquired assertions.
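
To make the three assertion types and the reported figures more concrete, the following minimal Python sketch represents an assertion as a relation with a start and an end concept and formats it with ConceptNet-style URIs (/r/ for relations, /c/ja/ for Japanese concepts). It also estimates how many assertions of each type would be correct if the reported reliability is read as the fraction of correct assertions. The class and function names, the relation spelling LocatedNear (as used in ConceptNet) and the example concept pair are illustrative assumptions and are not taken from the paper; the counts and reliabilities are the ones quoted in the overview.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Assertion:
    """One ConceptNet-style assertion (hypothetical helper, not from the paper)."""
    relation: str  # "IsA", "AtLocation" or "LocatedNear"
    start: str     # source concept
    end: str       # target concept

    def as_edge(self, lang: str = "ja") -> dict:
        # ConceptNet-style URIs: /r/<relation> and /c/<language>/<term>
        return {
            "rel": f"/r/{self.relation}",
            "start": f"/c/{lang}/{self.start}",
            "end": f"/c/{lang}/{self.end}",
        }


# Counts and reliabilities as quoted in the overview.
REPORTED = {
    "IsA": (5_866_680, 0.99),
    "AtLocation": (131_760, 0.93),
    "LocatedNear": (6_217, 0.99),
}


def expected_correct(count: int, reliability: float) -> int:
    """Estimate correct assertions, assuming reliability is the fraction correct."""
    return round(count * reliability)


if __name__ == "__main__":
    # Illustrative pair only; it is not an example drawn from the paper's data.
    example = Assertion("AtLocation", "東京タワー", "東京")
    print(example.as_edge())
    for relation, (count, reliability) in REPORTED.items():
        print(relation, expected_correct(count, reliability))
```

Under that reading of reliability, the estimate comes to roughly 5.81 million correct IsA assertions, about 122.5 thousand correct AtLocation pairs and about 6.2 thousand correct LocatedNear pairs.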