Domain-Specific Knowledge Base Enrichment Using Wikipedia Tables

From Wikipedia Quality
Revision as of 06:37, 30 July 2019 by Aria (talk | contribs) (Adding new article - Domain-Specific Knowledge Base Enrichment Using Wikipedia Tables)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Domain-Specific Knowledge Base Enrichment Using Wikipedia Tables - scientific work related to Wikipedia quality published in 2015, written by Chenwei Ran, Wei Shen, Jianyong Wang and Xuan Zhu.

Overview

The knowledge base is a machine-readable set of knowledge. More and more multi-domain and large-scale knowledge bases have emerged in recent years, and they play an essential role in many information systems and semantic annotation tasks. However authors do not have a perfect knowledge base yet and maybe authors will never have a perfect one, because all the knowledge bases have limited coverage while new knowledge continues to emerge. Therefore populating and enriching the existing knowledge base become important tasks. Traditional knowledge base population task usually leverages the information embedded in the unstructured free text. Recently researchers found that massive structured tables on the Web are high-quality relational data and easier to be utilized than the unstructured text. Authors goal of this paper is to enrich the knowledge base using Wikipedia tables. Here, knowledge means binary relations between entities and authors focus on the relations in some specific domains. There are two basic types of information can be used in this task: the existing relation instances and the connection between types and relations. Authors firstly propose two basic probabilistic models based on two types of information respectively. Then authors propose a light-weight aggregated model to combine the advantages of basic models. The experimental results show that method is an effective approach to enriching the knowledge base with both high precision and recall.