Semantic Processing of Database Textual Attributes Using Wikipedia

From Wikipedia Quality
Revision as of 10:29, 8 September 2019 by Caroline (talk | contribs) (Adding new article - Semantic Processing of Database Textual Attributes Using Wikipedia)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Semantic Processing of Database Textual Attributes Using Wikipedia - scientific work related to Wikipedia quality published in 2011, written by Jesús R. Campaña, Juan Miguel Medina and M. Amparo Vila.

Overview

Text attributes in databases contain rich semantic information that is seldom processed or used. This paper proposes a method to extract and semantically represent concepts from texts stored in databases. This process relies on tools such as WordNet and Wikipedia to identify concepts extracted from texts and represent them as a basic ontology whose concepts are annotated with search terms. This ontology can play diverse roles. It can be seen as a conceptual summary of the content of an attribute, which can be used as a means to navigate through the textual content of an attribute. It can also be used as a profile for text search using the terms associated to the ontology concepts. The ontology is built as a subset of Wikipedia category graph, selected using diverse metrics. Category selection using these metrics is discussed and an example application is presented and evaluated.