Mining Semantic Relationships Between Concepts Across Documents Incorporating Wikipedia Knowledge

From Wikipedia Quality
Revision as of 21:02, 29 November 2019 by Ariana (talk | contribs) (+ Infobox work)
Jump to: navigation, search


Mining Semantic Relationships Between Concepts Across Documents Incorporating Wikipedia Knowledge
Authors
Peng Yan
Wei Jin
Publication date
2013
DOI
10.1007/978-3-642-39736-3_6
Links
Original

Mining Semantic Relationships Between Concepts Across Documents Incorporating Wikipedia Knowledge - scientific work related to Wikipedia quality published in 2013, written by Peng Yan and Wei Jin.

Overview

The ongoing astounding growth of text data has created an enormous need for fast and efficient text mining algorithms. Traditional approaches for document representation are mostly based on the Bag of Words (BOW) model which takes a document as an unordered collection of words. However, when applied in fine-grained information discovery tasks, such as mining semantic relationships between concepts, sorely relying on the BOW representation may not be sufficient to identify all potential relationships since the resulting associations based on the BOW approach are limited to the concepts that appear in the document collection literally. In this paper, authors attempt to complement existing information in the corpus by proposing a new hybrid approach, which mines semantic associations between concepts across multiple text units through incorporating extensive knowledge from Wikipedia. The experimental evaluation demonstrates that search performance has been significantly enhanced in terms of accuracy and coverage compared with a purely BOW-based approach and alternative solutions where only the article contents of Wikipedia or category information are considered.