Difference between revisions of "Conceptual Feature Generation for Textual Information Using a Conceptual Network Constructed from Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Basic information on Conceptual Feature Generation for Textual Information Using a Conceptual Network Constructed from Wikipedia)
 
(+ wikilinks)
Line 1: Line 1:
'''Conceptual Feature Generation for Textual Information Using a Conceptual Network Constructed from Wikipedia''' - scientific work related to Wikipedia quality published in 2016, written by Amir Hossein Jadidinejad, Fariborz Mahmoudi and Mohammad Reza Meybodi.
+
'''Conceptual Feature Generation for Textual Information Using a Conceptual Network Constructed from Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2016, written by [[Amir Hossein Jadidinejad]], [[Fariborz Mahmoudi]] and [[Mohammad Reza Meybodi]].
  
 
== Overview ==
 
== Overview ==
A proper semantic representation of textual information underlies many natural language processing tasks. In this paper, a novel semantic annotator is presented to generate conceptual features for text documents. A comprehensive conceptual network is automatically constructed with the aid of Wikipedia that has been represented as a Markov chain. Furthermore, semantic annotator gets a fragment of natural language text and initiates a random walk to generate conceptual features that represent topical semantic of the input text. The generated conceptual features are applicable to many natural language processing tasks where the input is textual information and the output is a decision based on its context. Consequently, the effectiveness of the generated features is evaluated in the task of document clustering and classification. Empirical results demonstrate that representing text using conceptual features and considering the relations between concepts can significantly improve not only the bag of words representation but also other state-of-the-art approaches.
+
A proper semantic representation of textual information underlies many [[natural language processing]] tasks. In this paper, a novel semantic annotator is presented to generate conceptual [[features]] for text documents. A comprehensive conceptual network is automatically constructed with the aid of [[Wikipedia]] that has been represented as a Markov chain. Furthermore, semantic annotator gets a fragment of natural language text and initiates a random walk to generate conceptual features that represent topical semantic of the input text. The generated conceptual features are applicable to many natural language processing tasks where the input is textual information and the output is a decision based on its context. Consequently, the effectiveness of the generated features is evaluated in the task of document clustering and classification. Empirical results demonstrate that representing text using conceptual features and considering the relations between concepts can significantly improve not only the bag of words representation but also other state-of-the-art approaches.

Revision as of 10:11, 21 November 2019

Conceptual Feature Generation for Textual Information Using a Conceptual Network Constructed from Wikipedia - scientific work related to Wikipedia quality published in 2016, written by Amir Hossein Jadidinejad, Fariborz Mahmoudi and Mohammad Reza Meybodi.

Overview

A proper semantic representation of textual information underlies many natural language processing tasks. In this paper, a novel semantic annotator is presented to generate conceptual features for text documents. A comprehensive conceptual network is automatically constructed with the aid of Wikipedia that has been represented as a Markov chain. Furthermore, semantic annotator gets a fragment of natural language text and initiates a random walk to generate conceptual features that represent topical semantic of the input text. The generated conceptual features are applicable to many natural language processing tasks where the input is textual information and the output is a decision based on its context. Consequently, the effectiveness of the generated features is evaluated in the task of document clustering and classification. Empirical results demonstrate that representing text using conceptual features and considering the relations between concepts can significantly improve not only the bag of words representation but also other state-of-the-art approaches.