Difference between revisions of "Using Hyperlink Texts to Improve Quality of Identifying Document Topics based on Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Using Hyperlink Texts to Improve Quality of Identifying Document Topics based on Wikipedia -- new article)
 
(+ links)
Line 1: Line 1:
'''Using Hyperlink Texts to Improve Quality of Identifying Document Topics based on Wikipedia''' - scientific work related to Wikipedia quality published in 2009, written by Dat T. Huynh, Tru H. Cao, Phuong H.T. Pham and Toan N. Hoang.
+
'''Using Hyperlink Texts to Improve Quality of Identifying Document Topics based on Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Dat T. Huynh]], [[Tru H. Cao]], [[Phuong H.T. Pham]] and [[Toan N. Hoang]].
  
 
== Overview ==
 
== Overview ==
This paper presents a method to identify the topics of documents based on Wikipedia category network. It is to improve the method previously proposed by Schonhofen by taking into account the weights of words in hyperlink texts in Wikipedia articles. The experiments on Computing and Team Sport domains have been carried out and showed that proposed method outperforms the Schonhofen’s one.
+
This paper presents a method to identify the topics of documents based on [[Wikipedia]] category network. It is to improve the method previously proposed by Schonhofen by taking into account the weights of words in hyperlink texts in Wikipedia articles. The experiments on Computing and Team Sport domains have been carried out and showed that proposed method outperforms the Schonhofen’s one.

Revision as of 18:25, 26 May 2019

Using Hyperlink Texts to Improve Quality of Identifying Document Topics based on Wikipedia - scientific work related to Wikipedia quality published in 2009, written by Dat T. Huynh, Tru H. Cao, Phuong H.T. Pham and Toan N. Hoang.

Overview

This paper presents a method to identify the topics of documents based on Wikipedia category network. It is to improve the method previously proposed by Schonhofen by taking into account the weights of words in hyperlink texts in Wikipedia articles. The experiments on Computing and Team Sport domains have been carried out and showed that proposed method outperforms the Schonhofen’s one.