Difference between revisions of "Creating a Phrase Similarity Graph from Wikipedia"

Revision as of 10:58, 13 November 2019

Creating a Phrase Similarity Graph from Wikipedia
Authors	Lubomir Stanchev
Publication date	2014
DOI	10.1109/ICSC.2014.22
Links	Original

Creating a Phrase Similarity Graph from Wikipedia is a scientific work related to Wikipedia quality, published in 2014 and written by Lubomir Stanchev.

Overview

The paper addresses the problem of modeling the relationships between English phrases using a similarity graph. The mathematical model stores the strength of the relationship between phrases as a decimal number. The graph is built from both structured Wikipedia data (for example, the Wikipedia page titled "Dog" belongs to the Wikipedia category "Domesticated animals") and textual descriptions (for example, the Wikipedia page titled "Dog" contains the word "wolf" thirty-one times). The quality of the graph data is validated by comparing the phrase-pair similarities computed by software that uses the graph with the results of studies performed with human subjects. To the best of the author's knowledge, the software produces a better correlation with the results of both the Miller and Charles study and the WordSimilarity-353 study than any other published research.
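
The idea in the overview can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the class `PhraseGraph`, the helpers `build_graph` and `pearson`, the fixed `category_weight` of 0.5, and the relative-frequency weighting of words are all assumptions made for illustration. Only the "Dog" examples (the category "Domesticated animals" and the word "wolf" occurring thirty-one times) come from the overview; the count for "cat" is invented toy data.

```python
from collections import defaultdict
from math import sqrt


class PhraseGraph:
    """Directed graph whose edge weights are decimal relatedness scores."""

    def __init__(self):
        self.edges = defaultdict(dict)

    def add_edge(self, source, target, weight):
        # Keep the strongest evidence seen so far for this pair of phrases.
        current = self.edges[source].get(target, 0.0)
        self.edges[source][target] = max(current, weight)

    def weight(self, source, target):
        # Pairs with no edge are treated as unrelated (weight 0.0).
        return self.edges[source].get(target, 0.0)


def build_graph(categories, word_counts, category_weight=0.5):
    """Combine structured and textual evidence into one graph.

    categories:  page title -> list of category names (structured data)
    word_counts: page title -> {word: occurrences in the page text}
    category_weight is an arbitrary constant chosen for this sketch.
    """
    graph = PhraseGraph()
    for title, cats in categories.items():
        for cat in cats:
            graph.add_edge(title.lower(), cat.lower(), category_weight)
    for title, counts in word_counts.items():
        total = sum(counts.values())
        for word, n in counts.items():
            # Assumption: weight a word by its relative frequency on the page.
            graph.add_edge(title.lower(), word.lower(), n / total)
    return graph


def pearson(xs, ys):
    """Pearson correlation between two equally long lists of scores."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    categories = {"Dog": ["Domesticated animals"]}
    word_counts = {"Dog": {"wolf": 31, "cat": 9}}  # "cat": 9 is toy data
    graph = build_graph(categories, word_counts)
    print(graph.weight("dog", "wolf"))
    print(graph.weight("dog", "domesticated animals"))
```

Validation against a human-subject study would then amount to calling `pearson` on two parallel lists: the scores the software assigns to a set of phrase pairs and the average human ratings for the same pairs. No real benchmark data is reproduced here.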

@@ Line 1: / Line 1: @@
+{{Infobox work
+| title = Creating a Phrase Similarity Graph from Wikipedia
+| date = 2014
+| authors = [[Lubomir Stanchev]]
+| doi = 10.1109/ICSC.2014.22
+| link = http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6882003&filter%3DAND%28p_IS_Number%3A6881979%29
+}}
 '''Creating a Phrase Similarity Graph from Wikipedia''' is a scientific work related to [[Wikipedia quality]], published in 2014 and written by [[Lubomir Stanchev]].
 == Overview ==
 The paper addresses the problem of modeling the relationships between English phrases using a similarity graph. The mathematical model stores the strength of the relationship between phrases as a decimal number. The graph is built from both structured [[Wikipedia]] data (for example, the Wikipedia page titled "Dog" belongs to the Wikipedia category "Domesticated animals") and textual descriptions (for example, the Wikipedia page titled "Dog" contains the word "wolf" thirty-one times). The quality of the graph data is validated by comparing the phrase-pair similarities computed by software that uses the graph with the results of studies performed with human subjects. To the best of the author's knowledge, the software produces a better correlation with the results of both the Miller and Charles study and the WordSimilarity-353 study than any other published research.
