Difference between revisions of "Creating a Phrase Similarity Graph from Wikipedia"

From Wikipedia Quality
{{Infobox work
| title = Creating a Phrase Similarity Graph from Wikipedia
| date = 2014
| authors = [[Lubomir Stanchev]]
| doi = 10.1109/ICSC.2014.22
| link = http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6882003&filter%3DAND%28p_IS_Number%3A6881979%29
}}
 
'''Creating a Phrase Similarity Graph from Wikipedia''' is a scientific work related to [[Wikipedia quality]], published in 2014 and written by [[Lubomir Stanchev]].
 
  
 
== Overview ==
 
 
The paper addresses the problem of modeling the relationships between English phrases using a similarity graph. The graph stores the strength of the relationship between phrases as a decimal number. Two kinds of [[Wikipedia]] data are used to build the graph: structured data, such as the fact that the Wikipedia page titled "Dog" belongs to the Wikipedia category "Domesticated animals", and textual data, such as the fact that the page titled "Dog" contains the word "wolf" thirty-one times. The quality of the graph is validated by comparing the phrase-pair similarity scores computed by software that uses the graph against the results of studies performed with human subjects. To the best of the author's knowledge, the software correlates better with the results of both the Miller and Charles study and the WordSimilarity-353 study than any other published research.
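The idea of a weighted phrase graph fed by both kinds of evidence can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the class <code>SimilarityGraph</code>, the helper functions, the fixed category weight of 0.5, and the relative-frequency weighting are all assumptions made for illustration, and every word count other than "wolf" (31) is invented. Pearson correlation is likewise assumed, since the summary above does not say which coefficient the paper reports.

```python
from collections import defaultdict
from math import sqrt


class SimilarityGraph:
    """Directed graph whose edge weights are decimal relationship strengths."""

    def __init__(self):
        self._edges = defaultdict(dict)

    def add_edge(self, source, target, weight):
        # Assumption: when two sources of evidence link the same pair,
        # keep the stronger one.
        current = self._edges[source].get(target, 0.0)
        self._edges[source][target] = max(current, weight)

    def weight(self, source, target):
        # Unknown pairs have strength 0.0.
        return self._edges.get(source, {}).get(target, 0.0)


def add_category_edges(graph, page, categories, weight=0.5):
    # Structured evidence: the page belongs to each category.
    # The default weight 0.5 is an arbitrary placeholder.
    for category in categories:
        graph.add_edge(page, category, weight)


def add_text_edges(graph, page, word_counts):
    # Textual evidence: weight each word by its share of the counted words
    # (a hypothetical weighting, not taken from the paper).
    total = sum(word_counts.values())
    for word, count in word_counts.items():
        graph.add_edge(page, word, count / total)


def pearson(xs, ys):
    # Pearson correlation between two equally long score lists.
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


graph = SimilarityGraph()
add_category_edges(graph, "Dog", ["Domesticated animals"])
# Only the count for "wolf" comes from the text; the others are invented.
add_text_edges(graph, "Dog", {"wolf": 31, "cat": 19, "pet": 50})

print(graph.weight("Dog", "wolf"))                  # 0.31
print(graph.weight("Dog", "Domesticated animals"))  # 0.5

# Placeholder scores, not data from either human-subject study.
print(pearson([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))    # 1.0
```

In a validation of the kind described, the first list passed to <code>pearson</code> would hold the human similarity judgments and the second the scores computed from the graph for the same phrase pairs.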
 

Revision as of 10:58, 13 November 2019

