Difference between revisions of "Mining Wikipedia Article Clusters for Geospatial Entities and Relationships"

From Wikipedia Quality
(Mining Wikipedia Article Clusters for Geospatial Entities and Relationships - creating a new article)
 
(Links)
Line 1: Line 1:
'''Mining Wikipedia Article Clusters for Geospatial Entities and Relationships''' - scientific work related to Wikipedia quality published in 2009, written by Jeremy Witmer and Jugal K. Kalita.
+
'''Mining Wikipedia Article Clusters for Geospatial Entities and Relationships''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Jeremy Witmer]] and [[Jugal K. Kalita]].
  
 
== Overview ==
 
== Overview ==
Authors present in this paper a method to extract geospatial entities and relationships from the unstructured text of the English language Wikipedia. Using a novel approach that applies SVMs trained from purely structural features of text strings, authors extract candidate geospatial entities and relationships. Using a combination of further techniques, along with an external gazetteer, the candidate entities and relationships are disambiguated and the Wikipedia article pages are modified to include the semantic information provided by the extraction process. Authors successfully extracted location entities with an F-measure of 81%, and location relations with an F-
+
Authors present in this paper a method to extract geospatial entities and relationships from the unstructured text of the English language [[Wikipedia]]. Using a novel approach that applies SVMs trained from purely structural [[features]] of text strings, authors extract candidate geospatial entities and relationships. Using a combination of further techniques, along with an external gazetteer, the candidate entities and relationships are disambiguated and the Wikipedia article pages are modified to include the [[semantic information]] provided by the extraction process. Authors successfully extracted location entities with an F-measure of 81%, and location relations with an F-

Revision as of 11:21, 7 July 2019

Mining Wikipedia Article Clusters for Geospatial Entities and Relationships is a scientific work related to Wikipedia quality, published in 2009 and written by Jeremy Witmer and Jugal K. Kalita.

Overview

In this paper, the authors present a method to extract geospatial entities and relationships from the unstructured text of the English-language Wikipedia. Using a novel approach that applies support vector machines (SVMs) trained on purely structural features of text strings, the authors extract candidate geospatial entities and relationships. The candidates are then disambiguated using a combination of further techniques together with an external gazetteer, and the Wikipedia article pages are modified to include the semantic information provided by the extraction process. The authors successfully extracted location entities with an F-measure of 81%, and location relations with an F-
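
For readers unfamiliar with the metric, the F-measure quoted in the overview combines precision and recall into a single score. The following is a minimal sketch of the standard definition, not the authors' evaluation code. The function name `f_measure` and the counts in the usage note are hypothetical, chosen only for illustration; the paper's actual precision and recall figures are not given in this summary.

```python
def f_measure(true_positives: int, false_positives: int,
              false_negatives: int, beta: float = 1.0) -> float:
    """Return the F-beta score computed from raw counts.

    With beta = 1 this is the harmonic mean of precision and recall.
    All names here are illustrative and do not come from the paper.
    """
    predicted = true_positives + false_positives  # items the extractor returned
    actual = true_positives + false_negatives     # items it should have returned
    if predicted == 0 or actual == 0:
        return 0.0  # convention: undefined precision or recall scores zero

    precision = true_positives / predicted
    recall = true_positives / actual
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid dividing by zero when nothing was correct

    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # Hypothetical counts: 81 correct entities, 19 spurious, 19 missed.
    print(f"{f_measure(81, 19, 19):.2f}")
```

With these made-up counts, precision and recall both equal 0.81, so the score matches them. When the two differ, the harmonic mean is pulled toward the lower value, which is why the measure penalises an extractor that trades one heavily for the other.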