Difference between revisions of "A Comparison of Methods for the Automatic Identification of Locations in Wikipedia"

Revision as of 16:33, 31 August 2019

A Comparison of Methods for the Automatic Identification of Locations in Wikipedia
Authors	Davide Buscaldi Paolo Rosso
Publication date	2007
DOI	10.1145/1316948.1316971
Links	Original

A Comparison of Methods for the Automatic Identification of Locations in Wikipedia - scientific work related to Wikipedia quality published in 2007, written by Davide Buscaldi and Paolo Rosso.

Overview

In this paper authors compare two methods for the automatic identification of geographical articles in encyclopedic resources such as Wikipedia. The methods are a WordNet-based method that uses a set of keywords related to geographical places, and a multinomial Naive Bayes classificator, trained over a randomly selected subset of the English Wikipedia. This task may be included into the broader task of Named Entity classification, a well-known problem in the field of Natural Language Processing. The experiments were carried out considering both the full text of the articles and only the definition of the entity being described in the article. The obtained results show that the information contained in the page templates and the category labels is more useful than the text of the articles.

@@ Line 1: / Line 1: @@
+{{Infobox work
+| title = A Comparison of Methods for the Automatic Identification of Locations in Wikipedia
+| date = 2007
+| authors = [[Davide Buscaldi]]<br />[[Paolo Rosso]]
+| doi = 10.1145/1316948.1316971
+| link = http://dl.acm.org/ft_gateway.cfm?id=1316971&amp;type=pdf
+}}
 '''A Comparison of Methods for the Automatic Identification of Locations in Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2007, written by [[Davide Buscaldi]] and [[Paolo Rosso]].
 == Overview ==
 In this paper authors compare two methods for the automatic identification of geographical articles in encyclopedic resources such as [[Wikipedia]]. The methods are a [[WordNet]]-based method that uses a set of keywords related to geographical places, and a multinomial Naive Bayes classificator, trained over a randomly selected subset of the [[English Wikipedia]]. This task may be included into the broader task of Named Entity classification, a well-known problem in the field of [[Natural Language Processing]]. The experiments were carried out considering both the full text of the articles and only the definition of the entity being described in the article. The obtained results show that the information contained in the page templates and the category labels is more useful than the text of the articles.

Difference between revisions of "A Comparison of Methods for the Automatic Identification of Locations in Wikipedia"

Revision as of 16:33, 31 August 2019

Overview

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools