Text Mining Wikipedia to Discover Alternative Destinations

From Wikipedia Quality
Revision as of 10:31, 24 May 2019 by Sofia (talk | contribs) (Overview - Text Mining Wikipedia to Discover Alternative Destinations)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Text Mining Wikipedia to Discover Alternative Destinations - scientific work related to Wikipedia quality published in 2013, written by Kenneth Cosh.

Overview

This paper discusses an application of some statistical Natural Language Processing algorithms to a set of articles from Wikipedia about top tourist destinations. The objective is to automatically identify the key features of each destination and then discover other destinations which share similar sets of features. Through this a method is demonstrated by which meta data about each article can be extracted from the unstructured text and then used to answer complex discovery type queries. The paper compares an approach to automatically clustering similar destinations with a more user driven feature focused technique.