Augmenting Wikipedia with Named Entity Tags

Authors: Wisam Dakka, Silviu Cucerzan
Publication date: 2008

Augmenting Wikipedia with Named Entity Tags is a scientific work related to Wikipedia quality, published in 2008 and written by Wisam Dakka and Silviu Cucerzan.

Overview

Wikipedia is the largest organized knowledge repository on the Web and is increasingly employed by natural language processing and search tools. In this paper, the authors investigate the task of labeling Wikipedia pages with standard named entity tags, which can then be used by a range of information extraction and language processing tools. To train the classifiers, the authors manually annotated a small set of Wikipedia pages and then used Wikipedia category information to extrapolate the annotations to a much larger training set. They employed several distinct features for each page: bag-of-words, page structure, abstract, titles, and entity mentions. They report high accuracies for several of the classifiers built. As a result of this work, a Web service that classifies any Wikipedia page has been made available to the academic community.
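The pipeline described above (a small manually annotated seed set, extrapolation through category information, then a classifier over page features) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy pages, the tag names PER and LOC, the majority-vote propagation rule, and the choice of a naive Bayes classifier over bag-of-words features alone are all assumptions made for illustration. The summary does not say which classifiers were used or how the category extrapolation was carried out.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data (not from the paper): title -> (categories, page text).
PAGES = {
    "Ada Lovelace": (["English mathematicians"], "mathematician born in london wrote notes"),
    "Alan Turing": (["English mathematicians"], "mathematician born in london studied computation"),
    "Paris": (["Capitals in Europe"], "capital city on the river seine"),
    "Rome": (["Capitals in Europe"], "capital city on the river tiber"),
}

# Small manually annotated seed set; the tag names are assumed.
SEED = {"Ada Lovelace": "PER", "Paris": "LOC"}


def extrapolate_labels(pages, seed):
    """Propagate seed labels to unlabeled pages that share a category."""
    votes = defaultdict(Counter)  # category -> counts of seed labels seen in it
    for title, label in seed.items():
        for category in pages[title][0]:
            votes[category][label] += 1
    labels = dict(seed)
    for title, (categories, _text) in pages.items():
        if title in labels:
            continue
        tally = Counter()
        for category in categories:
            tally.update(votes.get(category, {}))
        if tally:  # pages with no labeled category stay unlabeled
            labels[title] = tally.most_common(1)[0][0]
    return labels


def train_naive_bayes(pages, labels):
    """Collect per-class word counts from the extrapolated training set."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    vocab = set()
    for title, label in labels.items():
        tokens = pages[title][1].split()
        word_counts[label].update(tokens)
        class_counts[label] += 1
        vocab.update(tokens)
    return word_counts, class_counts, vocab


def classify(text, model):
    """Return the most probable tag under add-one-smoothed naive Bayes."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for token in text.split():
            if token in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][token] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


labels = extrapolate_labels(PAGES, SEED)
model = train_naive_bayes(PAGES, labels)
print(labels["Alan Turing"], classify("city on the river", model))
```

In this sketch, two seed annotations grow into four training pages because each unlabeled page shares a category with a labeled one. The other features named in the overview (page structure, abstract, titles, entity mentions) would enter as additional inputs to the classifier and are omitted here.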

Embed

Wikipedia Quality

Dakka, Wisam; Cucerzan, Silviu. (2008). "[[Augmenting Wikipedia with Named Entity Tags]]".

English Wikipedia

{{cite journal |last1=Dakka |first1=Wisam |last2=Cucerzan |first2=Silviu |title=Augmenting Wikipedia with Named Entity Tags |date=2008 |url=https://wikipediaquality.com/wiki/Augmenting_Wikipedia_with_Named_Entity_Tags}}

HTML

Dakka, Wisam; Cucerzan, Silviu. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Augmenting_Wikipedia_with_Named_Entity_Tags">Augmenting Wikipedia with Named Entity Tags</a>&quot;.