Augmenting Wikipedia with Named Entity Tags
| Field | Value |
|---|---|
| Authors | Wisam Dakka, Silviu Cucerzan |
| Publication date | 2008 |
| Links | Original |
"Augmenting Wikipedia with Named Entity Tags" is a scientific work related to Wikipedia quality, published in 2008 and written by Wisam Dakka and Silviu Cucerzan.
Overview
Wikipedia is the largest organized knowledge repository on the Web and is increasingly employed by natural language processing and search tools. In this paper, the authors investigate the task of labeling Wikipedia pages with standard named entity tags, which can then be used by a range of information extraction and language processing tools. To train the classifiers, the authors manually annotated a small set of Wikipedia pages and then used Wikipedia category information to extrapolate those annotations to a much larger training set. They employed several distinct features for each page: bag-of-words, page structure, abstract, titles, and entity mentions. The authors report high accuracies for several of the classifiers they built. As a result of this work, a Web service that classifies any Wikipedia page has been made available to the academic community.
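The training procedure described in the overview can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a toy corpus, a simple rule that copies a seed label to any unlabeled page sharing a category with seed pages of only one label, and a bag-of-words Naive Bayes classifier standing in for the classifiers used in the paper. The page texts, the category names, the `PER` and `LOC` tags, and all function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus: title -> (page text, set of category names).
PAGES = {
    "Ada Lovelace": ("english mathematician and writer born in london",
                     {"1815 births", "English mathematicians"}),
    "Alan Turing": ("english mathematician and computer scientist born in london",
                    {"1912 births", "English mathematicians"}),
    "Paris": ("capital city of france on the river seine",
              {"Capitals in Europe"}),
    "Berlin": ("capital city of germany on the river spree",
               {"Capitals in Europe"}),
    "Grace Hopper": ("american computer scientist and navy admiral born in new york",
                     {"1906 births"}),
}

# A small manually annotated seed set (illustrative tags, not the paper's data).
SEED_LABELS = {"Ada Lovelace": "PER", "Paris": "LOC"}


def extrapolate(seed_labels, pages):
    """Copy a seed label to unlabeled pages that share a category with
    seed pages of exactly one label (an assumed, simplified rule)."""
    labels_by_category = defaultdict(set)
    for title, label in seed_labels.items():
        for category in pages[title][1]:
            labels_by_category[category].add(label)
    labels = dict(seed_labels)
    for title, (_, categories) in pages.items():
        if title in labels:
            continue
        candidates = set()
        for category in categories:
            candidates |= labels_by_category.get(category, set())
        if len(candidates) == 1:  # skip pages with no evidence or conflicting evidence
            labels[title] = next(iter(candidates))
    return labels


def train_naive_bayes(labels, pages):
    """Collect per-class word counts from the bag-of-words of each labeled page."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    vocabulary = set()
    for title, label in labels.items():
        words = pages[title][0].split()
        word_counts[label].update(words)
        class_counts[label] += 1
        vocabulary.update(words)
    return word_counts, class_counts, vocabulary


def classify(text, model):
    """Return the class with the highest log-probability (add-one smoothing)."""
    word_counts, class_counts, vocabulary = model
    total_pages = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        score = math.log(class_counts[label] / total_pages)
        total_words = sum(word_counts[label].values())
        for word in text.split():
            if word in vocabulary:  # ignore words never seen in training
                score += math.log((word_counts[label][word] + 1)
                                  / (total_words + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


expanded_labels = extrapolate(SEED_LABELS, PAGES)
model = train_naive_bayes(expanded_labels, PAGES)
prediction = classify(PAGES["Grace Hopper"][0], model)
```

In this toy setup, two seed annotations grow into four training pages through shared categories, and the page that shares no category with a seed page is left for the classifier to tag from its text alone. The other feature groups mentioned in the overview (page structure, abstract, titles, entity mentions) are not modeled here.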
Embed
Wikipedia Quality
Dakka, Wisam; Cucerzan, Silviu. (2008). "[[Augmenting Wikipedia with Named Entity Tags]]".
English Wikipedia
{{cite journal |last1=Dakka |first1=Wisam |last2=Cucerzan |first2=Silviu |title=Augmenting Wikipedia with Named Entity Tags |date=2008 |url=https://wikipediaquality.com/wiki/Augmenting_Wikipedia_with_Named_Entity_Tags}}
HTML
Dakka, Wisam; Cucerzan, Silviu. (2008). "<a href="https://wikipediaquality.com/wiki/Augmenting_Wikipedia_with_Named_Entity_Tags">Augmenting Wikipedia with Named Entity Tags</a>".