Datasets and Gate Evaluation Framework for Benchmarking Wikipedia-Based Ner Systems

From Wikipedia Quality
Revision as of 23:31, 5 June 2019 by Emily (talk | contribs) (New work - Datasets and Gate Evaluation Framework for Benchmarking Wikipedia-Based Ner Systems)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Datasets and Gate Evaluation Framework for Benchmarking Wikipedia-Based Ner Systems - scientific work related to Wikipedia quality published in 2013, written by Milan Dojchinovski and Tomáš Klie.

Overview

Authors present a wikifier evaluation framework consisting of software support and two datasets (News and Tweets), which were derived from datasets previously published at WEKEX 2011 and MSM Challenge 2013. Entities recognized in the original datasets were enriched with new annotations - a link to Wikipedia and the most specific type from the DBpedia Ontology. The annotations were created by two annotators and a judge. The datasets are supplemented by plugins for their import to the GATE NLP framework and a DBpedia Ontology-aware plugin for aligning annotations created by a wikifier with the ground truth.