Analysing Wikipedia and Gold-Standard Corpora for NER Training

Analysing Wikipedia and Gold-Standard Corpora for NER Training is a scientific work related to Wikipedia quality, published in 2009 and written by Joel Nothman, Tara Murphy and James R. Curran.

Overview

Named entity recognition (NER) for English typically involves one of three gold standards: MUC, CoNLL, or BBN, all created by costly manual annotation. Recent work has used Wikipedia to automatically create a massive corpus of named-entity-annotated text.
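
To make the idea of automatic annotation concrete, the following is a minimal sketch, not the authors' pipeline. It assumes that each linked article has already been assigned an entity class, and it tags the anchor text of every link with the class of its target. The `ARTICLE_CLASSES` table, the tag names and the `annotate` function are all hypothetical and exist only for illustration.

```python
import re

# Hypothetical lookup from article title to entity class. In a real system
# this would come from classifying Wikipedia articles, not a hand-written dict.
ARTICLE_CLASSES = {
    "Sydney": "LOC",
    "University of Sydney": "ORG",
    "James Cook": "PER",
}

# Matches wiki links of the form [[Target]] or [[Target|anchor text]].
LINK = re.compile(r"\[\[([^\]|]+)(?:\|([^\]]+))?\]\]")


def annotate(wikitext):
    """Return (token, tag) pairs; link anchors get the class of their target."""
    tagged = []
    pos = 0
    for match in LINK.finditer(wikitext):
        # Text between links is outside any entity.
        tagged += [(tok, "O") for tok in wikitext[pos:match.start()].split()]
        target = match.group(1).strip()
        anchor = (match.group(2) or match.group(1)).strip()
        # Unknown targets fall back to "O" rather than guessing a class.
        tag = ARTICLE_CLASSES.get(target, "O")
        tagged += [(tok, tag) for tok in anchor.split()]
        pos = match.end()
    tagged += [(tok, "O") for tok in wikitext[pos:].split()]
    return tagged


if __name__ == "__main__":
    for token, tag in annotate("[[James Cook|Cook]] landed near [[Sydney]] ."):
        print(token, tag)
```

Whitespace tokenisation and a flat tag set keep the sketch short; a realistic corpus builder would need proper tokenisation and sentence splitting, and would have to handle entity mentions that are not linked.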
