Difference between revisions of "Analysing Wikipedia and Gold-Standard Corpora for Ner Training"

From Wikipedia Quality
Jump to: navigation, search
(Infobox work)
(+ Embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Named [[entity recognition]] (ner) for English typically involves one of three gold standards: muc, conll, or bbn, all created by costly manual annotation. Recent work has used [[Wikipedia]] to automatically create a massive corpus of [[named entity]] annotated text.
 
Named [[entity recognition]] (ner) for English typically involves one of three gold standards: muc, conll, or bbn, all created by costly manual annotation. Recent work has used [[Wikipedia]] to automatically create a massive corpus of [[named entity]] annotated text.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Nothman, Joel; Murphy, Tara; Curran, James R.. (2009). "[[Analysing Wikipedia and Gold-Standard Corpora for Ner Training]]". Association for Computational Linguistics. DOI: 10.3115/1609067.1609135.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Nothman |first1=Joel |last2=Murphy |first2=Tara |last3=Curran |first3=James R. |title=Analysing Wikipedia and Gold-Standard Corpora for Ner Training |date=2009 |doi=10.3115/1609067.1609135 |url=https://wikipediaquality.com/wiki/Analysing_Wikipedia_and_Gold-Standard_Corpora_for_Ner_Training |journal=Association for Computational Linguistics}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Nothman, Joel; Murphy, Tara; Curran, James R.. (2009). &amp;quot;<a href="https://wikipediaquality.com/wiki/Analysing_Wikipedia_and_Gold-Standard_Corpora_for_Ner_Training">Analysing Wikipedia and Gold-Standard Corpora for Ner Training</a>&amp;quot;. Association for Computational Linguistics. DOI: 10.3115/1609067.1609135.
 +
</nowiki>
 +
</code>

Revision as of 14:40, 22 December 2019


Analysing Wikipedia and Gold-Standard Corpora for Ner Training
Authors
Joel Nothman
Tara Murphy
James R. Curran
Publication date
2009
DOI
10.3115/1609067.1609135
Links
Original

Analysing Wikipedia and Gold-Standard Corpora for Ner Training - scientific work related to Wikipedia quality published in 2009, written by Joel Nothman, Tara Murphy and James R. Curran.

Overview

Named entity recognition (ner) for English typically involves one of three gold standards: muc, conll, or bbn, all created by costly manual annotation. Recent work has used Wikipedia to automatically create a massive corpus of named entity annotated text.

Embed

Wikipedia Quality

Nothman, Joel; Murphy, Tara; Curran, James R.. (2009). "[[Analysing Wikipedia and Gold-Standard Corpora for Ner Training]]". Association for Computational Linguistics. DOI: 10.3115/1609067.1609135.

English Wikipedia

{{cite journal |last1=Nothman |first1=Joel |last2=Murphy |first2=Tara |last3=Curran |first3=James R. |title=Analysing Wikipedia and Gold-Standard Corpora for Ner Training |date=2009 |doi=10.3115/1609067.1609135 |url=https://wikipediaquality.com/wiki/Analysing_Wikipedia_and_Gold-Standard_Corpora_for_Ner_Training |journal=Association for Computational Linguistics}}

HTML

Nothman, Joel; Murphy, Tara; Curran, James R.. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Analysing_Wikipedia_and_Gold-Standard_Corpora_for_Ner_Training">Analysing Wikipedia and Gold-Standard Corpora for Ner Training</a>&quot;. Association for Computational Linguistics. DOI: 10.3115/1609067.1609135.