Difference between revisions of "Improved Text Categorisation for Wikipedia Named Entities"

From Wikipedia Quality
(Infobox work)
(Embed for English Wikipedia, HTML)
 
Line 9:
 
== Overview ==
 
The accuracy of [[named entity recognition]] systems relies heavily upon the volume and quality of available training data. Improving the process of automatically producing such training data is an important task, as manual acquisition is both time-consuming and expensive. The authors explore the use of a variety of machine learning algorithms for categorising [[Wikipedia]] articles, an initial step in producing the [[named entity]] training data. They achieve a categorisation accuracy of 95% F-score over six coarse [[categories]], an improvement of up to 5% F-score over previous methods.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Tardif, Sam; Curran, James R.; Murphy, Tara. (2009). "[[Improved Text Categorisation for Wikipedia Named Entities]]".
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Tardif |first1=Sam |last2=Curran |first2=James R. |last3=Murphy |first3=Tara |title=Improved Text Categorisation for Wikipedia Named Entities |date=2009 |url=https://wikipediaquality.com/wiki/Improved_Text_Categorisation_for_Wikipedia_Named_Entities}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Tardif, Sam; Curran, James R.; Murphy, Tara. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Improved_Text_Categorisation_for_Wikipedia_Named_Entities">Improved Text Categorisation for Wikipedia Named Entities</a>&quot;.
 +
</nowiki>
 +
</code>

Latest revision as of 02:53, 24 May 2020


Improved Text Categorisation for Wikipedia Named Entities
Authors: Sam Tardif, James R. Curran, Tara Murphy
Publication date: 2009

Improved Text Categorisation for Wikipedia Named Entities is a scientific work related to Wikipedia quality, published in 2009 and written by Sam Tardif, James R. Curran and Tara Murphy.

Overview

The accuracy of named entity recognition systems relies heavily upon the volume and quality of available training data. Improving the process of automatically producing such training data is an important task, as manual acquisition is both time-consuming and expensive. The authors explore the use of a variety of machine learning algorithms for categorising Wikipedia articles, an initial step in producing the named entity training data. They achieve a categorisation accuracy of 95% F-score over six coarse categories, an improvement of up to 5% F-score over previous methods.
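
The F-score figures above can be made concrete with a small calculation. The following is a minimal sketch, not the authors' evaluation code: the function name `f_scores` and the placeholder labels are hypothetical, and the overview does not say whether the reported score is micro- or macro-averaged, so the sketch returns per-category F1 alongside a micro-average purely for illustration.

```python
from collections import Counter


def f_scores(gold, predicted):
    """Return (per-category F1, micro-averaged F1) for two equal-length label lists.

    Hypothetical helper for illustration; each article is assumed to
    carry exactly one gold category and one predicted category.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1   # correct prediction for category g
        else:
            fp[p] += 1   # p was predicted but is wrong
            fn[g] += 1   # g was the right answer but was missed

    def f1(t, f_pos, f_neg):
        # F1 = 2TP / (2TP + FP + FN), the harmonic mean of precision and recall
        denom = 2 * t + f_pos + f_neg
        return 2 * t / denom if denom else 0.0

    labels = set(gold) | set(predicted)
    per_category = {c: f1(tp[c], fp[c], fn[c]) for c in labels}
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    return per_category, micro


if __name__ == "__main__":
    # Placeholder labels "A" and "B"; the six coarse categories are not named here.
    gold = ["A", "A", "B", "B"]
    predicted = ["A", "B", "B", "B"]
    per_category, micro = f_scores(gold, predicted)
    print(per_category, micro)
```

With one label per article, the micro-averaged F1 coincides with plain accuracy, which is one reason the two terms are sometimes used together.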

Embed

Wikipedia Quality

Tardif, Sam; Curran, James R.; Murphy, Tara. (2009). "[[Improved Text Categorisation for Wikipedia Named Entities]]".

English Wikipedia

{{cite journal |last1=Tardif |first1=Sam |last2=Curran |first2=James R. |last3=Murphy |first3=Tara |title=Improved Text Categorisation for Wikipedia Named Entities |date=2009 |url=https://wikipediaquality.com/wiki/Improved_Text_Categorisation_for_Wikipedia_Named_Entities}}

HTML

Tardif, Sam; Curran, James R.; Murphy, Tara. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Improved_Text_Categorisation_for_Wikipedia_Named_Entities">Improved Text Categorisation for Wikipedia Named Entities</a>&quot;.