Difference between revisions of "Multilingual Named Entity Recognition Using Parallel Data and Metadata from Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(+ wikilinks)
(Infobox work)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Multilingual Named Entity Recognition Using Parallel Data and Metadata from Wikipedia
 +
| date = 2012
 +
| authors = [[Sungchul Kim]]<br />[[Kristina Toutanova]]<br />[[Hwanjo Yu]]
 +
| link = https://dl.acm.org/citation.cfm?id=2390622
 +
}}
 
'''Multilingual Named Entity Recognition Using Parallel Data and Metadata from Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2012, written by [[Sungchul Kim]], [[Kristina Toutanova]] and [[Hwanjo Yu]].
 
'''Multilingual Named Entity Recognition Using Parallel Data and Metadata from Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2012, written by [[Sungchul Kim]], [[Kristina Toutanova]] and [[Hwanjo Yu]].
  
 
== Overview ==
 
== Overview ==
 
In this paper authors propose a method to automatically label multi-lingual data with [[named entity]] tags. Authors build on prior work utilizing [[Wikipedia]] metadata and show how to effectively combine the weak annotations stemming from Wikipedia metadata with information obtained through English-foreign language parallel Wikipedia sentences. The combination is achieved using a novel semi-CRF model for foreign sentence tagging in the context of a parallel English sentence. The model outperforms both standard annotation projection methods and methods based solely on Wikipedia metadata.
 
In this paper authors propose a method to automatically label multi-lingual data with [[named entity]] tags. Authors build on prior work utilizing [[Wikipedia]] metadata and show how to effectively combine the weak annotations stemming from Wikipedia metadata with information obtained through English-foreign language parallel Wikipedia sentences. The combination is achieved using a novel semi-CRF model for foreign sentence tagging in the context of a parallel English sentence. The model outperforms both standard annotation projection methods and methods based solely on Wikipedia metadata.

Revision as of 10:30, 6 November 2019


Multilingual Named Entity Recognition Using Parallel Data and Metadata from Wikipedia
Authors
Sungchul Kim
Kristina Toutanova
Hwanjo Yu
Publication date
2012
Links
Original

Multilingual Named Entity Recognition Using Parallel Data and Metadata from Wikipedia - scientific work related to Wikipedia quality published in 2012, written by Sungchul Kim, Kristina Toutanova and Hwanjo Yu.

Overview

In this paper authors propose a method to automatically label multi-lingual data with named entity tags. Authors build on prior work utilizing Wikipedia metadata and show how to effectively combine the weak annotations stemming from Wikipedia metadata with information obtained through English-foreign language parallel Wikipedia sentences. The combination is achieved using a novel semi-CRF model for foreign sentence tagging in the context of a parallel English sentence. The model outperforms both standard annotation projection methods and methods based solely on Wikipedia metadata.