Difference between revisions of "Named Entity Corpus Construction Using Wikipedia and Dbpedia Ontology"

From Wikipedia Quality
Jump to: navigation, search
(+ wikilinks)
(infobox)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Named Entity Corpus Construction Using Wikipedia and Dbpedia Ontology
 +
| date = 2014
 +
| authors = [[YoungGyun Hahm]]<br />[[Jungyeul Park]]<br />[[Kyungtae Lim]]<br />[[Youngsik Kim]]<br />[[Dosam Hwang]]<br />[[Key-Sun Choi]]
 +
| link = http://www.lrec-conf.org/proceedings/lrec2014/pdf/688_Paper.pdf
 +
}}
 
'''Named Entity Corpus Construction Using Wikipedia and Dbpedia Ontology''' - scientific work related to [[Wikipedia quality]] published in 2014, written by [[YoungGyun Hahm]], [[Jungyeul Park]], [[Kyungtae Lim]], [[Youngsik Kim]], [[Dosam Hwang]] and [[Key-Sun Choi]].
 
'''Named Entity Corpus Construction Using Wikipedia and Dbpedia Ontology''' - scientific work related to [[Wikipedia quality]] published in 2014, written by [[YoungGyun Hahm]], [[Jungyeul Park]], [[Kyungtae Lim]], [[Youngsik Kim]], [[Dosam Hwang]] and [[Key-Sun Choi]].
  
 
== Overview ==
 
== Overview ==
 
In this paper, authors propose a novel method to automatically build a [[named entity]] corpus based on the [[DBpedia]] [[ontology]]. Since most of [[named entity recognition]] systems require time and effort consuming annotation tasks as training data. Work on NER has thus for been limited on certain languages like English that are resource-abundant in general. As an alternative, authors suggest that the NE corpus generated by proposed method, can be used as training data. Authors approach introduces [[Wikipedia]] as a raw text and uses the DBpedia data set for named entity disambiguation. Authors method is language-independent and easy to be applied to many [[different language]]s where Wikipedia and DBpedia are provided. Throughout the paper, authors demonstrate that NE corpus is of comparable quality even to the manually annotated NE corpus.
 
In this paper, authors propose a novel method to automatically build a [[named entity]] corpus based on the [[DBpedia]] [[ontology]]. Since most of [[named entity recognition]] systems require time and effort consuming annotation tasks as training data. Work on NER has thus for been limited on certain languages like English that are resource-abundant in general. As an alternative, authors suggest that the NE corpus generated by proposed method, can be used as training data. Authors approach introduces [[Wikipedia]] as a raw text and uses the DBpedia data set for named entity disambiguation. Authors method is language-independent and easy to be applied to many [[different language]]s where Wikipedia and DBpedia are provided. Throughout the paper, authors demonstrate that NE corpus is of comparable quality even to the manually annotated NE corpus.

Revision as of 00:39, 14 February 2021


Named Entity Corpus Construction Using Wikipedia and Dbpedia Ontology
Authors
YoungGyun Hahm
Jungyeul Park
Kyungtae Lim
Youngsik Kim
Dosam Hwang
Key-Sun Choi
Publication date
2014
Links
Original

Named Entity Corpus Construction Using Wikipedia and Dbpedia Ontology - scientific work related to Wikipedia quality published in 2014, written by YoungGyun Hahm, Jungyeul Park, Kyungtae Lim, Youngsik Kim, Dosam Hwang and Key-Sun Choi.

Overview

In this paper, authors propose a novel method to automatically build a named entity corpus based on the DBpedia ontology. Since most of named entity recognition systems require time and effort consuming annotation tasks as training data. Work on NER has thus for been limited on certain languages like English that are resource-abundant in general. As an alternative, authors suggest that the NE corpus generated by proposed method, can be used as training data. Authors approach introduces Wikipedia as a raw text and uses the DBpedia data set for named entity disambiguation. Authors method is language-independent and easy to be applied to many different languages where Wikipedia and DBpedia are provided. Throughout the paper, authors demonstrate that NE corpus is of comparable quality even to the manually annotated NE corpus.