Difference between revisions of "Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes"

From Wikipedia Quality
Jump to: navigation, search
(+ Infobox work)
(Adding embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
An approach for [[named entity]] classification based on [[Wikipedia]] article [[infoboxes]] is described in this paper. It identifies the three fundamental named entity types, namely; Person, Location and Organization. An entity classification is accomplished by matching entity attributes extracted from the relevant entity article infobox against core entity attributes built from Wikipedia Infobox Templates. Experimental results showed that the classifier can achieve a high accuracy and F-measure scores of 97%. Based on this approach, a database of around 1.6 million 3-typed [[named entities]] is created from 20140203 Wikipedia dump. Experiments on CoNLL2003 shared task [[named entity recognition]] (NER) dataset disclosed the system's outstanding performance in comparison to three different state- of-the-art systems.
 
An approach for [[named entity]] classification based on [[Wikipedia]] article [[infoboxes]] is described in this paper. It identifies the three fundamental named entity types, namely; Person, Location and Organization. An entity classification is accomplished by matching entity attributes extracted from the relevant entity article infobox against core entity attributes built from Wikipedia Infobox Templates. Experimental results showed that the classifier can achieve a high accuracy and F-measure scores of 97%. Based on this approach, a database of around 1.6 million 3-typed [[named entities]] is created from 20140203 Wikipedia dump. Experiments on CoNLL2003 shared task [[named entity recognition]] (NER) dataset disclosed the system's outstanding performance in comparison to three different state- of-the-art systems.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Mohamed, Muhidin; Oussalah, Mourad. (2014). "[[Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes]]". The Science and Information (SAI) Organization Limited. DOI: 10.14569/IJACSA.2014.050725.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Mohamed |first1=Muhidin |last2=Oussalah |first2=Mourad |title=Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes |date=2014 |doi=10.14569/IJACSA.2014.050725 |url=https://wikipediaquality.com/wiki/Identifying_and_Extracting_Named_Entities_from_Wikipedia_Database_Using_Entity_Infoboxes |journal=The Science and Information (SAI) Organization Limited}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Mohamed, Muhidin; Oussalah, Mourad. (2014). &amp;quot;<a href="https://wikipediaquality.com/wiki/Identifying_and_Extracting_Named_Entities_from_Wikipedia_Database_Using_Entity_Infoboxes">Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes</a>&amp;quot;. The Science and Information (SAI) Organization Limited. DOI: 10.14569/IJACSA.2014.050725.
 +
</nowiki>
 +
</code>

Revision as of 11:02, 15 February 2021


Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes
Authors
Muhidin Mohamed
Mourad Oussalah
Publication date
2014
DOI
10.14569/IJACSA.2014.050725
Links
Original

Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes - scientific work related to Wikipedia quality published in 2014, written by Muhidin Mohamed and Mourad Oussalah.

Overview

An approach for named entity classification based on Wikipedia article infoboxes is described in this paper. It identifies the three fundamental named entity types, namely; Person, Location and Organization. An entity classification is accomplished by matching entity attributes extracted from the relevant entity article infobox against core entity attributes built from Wikipedia Infobox Templates. Experimental results showed that the classifier can achieve a high accuracy and F-measure scores of 97%. Based on this approach, a database of around 1.6 million 3-typed named entities is created from 20140203 Wikipedia dump. Experiments on CoNLL2003 shared task named entity recognition (NER) dataset disclosed the system's outstanding performance in comparison to three different state- of-the-art systems.

Embed

Wikipedia Quality

Mohamed, Muhidin; Oussalah, Mourad. (2014). "[[Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes]]". The Science and Information (SAI) Organization Limited. DOI: 10.14569/IJACSA.2014.050725.

English Wikipedia

{{cite journal |last1=Mohamed |first1=Muhidin |last2=Oussalah |first2=Mourad |title=Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes |date=2014 |doi=10.14569/IJACSA.2014.050725 |url=https://wikipediaquality.com/wiki/Identifying_and_Extracting_Named_Entities_from_Wikipedia_Database_Using_Entity_Infoboxes |journal=The Science and Information (SAI) Organization Limited}}

HTML

Mohamed, Muhidin; Oussalah, Mourad. (2014). &quot;<a href="https://wikipediaquality.com/wiki/Identifying_and_Extracting_Named_Entities_from_Wikipedia_Database_Using_Entity_Infoboxes">Identifying and Extracting Named Entities from Wikipedia Database Using Entity Infoboxes</a>&quot;. The Science and Information (SAI) Organization Limited. DOI: 10.14569/IJACSA.2014.050725.