An Unsupervised Approach for Identifying the Infobox Template of Wikipedia Article

From Wikipedia Quality
Jump to: navigation, search


An Unsupervised Approach for Identifying the Infobox Template of Wikipedia Article
Authors
Hanif Bhuiyan
Kyeong-Jin Oh
Myung-Duk Hong
Geun-Sik Jo
Publication date
2015
DOI
10.1109/CSE.2015.47
Links
Original

An Unsupervised Approach for Identifying the Infobox Template of Wikipedia Article - scientific work related to Wikipedia quality published in 2015, written by Hanif Bhuiyan, Kyeong-Jin Oh, Myung-Duk Hong and Geun-Sik Jo.

Overview

Wikipedia infoboxes serve as important structured information source in the web. To author infobox for a particular article, volunteers required a considerable amount of manual effort to identify the respective infobox template. Thus, an automatic process to mark infobox template might be useful and beneficial for the Wikipedia contributors. In this paper, authors present a Natural Language Processing (NLP)-based automated approach to identify the infobox template in an unsupervised fashion. The proposed approach has been developed by using semantic relations (hyponym and holonym) and word features of Wikipedia articles. Authors approach works in three steps: first it processes the raw text of the article to generate sets of words, next it apply the proposed algorithm to identify the infobox type and finally point out the infobox template from the large pool of template list. The effectiveness of the proposed approach has been proved in terms of autonomous and accuracy, by a data-driven experiment.

Embed

Wikipedia Quality

Bhuiyan, Hanif; Oh, Kyeong-Jin; Hong, Myung-Duk; Jo, Geun-Sik. (2015). "[[An Unsupervised Approach for Identifying the Infobox Template of Wikipedia Article]]".DOI: 10.1109/CSE.2015.47.

English Wikipedia

{{cite journal |last1=Bhuiyan |first1=Hanif |last2=Oh |first2=Kyeong-Jin |last3=Hong |first3=Myung-Duk |last4=Jo |first4=Geun-Sik |title=An Unsupervised Approach for Identifying the Infobox Template of Wikipedia Article |date=2015 |doi=10.1109/CSE.2015.47 |url=https://wikipediaquality.com/wiki/An_Unsupervised_Approach_for_Identifying_the_Infobox_Template_of_Wikipedia_Article}}

HTML

Bhuiyan, Hanif; Oh, Kyeong-Jin; Hong, Myung-Duk; Jo, Geun-Sik. (2015). &quot;<a href="https://wikipediaquality.com/wiki/An_Unsupervised_Approach_for_Identifying_the_Infobox_Template_of_Wikipedia_Article">An Unsupervised Approach for Identifying the Infobox Template of Wikipedia Article</a>&quot;.DOI: 10.1109/CSE.2015.47.