Difference between revisions of "Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles"

From Wikipedia Quality
Jump to: navigation, search
(Infobox)
(Adding embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Document categorization became a crucial task to organize the massive amount of data over the web. Moreover, many web repositories tended to classify its articles to hierarchies of topics. This structure facilitates connecting related topics and reaching articles. [[Wikipedia]] has organized its articles in a category hierarchy; but so far, the categorization process is done manually by human editors which is a confusing, tiring and a time-consuming task. In this work authors propose WikiAutoCat system for automatic categorization of Wikipedia articles. It is an [[information retrieval]] system that suggests the most relevant set of [[categories]] to the article editor to simplify the categorization process. Empirical evaluation demonstrates that system is scalable enough to perform the categorization process of such a big dataset and it achieves big improvements over the state of the art in Wikipedia categorization in accuracy by 41.65% over WikiCat-Word system and 26.83% over WikiCat-Link system. Also, it is evaluated on a benchmark dataset and achieved gains over their baseline by 8.1% in accuracy.
 
Document categorization became a crucial task to organize the massive amount of data over the web. Moreover, many web repositories tended to classify its articles to hierarchies of topics. This structure facilitates connecting related topics and reaching articles. [[Wikipedia]] has organized its articles in a category hierarchy; but so far, the categorization process is done manually by human editors which is a confusing, tiring and a time-consuming task. In this work authors propose WikiAutoCat system for automatic categorization of Wikipedia articles. It is an [[information retrieval]] system that suggests the most relevant set of [[categories]] to the article editor to simplify the categorization process. Empirical evaluation demonstrates that system is scalable enough to perform the categorization process of such a big dataset and it achieves big improvements over the state of the art in Wikipedia categorization in accuracy by 41.65% over WikiCat-Word system and 26.83% over WikiCat-Link system. Also, it is evaluated on a benchmark dataset and achieved gains over their baseline by 8.1% in accuracy.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Refaei, Nesma; Hemayed, Elsayed E.; Mansour, Riham. (2018). "[[Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles]]". Springer Berlin Heidelberg. DOI: 10.1007/s13369-018-3244-9.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Refaei |first1=Nesma |last2=Hemayed |first2=Elsayed E. |last3=Mansour |first3=Riham |title=Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles |date=2018 |doi=10.1007/s13369-018-3244-9 |url=https://wikipediaquality.com/wiki/Wikiautocat:_Information_Retrieval_System_for_Automatic_Categorization_of_Wikipedia_Articles |journal=Springer Berlin Heidelberg}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Refaei, Nesma; Hemayed, Elsayed E.; Mansour, Riham. (2018). &amp;quot;<a href="https://wikipediaquality.com/wiki/Wikiautocat:_Information_Retrieval_System_for_Automatic_Categorization_of_Wikipedia_Articles">Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles</a>&amp;quot;. Springer Berlin Heidelberg. DOI: 10.1007/s13369-018-3244-9.
 +
</nowiki>
 +
</code>

Revision as of 21:54, 12 August 2019


Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles
Authors
Nesma Refaei
Elsayed E. Hemayed
Riham Mansour
Publication date
2018
DOI
10.1007/s13369-018-3244-9
Links
Original

Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles - scientific work related to Wikipedia quality published in 2018, written by Nesma Refaei, Elsayed E. Hemayed and Riham Mansour.

Overview

Document categorization became a crucial task to organize the massive amount of data over the web. Moreover, many web repositories tended to classify its articles to hierarchies of topics. This structure facilitates connecting related topics and reaching articles. Wikipedia has organized its articles in a category hierarchy; but so far, the categorization process is done manually by human editors which is a confusing, tiring and a time-consuming task. In this work authors propose WikiAutoCat system for automatic categorization of Wikipedia articles. It is an information retrieval system that suggests the most relevant set of categories to the article editor to simplify the categorization process. Empirical evaluation demonstrates that system is scalable enough to perform the categorization process of such a big dataset and it achieves big improvements over the state of the art in Wikipedia categorization in accuracy by 41.65% over WikiCat-Word system and 26.83% over WikiCat-Link system. Also, it is evaluated on a benchmark dataset and achieved gains over their baseline by 8.1% in accuracy.

Embed

Wikipedia Quality

Refaei, Nesma; Hemayed, Elsayed E.; Mansour, Riham. (2018). "[[Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles]]". Springer Berlin Heidelberg. DOI: 10.1007/s13369-018-3244-9.

English Wikipedia

{{cite journal |last1=Refaei |first1=Nesma |last2=Hemayed |first2=Elsayed E. |last3=Mansour |first3=Riham |title=Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles |date=2018 |doi=10.1007/s13369-018-3244-9 |url=https://wikipediaquality.com/wiki/Wikiautocat:_Information_Retrieval_System_for_Automatic_Categorization_of_Wikipedia_Articles |journal=Springer Berlin Heidelberg}}

HTML

Refaei, Nesma; Hemayed, Elsayed E.; Mansour, Riham. (2018). &quot;<a href="https://wikipediaquality.com/wiki/Wikiautocat:_Information_Retrieval_System_for_Automatic_Categorization_of_Wikipedia_Articles">Wikiautocat: Information Retrieval System for Automatic Categorization of Wikipedia Articles</a>&quot;. Springer Berlin Heidelberg. DOI: 10.1007/s13369-018-3244-9.