Difference between revisions of "Wikidocsaligner: an Off-The-Shelf Wikipedia Documents Alignment Tool"

From Wikipedia Quality
Jump to: navigation, search
(Wikidocsaligner: an Off-The-Shelf Wikipedia Documents Alignment Tool -- new article)
 
(Wikilinks)
Line 1: Line 1:
'''Wikidocsaligner: an Off-The-Shelf Wikipedia Documents Alignment Tool''' - scientific work related to Wikipedia quality published in 2017, written by Motaz Saad and Basem O. Alijla.
+
'''Wikidocsaligner: an Off-The-Shelf Wikipedia Documents Alignment Tool''' - scientific work related to [[Wikipedia quality]] published in 2017, written by [[Motaz Saad]] and [[Basem O. Alijla]].
  
 
== Overview ==
 
== Overview ==
Wikipedia encyclopedia is an attractive source for comparable corpora in many languages. Most researchers develop their own script to perform document alignment task, which requires efforts and time. In this paper, authors present WikiDocsAligner, an off-the-shelf Wikipedia Articles alignment handy tool. The implementation of WikiDocsAligner does not require the researchers to import/export of interlanguage links databases. The user just need to download Wikipedia dumps (interlanguage links and articles), then provide them to the tool, which performs the alignment. This software can be used easily to align Wikipedia documents in any language pair. Finally, authors use WikiDocsAligner to align comparable documents from Arabic Wikipedia and Egyptian Wikipedia. So authors shed the light on Wikipedia as a source of Arabic dialects language resources. The produced resources is interesting and useful as the demand on Arabic/dialects language resources increased in the last decade.
+
Wikipedia encyclopedia is an attractive source for comparable corpora in many languages. Most researchers develop their own script to perform document alignment task, which requires efforts and time. In this paper, authors present WikiDocsAligner, an off-the-shelf [[Wikipedia]] Articles alignment handy tool. The implementation of WikiDocsAligner does not require the researchers to import/export of interlanguage links databases. The user just need to download Wikipedia dumps (interlanguage links and articles), then provide them to the tool, which performs the alignment. This software can be used easily to align Wikipedia documents in any language pair. Finally, authors use WikiDocsAligner to align comparable documents from [[Arabic Wikipedia]] and Egyptian Wikipedia. So authors shed the light on Wikipedia as a source of Arabic dialects language resources. The produced resources is interesting and useful as the demand on Arabic/dialects language resources increased in the last decade.

Revision as of 07:51, 22 June 2019

Wikidocsaligner: an Off-The-Shelf Wikipedia Documents Alignment Tool - scientific work related to Wikipedia quality published in 2017, written by Motaz Saad and Basem O. Alijla.

Overview

Wikipedia encyclopedia is an attractive source for comparable corpora in many languages. Most researchers develop their own script to perform document alignment task, which requires efforts and time. In this paper, authors present WikiDocsAligner, an off-the-shelf Wikipedia Articles alignment handy tool. The implementation of WikiDocsAligner does not require the researchers to import/export of interlanguage links databases. The user just need to download Wikipedia dumps (interlanguage links and articles), then provide them to the tool, which performs the alignment. This software can be used easily to align Wikipedia documents in any language pair. Finally, authors use WikiDocsAligner to align comparable documents from Arabic Wikipedia and Egyptian Wikipedia. So authors shed the light on Wikipedia as a source of Arabic dialects language resources. The produced resources is interesting and useful as the demand on Arabic/dialects language resources increased in the last decade.