Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’S Edit History

From Wikipedia Quality
Jump to: navigation, search


Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’S Edit History
Authors
Oliver Ferschke
Torsten Zesch
Iryna Gurevych
Publication date
2011
Links
Original

Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’S Edit History - scientific work related to Wikipedia quality published in 2011, written by Oliver Ferschke, Torsten Zesch and Iryna Gurevych.

Overview

Authors present an open-source toolkit which allows (i) to reconstruct past states of Wikipedia, and (ii) to efficiently access the edit history of Wikipedia articles. Reconstructing past states of Wikipedia is a prerequisite for reproducing previous experimental work based on Wikipedia. Beyond that, the edit history of Wikipedia articles has been shown to be a valuable knowledge source for NLP, but access is severely impeded by the lack of efficient tools for managing the huge amount of provided data. By using a dedicated storage format, toolkit massively decreases the data volume to less than 2% of the original size, and at the same time provides an easy-to-use interface to access the revision data. The language-independent design allows to process any language represented in Wikipedia. Authors expect this work to consolidate NLP research using Wikipedia in general, and to foster research making use of the knowledge encoded in Wikipedia's edit history.

Embed

Wikipedia Quality

Ferschke, Oliver; Zesch, Torsten; Gurevych, Iryna. (2011). "[[Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’S Edit History]]". Association for Computational Linguistics.

English Wikipedia

{{cite journal |last1=Ferschke |first1=Oliver |last2=Zesch |first2=Torsten |last3=Gurevych |first3=Iryna |title=Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’S Edit History |date=2011 |url=https://wikipediaquality.com/wiki/Wikipedia_Revision_Toolkit:_Efficiently_Accessing_Wikipedia’S_Edit_History |journal=Association for Computational Linguistics}}

HTML

Ferschke, Oliver; Zesch, Torsten; Gurevych, Iryna. (2011). &quot;<a href="https://wikipediaquality.com/wiki/Wikipedia_Revision_Toolkit:_Efficiently_Accessing_Wikipedia’S_Edit_History">Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’S Edit History</a>&quot;. Association for Computational Linguistics.