Monolingual Text Similarity Measures: a Comparison of Models over Wikipedia Articles Revisions

From Wikipedia Quality
Revision as of 22:39, 9 November 2019 by Isabelle (talk | contribs) (Embed)
Jump to: navigation, search


Monolingual Text Similarity Measures: a Comparison of Models over Wikipedia Articles Revisions
Authors
Andreas Eiselt
Paolo Rosso
Publication date
2009
Links
Original

Monolingual Text Similarity Measures: a Comparison of Models over Wikipedia Articles Revisions - scientific work related to Wikipedia quality published in 2009, written by Andreas Eiselt and Paolo Rosso.

Overview

Measuring the similarity of texts is a common task in detection of co-derivatives, plagiarism and information flow. In general the objective is to locate those fragments of a document that are derived from another text. Authors have carried out an exhaustive comparison of similarity estimation models in order to determine which one performs better on different levels of granularity and languages (English, German, Spanish, and Hindi). In connection with the comparison authors introduce a publicly available corpus specially suited for this task. Furthermore authors introduce some modifications to well known algorithms in order to demonstrate their applicability to this task. Amongst others, experiments show the strengths and weaknesses of the different models with respect to the granularity of the processed texts.

Embed

Wikipedia Quality

Eiselt, Andreas; Rosso, Paolo. (2009). "[[Monolingual Text Similarity Measures: a Comparison of Models over Wikipedia Articles Revisions]]".

English Wikipedia

{{cite journal |last1=Eiselt |first1=Andreas |last2=Rosso |first2=Paolo |title=Monolingual Text Similarity Measures: a Comparison of Models over Wikipedia Articles Revisions |date=2009 |url=https://wikipediaquality.com/wiki/Monolingual_Text_Similarity_Measures:_a_Comparison_of_Models_over_Wikipedia_Articles_Revisions}}

HTML

Eiselt, Andreas; Rosso, Paolo. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Monolingual_Text_Similarity_Measures:_a_Comparison_of_Models_over_Wikipedia_Articles_Revisions">Monolingual Text Similarity Measures: a Comparison of Models over Wikipedia Articles Revisions</a>&quot;.