On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia

Authors: Edgardo Ferretti, Marcelo Luis Errecalde, Maik Anderka, Benno Maria Stein
Publication date: 2014
ISSN: 15294188
ISBN: 978-147995722-4
DOI: 10.1109/DEXA.2014.52
On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia is a scientific work about Wikipedia quality, published in 2014 and written by Edgardo Ferretti, Marcelo Luis Errecalde, Maik Anderka and Benno Maria Stein.

Overview

Learning from positive and unlabeled examples (PU learning) has proven to be an effective method in several Web mining applications. In particular, in the 1st International Competition on Quality Flaw Prediction in Wikipedia in 2012, a tailored PU learning approach performed best among the competitors. A key feature of that approach is the introduction of sampling strategies within the original PU learning procedure. The paper revisits the winning approach of 2012 and elaborates on neglected aspects in order to provide evidence for the usefulness of sampling in PU learning. To this end, the authors propose a modification to this PU learning approach and show how the different sampling strategies affect flaw prediction effectiveness. Their analysis is based on the original evaluation corpus of the 2012 competition on quality flaw prediction. A main outcome is that, under the best sampling strategy, their modified version of PU learning increases flaw prediction effectiveness by 18.31% on average compared with the winning approach of the competition.
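The general idea of selecting reliable negatives and then sampling from them can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-centroid classifier, the function names, the two strategy names ("random" and "most_negative") and the toy data are all assumptions made only for illustration, and the abstract does not say which classifiers or sampling strategies the paper actually uses.

```python
import random


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def score(x, pos_c, neg_c):
    """Greater than zero when x is closer to the positive centroid."""
    d_pos = sum((a - b) ** 2 for a, b in zip(x, pos_c))
    d_neg = sum((a - b) ** 2 for a, b in zip(x, neg_c))
    return d_neg - d_pos


def select_reliable_negatives(positives, unlabeled):
    """Step 1: treat every unlabeled example as negative, then keep the
    unlabeled examples that the resulting classifier scores as negative."""
    pos_c, neg_c = centroid(positives), centroid(unlabeled)
    scored = [(score(u, pos_c, neg_c), u) for u in unlabeled]
    return [(s, u) for s, u in scored if s < 0]


def sample_negatives(scored_rn, k, strategy, rng):
    """Hypothetical sampling strategies over the reliable negatives."""
    if strategy == "random":
        chosen = rng.sample(scored_rn, min(k, len(scored_rn)))
    elif strategy == "most_negative":
        chosen = sorted(scored_rn, key=lambda pair: pair[0])[:k]
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return [u for _, u in chosen]


def pu_learn(positives, unlabeled, k, strategy="random", seed=0):
    """Step 2: train the final classifier on positives vs. sampled negatives."""
    rng = random.Random(seed)
    reliable = select_reliable_negatives(positives, unlabeled)
    if not reliable:
        raise ValueError("no reliable negatives found")
    negatives = sample_negatives(reliable, k, strategy, rng)
    pos_c, neg_c = centroid(positives), centroid(negatives)
    return lambda x: score(x, pos_c, neg_c) > 0


# Toy data (assumed): flawed articles cluster near (1, 1); the unlabeled
# set hides one flawed article among four unflawed ones near (-1, -1).
positives = [[1.0, 1.0], [1.2, 0.8], [0.9, 1.1]]
unlabeled = [[1.1, 0.9], [-1.0, -1.0], [-1.2, -0.8], [-0.9, -1.1], [-1.1, -1.2]]

classify = pu_learn(positives, unlabeled, k=2, strategy="most_negative")
```

In this sketch the sampling strategy only decides which reliable negatives reach the second training step, so strategies can be compared by swapping the `strategy` argument while keeping everything else fixed.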

Embed

Wikipedia Quality

Ferretti, Edgardo; Errecalde, Marcelo Luis; Anderka, Maik; Stein, Benno Maria. (2014). "[[On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia]]". International Journal of Information Quality Volume 3, Issue 3, 1 January 2014, pp. 207-227. ISBN: 978-147995722-4. ISSN: 15294188. DOI: 10.1109/DEXA.2014.52.

English Wikipedia

{{cite journal |last1=Ferretti |first1=Edgardo |last2=Errecalde |first2=Marcelo Luis |last3=Anderka |first3=Maik |last4=Stein |first4=Benno Maria |title=On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia |date=2014 |isbn=978-147995722-4 |issn=15294188 |doi=10.1109/DEXA.2014.52 |url=https://wikipediaquality.com/wiki/On_the_use_of_Reliable-Negatives_Selection_Strategies_in_the_PU_Learning_Approach_for_Quality_Flaws_Prediction_in_Wikipedia |journal=International Journal of Information Quality Volume 3, Issue 3, 1 January 2014, pp. 207-227}}

HTML

Ferretti, Edgardo; Errecalde, Marcelo Luis; Anderka, Maik; Stein, Benno Maria. (2014). &quot;<a href="https://wikipediaquality.com/wiki/On_the_use_of_Reliable-Negatives_Selection_Strategies_in_the_PU_Learning_Approach_for_Quality_Flaws_Prediction_in_Wikipedia">On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia</a>&quot;. International Journal of Information Quality Volume 3, Issue 3, 1 January 2014, pp. 207-227. ISBN: 978-147995722-4. ISSN: 15294188. DOI: 10.1109/DEXA.2014.52.