On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia
Authors | Edgardo Ferretti Marcelo Luis Errecalde Maik Anderka Benno Maria Stein |
---|---|
Publication date | 2014 |
ISSN | 15294188 |
ISBN | 978-147995722-4 |
DOI | 10.1109/DEXA.2014.52 |
Links |
On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia - scientific work about Wikipedia quality published in 2014, written by Edgardo Ferretti, Marcelo Luis Errecalde, Maik Anderka and Benno Maria Stein.
Overview
Learning from positive and unlabeled examples (PU learning) has proven to be an effective method in several Web mining applications. In particular, in the 1st International Competition on Quality Flaw Prediction in Wikipedia in 2012, a tailored PU learning approach performed best amongst the competitors. A key feature of that approach is the introduction of sampling strategies within the original PU learning procedure. The paper in hand revisits the winner approach of 2012 and elaborates on neglected aspects in order to provide evidence for the usefulness of sampling in PU learning. In this regard, authors propose a modification to this PU learning approach, and authors show how the different sampling strategies affect the flaw prediction effectiveness. Their analysis is based on the original evaluation corpus of the 2012-competition on quality flaw prediction. A main outcome is that under the best sampling strategy, their new modified version of PU learning increases in average the flaw prediction effectiveness by 18.31%, when compared against the winning approach of the competition.
Embed
Wikipedia Quality
Ferretti, Edgardo; Errecalde, Marcelo Luis; Anderka, Maik; Stein, Benno Maria. (2014). "[[On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia]]". International Journal of Information Quality Volume 3, Issue 3, 1 January 2014, pp. 207-227. ISBN: 978-147995722-4. ISSN: 15294188. DOI: 10.1109/DEXA.2014.52.
English Wikipedia
{{cite journal |last1=Ferretti |first1=Edgardo |last2=Errecalde |first2=Marcelo Luis |last3=Anderka |first3=Maik |last4=Stein |first4=Benno Maria |title=On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia |date=2014 |isbn=978-147995722-4 |issn=15294188 |doi=10.1109/DEXA.2014.52 |url=https://wikipediaquality.com/wiki/On_the_use_of_Reliable-Negatives_Selection_Strategies_in_the_PU_Learning_Approach_for_Quality_Flaws_Prediction_in_Wikipedia |journal=International Journal of Information Quality Volume 3, Issue 3, 1 January 2014, pp. 207-227}}
HTML
Ferretti, Edgardo; Errecalde, Marcelo Luis; Anderka, Maik; Stein, Benno Maria. (2014). "<a href="https://wikipediaquality.com/wiki/On_the_use_of_Reliable-Negatives_Selection_Strategies_in_the_PU_Learning_Approach_for_Quality_Flaws_Prediction_in_Wikipedia">On the use of Reliable-Negatives Selection Strategies in the PU Learning Approach for Quality Flaws Prediction in Wikipedia</a>". International Journal of Information Quality Volume 3, Issue 3, 1 January 2014, pp. 207-227. ISBN: 978-147995722-4. ISSN: 15294188. DOI: 10.1109/DEXA.2014.52.