To Link or Not to Link: Ranking Hyperlinks in Wikipedia Using Collective Attention



Authors: Philip Thruesen, Jaroslav Cechak, Blandine Seznec, Roel Castalio, Nattiya Kanhabua
Publication date: 2016
DOI: 10.1109/BigData.2016.7840785

To Link or Not to Link: Ranking Hyperlinks in Wikipedia Using Collective Attention is a scientific work related to Wikipedia quality, published in 2016 and written by Philip Thruesen, Jaroslav Cechak, Blandine Seznec, Roel Castalio and Nattiya Kanhabua.

Overview

Wikipedia is one of the fastest-growing websites and a primary source of knowledge on the Internet. Because it is a wiki, its content is crowd-sourced by its users. This has many benefits and is one of the main reasons the English version has grown to more than 5 million articles. It also raises issues that are difficult for editors to deal with, such as the overlinking of articles. In this paper, the authors treat overlinking in Wikipedia as a ranking problem. They apply Learning to Rank algorithms to the click frequency of links in order to identify the links that are most useful to readers. To do this, they build a ground truth that serves as a baseline for the algorithms and compare hyperlink features to select the most advantageous ones. The results show 86.2% accuracy with the six most useful features and 87.7% accuracy with the complete feature set. Based on these results, the authors outline a solution to the overlinking problem: removing the least useful links could improve the readability of Wikipedia articles while preserving most of their useful links.
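The idea of ranking links by click frequency can be illustrated with a small sketch. This summary does not say which Learning to Rank algorithms or hyperlink features the paper uses, so everything below is an assumption made for illustration: the link records, the three features, the click counts, and the choice of a simple pairwise perceptron as a stand-in ranker. Click counts act as the ground truth, the ranker learns feature weights that reproduce the click ordering, and the lowest-ranked links become removal candidates.

```python
from itertools import combinations

# Hypothetical link records: (anchor text, feature vector, observed clicks).
# The features are illustrative assumptions, not the paper's feature set:
# [relative position in article (0 = top), in lead section (0/1), scaled target in-degree]
LINKS = [
    ("Denmark",    [0.05, 1.0, 0.9], 420),
    ("Copenhagen", [0.10, 1.0, 0.7], 310),
    ("Population", [0.40, 0.0, 0.8], 35),
    ("2016",       [0.55, 0.0, 0.6], 4),
    ("Kingdom",    [0.80, 0.0, 0.3], 2),
]


def score(weights, features):
    """Linear relevance score of one link."""
    return sum(w * x for w, x in zip(weights, features))


def train_pairwise(links, epochs=50, lr=0.1):
    """Pairwise perceptron: whenever the more-clicked link of a pair does not
    score strictly higher, move the weights toward the feature difference."""
    weights = [0.0] * len(links[0][1])
    for _ in range(epochs):
        for (_, fa, ca), (_, fb, cb) in combinations(links, 2):
            if ca == cb:
                continue  # ties carry no ordering information
            hi, lo = (fa, fb) if ca > cb else (fb, fa)
            if score(weights, hi) - score(weights, lo) <= 0:
                weights = [w + lr * (h - l) for w, h, l in zip(weights, hi, lo)]
    return weights


def pairwise_accuracy(weights, links):
    """Fraction of link pairs whose predicted order matches the click order."""
    correct = total = 0
    for (_, fa, ca), (_, fb, cb) in combinations(links, 2):
        if ca == cb:
            continue
        total += 1
        if (score(weights, fa) > score(weights, fb)) == (ca > cb):
            correct += 1
    return correct / total if total else 0.0


def rank_links(weights, links):
    """Anchors sorted from most to least useful according to the model."""
    ordered = sorted(links, key=lambda link: score(weights, link[1]), reverse=True)
    return [anchor for anchor, _, _ in ordered]


def links_to_remove(weights, links, keep=3):
    """Removal candidates: every link ranked below the top `keep`."""
    return rank_links(weights, links)[keep:]


if __name__ == "__main__":
    w = train_pairwise(LINKS)
    print("pairwise accuracy:", pairwise_accuracy(w, LINKS))
    print("ranking:", rank_links(w, LINKS))
    print("removal candidates:", links_to_remove(w, LINKS, keep=3))
```

The accuracy computed here is the share of correctly ordered pairs on the toy training data; it is not necessarily the accuracy measure behind the 86.2% and 87.7% figures reported above. The `keep` threshold is likewise a hypothetical parameter; in practice the cut-off would be chosen by editors or tuned against reader behaviour.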

Embed

Wikipedia Quality

Thruesen, Philip; Cechak, Jaroslav; Seznec, Blandine; Castalio, Roel; Kanhabua, Nattiya. (2016). "[[To Link or Not to Link: Ranking Hyperlinks in Wikipedia Using Collective Attention]]". IEEE Signal Processing Society. DOI: 10.1109/BigData.2016.7840785.

English Wikipedia

{{cite journal |last1=Thruesen |first1=Philip |last2=Cechak |first2=Jaroslav |last3=Seznec |first3=Blandine |last4=Castalio |first4=Roel |last5=Kanhabua |first5=Nattiya |title=To Link or Not to Link: Ranking Hyperlinks in Wikipedia Using Collective Attention |date=2016 |doi=10.1109/BigData.2016.7840785 |url=https://wikipediaquality.com/wiki/To_Link_or_Not_to_Link:_Ranking_Hyperlinks_in_Wikipedia_Using_Collective_Attention |journal=IEEE Signal Processing Society}}

HTML

Thruesen, Philip; Cechak, Jaroslav; Seznec, Blandine; Castalio, Roel; Kanhabua, Nattiya. (2016). &quot;<a href="https://wikipediaquality.com/wiki/To_Link_or_Not_to_Link:_Ranking_Hyperlinks_in_Wikipedia_Using_Collective_Attention">To Link or Not to Link: Ranking Hyperlinks in Wikipedia Using Collective Attention</a>&quot;. IEEE Signal Processing Society. DOI: 10.1109/BigData.2016.7840785.