Analysis of Cluster Structure in Large-Scale English Wikipedia Category Networks

From Wikipedia Quality
Revision as of 10:35, 18 December 2019 by Isabelle (talk | contribs) (Embed)
Jump to: navigation, search


Analysis of Cluster Structure in Large-Scale English Wikipedia Category Networks
Authors
Thidawan Klaysri
Trevor I. Fenner
Oded Lachish
Mark Levene
Panagiotis Papapetrou
Publication date
2013
DOI
10.1007/978-3-642-41398-8_23
Links
Original

Analysis of Cluster Structure in Large-Scale English Wikipedia Category Networks - scientific work related to Wikipedia quality published in 2013, written by Thidawan Klaysri, Trevor I. Fenner, Oded Lachish, Mark Levene and Panagiotis Papapetrou.

Overview

In this paper authors propose a framework for analysing the structure of a large-scale social media network, a topic of significant recent interest. Authors study is focused on the Wikipedia category network, where nodes correspond to Wikipedia categories and edges connect two nodes if the nodes share at least one common page within the Wikipedia network. Moreover, each edge is given a weight that corresponds to the number of pages shared between the two categories that it connects. Authors study the structure of category clusters within the three complete English Wikipedia category networks from 2010 to 2012. Authors observe that category clusters appear in the form of well-connected components that are naturally clustered together. For each dataset authors obtain a graph, which authors call the t-filtered category graph, by retaining just a single edge linking each pair of categories for which the weight of the edge exceeds some specified threshold t. Authors framework exploits this graph structure and identifies connected components within the t-filtered category graph. Authors studied the large-scale structural properties of the three Wikipedia category networks using the proposed approach. Authors found that the number of categories, the number of clusters of size two, and the size of the largest cluster within the graph all appear to follow power laws in the threshold t. Furthermore, for each network authors found the value of the threshold t for which increasing the threshold to t+1 caused the "giant" largest cluster to diffuse into two or more smaller clusters of significant size and studied the semantics behind this diffusion.

Embed

Wikipedia Quality

Klaysri, Thidawan; Fenner, Trevor I.; Lachish, Oded; Levene, Mark; Papapetrou, Panagiotis. (2013). "[[Analysis of Cluster Structure in Large-Scale English Wikipedia Category Networks]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-41398-8_23.

English Wikipedia

{{cite journal |last1=Klaysri |first1=Thidawan |last2=Fenner |first2=Trevor I. |last3=Lachish |first3=Oded |last4=Levene |first4=Mark |last5=Papapetrou |first5=Panagiotis |title=Analysis of Cluster Structure in Large-Scale English Wikipedia Category Networks |date=2013 |doi=10.1007/978-3-642-41398-8_23 |url=https://wikipediaquality.com/wiki/Analysis_of_Cluster_Structure_in_Large-Scale_English_Wikipedia_Category_Networks |journal=Springer, Berlin, Heidelberg}}

HTML

Klaysri, Thidawan; Fenner, Trevor I.; Lachish, Oded; Levene, Mark; Papapetrou, Panagiotis. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Analysis_of_Cluster_Structure_in_Large-Scale_English_Wikipedia_Category_Networks">Analysis of Cluster Structure in Large-Scale English Wikipedia Category Networks</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-41398-8_23.