Difference between revisions of "Wikipedia Information Flow Analysis Reveals the Scale-Free Architecture of the Semantic Space"

From Wikipedia Quality
Jump to: navigation, search
(Links)
(Infobox)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Wikipedia Information Flow Analysis Reveals the Scale-Free Architecture of the Semantic Space
 +
| date = 2011
 +
| authors = [[Adolfo Paolo Masucci]]<br />[[Alkiviadis Kalampokis]]<br />[[Víctor M. Eguíluz]]<br />[[Emilio Hernández-García]]
 +
| doi = 10.1371/journal.pone.0017333
 +
| link = http://www.sciencedirect.com/science/article/pii/S0263237311000053
 +
| plink = http://arxiv.org/pdf/1102.0651.pdf
 +
}}
 
'''Wikipedia Information Flow Analysis Reveals the Scale-Free Architecture of the Semantic Space''' - scientific work related to [[Wikipedia quality]] published in 2011, written by [[Adolfo Paolo Masucci]], [[Alkiviadis Kalampokis]], [[Víctor M. Eguíluz]] and [[Emilio Hernández-García]].
 
'''Wikipedia Information Flow Analysis Reveals the Scale-Free Architecture of the Semantic Space''' - scientific work related to [[Wikipedia quality]] published in 2011, written by [[Adolfo Paolo Masucci]], [[Alkiviadis Kalampokis]], [[Víctor M. Eguíluz]] and [[Emilio Hernández-García]].
  
 
== Overview ==
 
== Overview ==
 
In this paper authors extract the topology of the semantic space in its encyclopedic acception, measuring the semantic flow between the different entries of the largest modern encyclopedia, [[Wikipedia]], and thus creating a directed complex network of semantic flows. Notably at the percolation threshold the semantic space is characterised by scale-free behaviour at different levels of complexity and this relates the semantic space to a wide range of biological, social and linguistics phenomena. In particular authors find that the cluster size distribution, representing the size of different semantic areas, is scale-free. Moreover the topology of the resulting semantic space is scale-free in the connectivity distribution and displays small-world properties. However its statistical properties do not allow a classical interpretation via a generative model based on a simple multiplicative process. After giving a detailed description and interpretation of the topological properties of the semantic space, authors introduce a stochastic model of content-based network, based on a copy and mutation algorithm and on the Heaps' law, that is able to capture the main statistical properties of the analysed semantic space, including the Zipf's law for the word frequency distribution.
 
In this paper authors extract the topology of the semantic space in its encyclopedic acception, measuring the semantic flow between the different entries of the largest modern encyclopedia, [[Wikipedia]], and thus creating a directed complex network of semantic flows. Notably at the percolation threshold the semantic space is characterised by scale-free behaviour at different levels of complexity and this relates the semantic space to a wide range of biological, social and linguistics phenomena. In particular authors find that the cluster size distribution, representing the size of different semantic areas, is scale-free. Moreover the topology of the resulting semantic space is scale-free in the connectivity distribution and displays small-world properties. However its statistical properties do not allow a classical interpretation via a generative model based on a simple multiplicative process. After giving a detailed description and interpretation of the topological properties of the semantic space, authors introduce a stochastic model of content-based network, based on a copy and mutation algorithm and on the Heaps' law, that is able to capture the main statistical properties of the analysed semantic space, including the Zipf's law for the word frequency distribution.

Revision as of 09:47, 14 November 2019


Wikipedia Information Flow Analysis Reveals the Scale-Free Architecture of the Semantic Space
Authors
Adolfo Paolo Masucci
Alkiviadis Kalampokis
Víctor M. Eguíluz
Emilio Hernández-García
Publication date
2011
DOI
10.1371/journal.pone.0017333
Links
Original Preprint

Wikipedia Information Flow Analysis Reveals the Scale-Free Architecture of the Semantic Space - scientific work related to Wikipedia quality published in 2011, written by Adolfo Paolo Masucci, Alkiviadis Kalampokis, Víctor M. Eguíluz and Emilio Hernández-García.

Overview

In this paper authors extract the topology of the semantic space in its encyclopedic acception, measuring the semantic flow between the different entries of the largest modern encyclopedia, Wikipedia, and thus creating a directed complex network of semantic flows. Notably at the percolation threshold the semantic space is characterised by scale-free behaviour at different levels of complexity and this relates the semantic space to a wide range of biological, social and linguistics phenomena. In particular authors find that the cluster size distribution, representing the size of different semantic areas, is scale-free. Moreover the topology of the resulting semantic space is scale-free in the connectivity distribution and displays small-world properties. However its statistical properties do not allow a classical interpretation via a generative model based on a simple multiplicative process. After giving a detailed description and interpretation of the topological properties of the semantic space, authors introduce a stochastic model of content-based network, based on a copy and mutation algorithm and on the Heaps' law, that is able to capture the main statistical properties of the analysed semantic space, including the Zipf's law for the word frequency distribution.