Linguistic Influence Patterns Within the Global Network of Wikipedia Language Editions

From Wikipedia Quality
Revision as of 20:12, 6 June 2019 by Zoe (talk | contribs) (Starting a page: Linguistic Influence Patterns Within the Global Network of Wikipedia Language Editions)

Linguistic Influence Patterns Within the Global Network of Wikipedia Language Editions is a scientific work related to Wikipedia quality, published in 2015 and written by Anna Samoilenko, Fariba Karimi, Jérôme Kunegis, Daniel Edler and Markus Strohmaier.

Overview

The Internet is highly multilingual, and its content is created, shared, debated and shaped within many different language communities. These communities do not exist in isolation; they communicate with each other and influence each other's interests, just as in the offline world. Quantifying this influence is, however, a non-trivial task, as these communities are usually spread across multiple heterogeneous platforms. In this work, the authors set out to measure the influence of languages on each other by observing concept overlap between the 110 largest Wikipedia language editions. They describe experiments that test whether language overlap in concept coverage is a random process, and find that edition size is a strong predictor of higher concept overlap, with English-German being the most frequently co-occurring pair (45%). Both small and large editions co-occur more frequently than expected with editions of similar size, but co-occurrences across the two groups are below what is expected by chance. Additionally, by applying network analysis, the authors find that the hierarchy of language interconnections differs depending on the locality of topics: for interlingually popular topics, the dominance of English, German and French is pronounced, while for topics with a local reach, geographical and cultural proximity as well as common heritage explain co-occurrence better.
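
The idea of comparing observed concept overlap with what chance alone would produce can be illustrated with a minimal Python sketch. This is not the authors' code or their exact null model: the toy `concepts` mapping, the function names `pair_counts` and `null_expected`, and the choice of a uniform random reassignment of editions are all assumptions made for illustration. The sketch counts how often each pair of editions covers the same concept, then estimates the expected count under a simple null model that keeps only the number of editions per concept.

```python
import random
from collections import Counter
from itertools import combinations

# Hypothetical toy data (not from the paper): concept id -> editions covering it.
concepts = {
    "c1": {"en", "de", "fr"},
    "c2": {"en", "de"},
    "c3": {"en", "fr"},
    "c4": {"de", "en"},
    "c5": {"pl", "cs"},
    "c6": {"pl", "cs", "en"},
}


def pair_counts(concept_langs):
    """Count, for each unordered pair of editions, the concepts both cover."""
    counts = Counter()
    for langs in concept_langs.values():
        for pair in combinations(sorted(langs), 2):
            counts[pair] += 1
    return counts


def null_expected(concept_langs, n_runs=200, seed=0):
    """Average pair counts under an assumed null model.

    Each concept keeps its number of editions, but the editions themselves
    are drawn uniformly at random, so edition size is deliberately ignored.
    """
    rng = random.Random(seed)
    all_langs = sorted(set().union(*concept_langs.values()))
    totals = Counter()
    for _ in range(n_runs):
        shuffled = {
            concept: set(rng.sample(all_langs, len(langs)))
            for concept, langs in concept_langs.items()
        }
        totals.update(pair_counts(shuffled))
    return {pair: total / n_runs for pair, total in totals.items()}


observed = pair_counts(concepts)
expected = null_expected(concepts)

# Pairs with a ratio above 1 co-occur more often than this null model predicts.
for pair, count in observed.most_common():
    ratio = count / expected[pair] if expected.get(pair) else float("inf")
    print(pair, count, round(ratio, 2))
```

Because this null model ignores edition size, large editions will tend to look over-represented; a closer analogue of the study would also need to account for the size of each edition when drawing random assignments.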