A Cross-Lingual Dictionary for English Wikipedia Concepts

From Wikipedia Quality
Jump to: navigation, search


A Cross-Lingual Dictionary for English Wikipedia Concepts
Authors
Valentin I. Spitkovsky
Angel X. Chang
Publication date
2012
Links
Original

A Cross-Lingual Dictionary for English Wikipedia Concepts - scientific work related to Wikipedia quality published in 2012, written by Valentin I. Spitkovsky and Angel X. Chang.

Overview

Authors present a resource for automatically associating strings of text with English Wikipedia concepts. Authors machinery is bi-directional, in the sense that it uses the same fundamental probabilistic methods to map strings to empirical distributions over Wikipedia articles as it does to map article URLs to distributions over short, language-independent strings of natural language text. For maximal interoperability, authors release resource as a set of flat line-based text files, lexicographically sorted and encoded with UTF-8. These files capture joint probability distributions underlying concepts (we use the terms article, concept and Wikipedia URL interchangeably) and associated snippets of text, as well as other features that can come in handy when working with Wikipedia articles and related information.

Embed

Wikipedia Quality

Spitkovsky, Valentin I.; Chang, Angel X.. (2012). "[[A Cross-Lingual Dictionary for English Wikipedia Concepts]]".

English Wikipedia

{{cite journal |last1=Spitkovsky |first1=Valentin I. |last2=Chang |first2=Angel X. |title=A Cross-Lingual Dictionary for English Wikipedia Concepts |date=2012 |url=https://wikipediaquality.com/wiki/A_Cross-Lingual_Dictionary_for_English_Wikipedia_Concepts}}

HTML

Spitkovsky, Valentin I.; Chang, Angel X.. (2012). &quot;<a href="https://wikipediaquality.com/wiki/A_Cross-Lingual_Dictionary_for_English_Wikipedia_Concepts">A Cross-Lingual Dictionary for English Wikipedia Concepts</a>&quot;.