Difference between revisions of "Extraction of Bilingual Cognates from Wikipedia"
(Overview - Extraction of Bilingual Cognates from Wikipedia) |
(Wikilinks) |
||
Line 1: | Line 1: | ||
− | '''Extraction of Bilingual Cognates from Wikipedia''' - scientific work related to Wikipedia quality published in 2012, written by Pablo Gamallo and Marcos Garcia. | + | '''Extraction of Bilingual Cognates from Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2012, written by [[Pablo Gamallo]] and [[Marcos Garcia]]. |
== Overview == | == Overview == | ||
− | In this article, authors propose a method to extract translation equivalents with similar spelling from comparable corpora. The method was applied on Wikipedia to extract a large amount of Portuguese-Spanish bilingual terminological pairs that were not found in existing dictionaries. The resulting bilingual lexicons consists of more than 27,000 new pairs of lemmas and multiwords, with about 92% accuracy. | + | In this article, authors propose a method to extract translation equivalents with similar spelling from comparable corpora. The method was applied on [[Wikipedia]] to extract a large amount of Portuguese-Spanish bilingual terminological pairs that were not found in existing dictionaries. The resulting bilingual lexicons consists of more than 27,000 new pairs of lemmas and multiwords, with about 92% accuracy. |
Revision as of 22:16, 2 July 2019
Extraction of Bilingual Cognates from Wikipedia - scientific work related to Wikipedia quality published in 2012, written by Pablo Gamallo and Marcos Garcia.
Overview
In this article, authors propose a method to extract translation equivalents with similar spelling from comparable corpora. The method was applied on Wikipedia to extract a large amount of Portuguese-Spanish bilingual terminological pairs that were not found in existing dictionaries. The resulting bilingual lexicons consists of more than 27,000 new pairs of lemmas and multiwords, with about 92% accuracy.