Difference between revisions of "Finding Missing Cross-Language Links in Wikipedia"
(wikilinks) |
(Cats.) |
(2 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
+ | {{Infobox work | ||
+ | | title = Finding Missing Cross-Language Links in Wikipedia | ||
+ | | date = 2013 | ||
+ | | authors = [[Carlos Eduardo M. Moreira]]<br />[[Viviane Pereira Moreira]] | ||
+ | | link = https://seer.ufmg.br/index.php/jidm/article/download/245/198 | ||
+ | }} | ||
'''Finding Missing Cross-Language Links in Wikipedia''' is a scientific work related to [[Wikipedia quality]], published in 2013 and written by [[Carlos Eduardo M. Moreira]] and [[Viviane Pereira Moreira]].
== Overview ==
Wikipedia is a public encyclopedia composed of millions of articles written daily by volunteer authors from different regions of the world. The articles contain links, called cross-language links, that relate corresponding articles across [[different language]]s. This feature is extremely useful for applications that work with automatic translation and [[multilingual]] [[information retrieval]], as it allows the assembly of comparable corpora. Thus, it is important to have a mechanism that automatically creates such links. This need has motivated the development of techniques to identify missing cross-language links. In this article, the authors present CLLFinder, an approach for finding missing cross-language links.
The approach makes use of the links between [[categories]] and of the transitivity between existing cross-language links, as well as textual [[features]] extracted from the articles. Experiments using one million articles from the English and Portuguese [[Wikipedia]]s attest to the viability of CLLFinder. The results show that the approach has a recall of 96% and a precision of 98%, outperforming the baseline system, even though the authors employ fewer and simpler features.
+ | |||
+ | == Embed == | ||
+ | === Wikipedia Quality === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Moreira, Carlos Eduardo M.; Moreira, Viviane Pereira. (2013). "[[Finding Missing Cross-Language Links in Wikipedia]]". | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === English Wikipedia === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | {{cite journal |last1=Moreira |first1=Carlos Eduardo M. |last2=Moreira |first2=Viviane Pereira |title=Finding Missing Cross-Language Links in Wikipedia |date=2013 |url=https://wikipediaquality.com/wiki/Finding_Missing_Cross-Language_Links_in_Wikipedia}} | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === HTML === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Moreira, Carlos Eduardo M.; Moreira, Viviane Pereira. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Finding_Missing_Cross-Language_Links_in_Wikipedia">Finding Missing Cross-Language Links in Wikipedia</a>&quot;. | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | |||
+ | |||
+ | [[Category:Scientific works]] | ||
+ | [[Category:English Wikipedia]] | ||
+ | [[Category:Portuguese Wikipedia]] |
Latest revision as of 15:34, 7 December 2019
Authors | Carlos Eduardo M. Moreira, Viviane Pereira Moreira |
---|---|
Publication date | 2013 |
Links | Original: https://seer.ufmg.br/index.php/jidm/article/download/245/198 |
Finding Missing Cross-Language Links in Wikipedia is a scientific work related to Wikipedia quality, published in 2013 and written by Carlos Eduardo M. Moreira and Viviane Pereira Moreira.
Overview
Wikipedia is a public encyclopedia composed of millions of articles written daily by volunteer authors from different regions of the world. The articles contain links, called cross-language links, that relate corresponding articles across different languages. This feature is extremely useful for applications that work with automatic translation and multilingual information retrieval, as it allows the assembly of comparable corpora. Thus, it is important to have a mechanism that automatically creates such links. This need has motivated the development of techniques to identify missing cross-language links. In this article, the authors present CLLFinder, an approach for finding missing cross-language links. The approach makes use of the links between categories and of the transitivity between existing cross-language links, as well as textual features extracted from the articles. Experiments using one million articles from the English and Portuguese Wikipedias attest to the viability of CLLFinder. The results show that the approach has a recall of 96% and a precision of 98%, outperforming the baseline system, even though the authors employ fewer and simpler features.
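The overview mentions transitivity between existing cross-language links as one source of evidence. The following is a minimal sketch of that general idea only, not the CLLFinder implementation: if an English article has no Portuguese link but links to articles in other languages that do, those Portuguese titles become candidates. The toy `LINKS` data, the function name `transitive_candidates`, and the one-hop restriction are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy data: (language, title) -> {target language: target title}.
LINKS = {
    ("en", "Moon"): {"es": "Luna", "fr": "Lune"},
    ("es", "Luna"): {"en": "Moon", "pt": "Lua"},
    ("fr", "Lune"): {"en": "Moon", "pt": "Lua"},
    ("en", "Sun"): {"pt": "Sol"},
}


def transitive_candidates(links, source, target_lang):
    """Count target-language titles reachable from `source` via one intermediate language."""
    direct = links.get(source, {})
    if target_lang in direct:
        # The cross-language link already exists, so nothing is missing.
        return Counter()
    votes = Counter()
    for lang, title in direct.items():
        # Follow the existing link, then look for a link into the target language.
        hop = links.get((lang, title), {})
        if target_lang in hop:
            votes[hop[target_lang]] += 1
    return votes


candidates = transitive_candidates(LINKS, ("en", "Moon"), "pt")
print(candidates.most_common())  # [('Lua', 2)]
```

Counting how many intermediate languages agree on the same target title gives a simple ranking signal. According to the overview, the approach also draws on links between categories and on textual features, neither of which is modelled in this sketch.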
Embed
Wikipedia Quality
Moreira, Carlos Eduardo M.; Moreira, Viviane Pereira. (2013). "[[Finding Missing Cross-Language Links in Wikipedia]]".
English Wikipedia
{{cite journal |last1=Moreira |first1=Carlos Eduardo M. |last2=Moreira |first2=Viviane Pereira |title=Finding Missing Cross-Language Links in Wikipedia |date=2013 |url=https://wikipediaquality.com/wiki/Finding_Missing_Cross-Language_Links_in_Wikipedia}}
HTML
Moreira, Carlos Eduardo M.; Moreira, Viviane Pereira. (2013). "<a href="https://wikipediaquality.com/wiki/Finding_Missing_Cross-Language_Links_in_Wikipedia">Finding Missing Cross-Language Links in Wikipedia</a>".