Difference between revisions of "Improving the Extraction of Bilingual Terminology from Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(+ Infobox work)
(Embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Research on the automatic construction of bilingual dictionaries has achieved impressive results. Bilingual dictionaries are usually constructed from parallel corpora, but since these corpora are available only for selected text domains and language pairs, the potential of other resources is being explored as well. In this article, authors want to further pursue the idea of using [[Wikipedia]] as a corpus for bilingual terminology extraction. Authors propose a method that extracts term-translation pairs from different types of Wikipedia link information. After that, an SVM classifier trained on the [[features]] of manually labeled training data determines the correctness of unseen term-translation pairs.
 
Research on the automatic construction of bilingual dictionaries has achieved impressive results. Bilingual dictionaries are usually constructed from parallel corpora, but since these corpora are available only for selected text domains and language pairs, the potential of other resources is being explored as well. In this article, authors want to further pursue the idea of using [[Wikipedia]] as a corpus for bilingual terminology extraction. Authors propose a method that extracts term-translation pairs from different types of Wikipedia link information. After that, an SVM classifier trained on the [[features]] of manually labeled training data determines the correctness of unseen term-translation pairs.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Erdmann, Maike; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2009). "[[Improving the Extraction of Bilingual Terminology from Wikipedia]]".DOI: 10.1145/1596990.1596995.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Erdmann |first1=Maike |last2=Nakayama |first2=Kotaro |last3=Hara |first3=Takahiro |last4=Nishio |first4=Shojiro |title=Improving the Extraction of Bilingual Terminology from Wikipedia |date=2009 |doi=10.1145/1596990.1596995 |url=https://wikipediaquality.com/wiki/Improving_the_Extraction_of_Bilingual_Terminology_from_Wikipedia}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Erdmann, Maike; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2009). &amp;quot;<a href="https://wikipediaquality.com/wiki/Improving_the_Extraction_of_Bilingual_Terminology_from_Wikipedia">Improving the Extraction of Bilingual Terminology from Wikipedia</a>&amp;quot;.DOI: 10.1145/1596990.1596995.
 +
</nowiki>
 +
</code>

Revision as of 09:27, 20 November 2019


Improving the Extraction of Bilingual Terminology from Wikipedia
Authors
Maike Erdmann
Kotaro Nakayama
Takahiro Hara
Shojiro Nishio
Publication date
2009
DOI
10.1145/1596990.1596995
Links
Original

Improving the Extraction of Bilingual Terminology from Wikipedia - scientific work related to Wikipedia quality published in 2009, written by Maike Erdmann, Kotaro Nakayama, Takahiro Hara and Shojiro Nishio.

Overview

Research on the automatic construction of bilingual dictionaries has achieved impressive results. Bilingual dictionaries are usually constructed from parallel corpora, but since these corpora are available only for selected text domains and language pairs, the potential of other resources is being explored as well. In this article, authors want to further pursue the idea of using Wikipedia as a corpus for bilingual terminology extraction. Authors propose a method that extracts term-translation pairs from different types of Wikipedia link information. After that, an SVM classifier trained on the features of manually labeled training data determines the correctness of unseen term-translation pairs.

Embed

Wikipedia Quality

Erdmann, Maike; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2009). "[[Improving the Extraction of Bilingual Terminology from Wikipedia]]".DOI: 10.1145/1596990.1596995.

English Wikipedia

{{cite journal |last1=Erdmann |first1=Maike |last2=Nakayama |first2=Kotaro |last3=Hara |first3=Takahiro |last4=Nishio |first4=Shojiro |title=Improving the Extraction of Bilingual Terminology from Wikipedia |date=2009 |doi=10.1145/1596990.1596995 |url=https://wikipediaquality.com/wiki/Improving_the_Extraction_of_Bilingual_Terminology_from_Wikipedia}}

HTML

Erdmann, Maike; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Improving_the_Extraction_of_Bilingual_Terminology_from_Wikipedia">Improving the Extraction of Bilingual Terminology from Wikipedia</a>&quot;.DOI: 10.1145/1596990.1596995.