Difference between revisions of "Disambiguation to Wikipedia: a Language and Domain Independent Approach"

From Wikipedia Quality
Jump to: navigation, search
(Infobox)
(Embed for English Wikipedia, HTML)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Disambiguation to [[Wikipedia]] (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia articles. Traditional approaches to D2W has focused either in only one language (e.g. English) or in formal texts (e.g. news articles). In this paper, authors present a [[multilingual]] framework with a set of new [[features]] that can be obtained purely from the online encyclopedia, without the need of any natural language specific tool. Authors analyze these features with [[different language]]s and different domains. The approach shows as fully language-independent and has been applied successfully to English, Italian, Polish, with a consistent improvement. Authors show that only a sufficient number of Wikipedia articles is needed for training. When trained on real-world data sets for English, new features yield substantial improvement compared to current local and global disambiguation algorithms. Finally, the adaption to the Bridgeman query logs in digital libraries shows the robustness of approach even in the lack of disambiguation context. Also, as no natural language specific tool is needed, the method can be applied to other languages in a similar manner with little adaptation.
 
Disambiguation to [[Wikipedia]] (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia articles. Traditional approaches to D2W has focused either in only one language (e.g. English) or in formal texts (e.g. news articles). In this paper, authors present a [[multilingual]] framework with a set of new [[features]] that can be obtained purely from the online encyclopedia, without the need of any natural language specific tool. Authors analyze these features with [[different language]]s and different domains. The approach shows as fully language-independent and has been applied successfully to English, Italian, Polish, with a consistent improvement. Authors show that only a sufficient number of Wikipedia articles is needed for training. When trained on real-world data sets for English, new features yield substantial improvement compared to current local and global disambiguation algorithms. Finally, the adaption to the Bridgeman query logs in digital libraries shows the robustness of approach even in the lack of disambiguation context. Also, as no natural language specific tool is needed, the method can be applied to other languages in a similar manner with little adaptation.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Nguyen, Truc-Vien T.. (2013). "[[Disambiguation to Wikipedia: a Language and Domain Independent Approach]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-45068-6_27.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Nguyen |first1=Truc-Vien T. |title=Disambiguation to Wikipedia: a Language and Domain Independent Approach |date=2013 |doi=10.1007/978-3-642-45068-6_27 |url=https://wikipediaquality.com/wiki/Disambiguation_to_Wikipedia:_a_Language_and_Domain_Independent_Approach |journal=Springer, Berlin, Heidelberg}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Nguyen, Truc-Vien T.. (2013). &amp;quot;<a href="https://wikipediaquality.com/wiki/Disambiguation_to_Wikipedia:_a_Language_and_Domain_Independent_Approach">Disambiguation to Wikipedia: a Language and Domain Independent Approach</a>&amp;quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-45068-6_27.
 +
</nowiki>
 +
</code>

Revision as of 06:39, 21 February 2021


Disambiguation to Wikipedia: a Language and Domain Independent Approach
Authors
Truc-Vien T. Nguyen
Publication date
2013
DOI
10.1007/978-3-642-45068-6_27
Links
Original

Disambiguation to Wikipedia: a Language and Domain Independent Approach - scientific work related to Wikipedia quality published in 2013, written by Truc-Vien T. Nguyen.

Overview

Disambiguation to Wikipedia (D2W) is the task of linking mentions of concepts in text to their corresponding Wikipedia articles. Traditional approaches to D2W has focused either in only one language (e.g. English) or in formal texts (e.g. news articles). In this paper, authors present a multilingual framework with a set of new features that can be obtained purely from the online encyclopedia, without the need of any natural language specific tool. Authors analyze these features with different languages and different domains. The approach shows as fully language-independent and has been applied successfully to English, Italian, Polish, with a consistent improvement. Authors show that only a sufficient number of Wikipedia articles is needed for training. When trained on real-world data sets for English, new features yield substantial improvement compared to current local and global disambiguation algorithms. Finally, the adaption to the Bridgeman query logs in digital libraries shows the robustness of approach even in the lack of disambiguation context. Also, as no natural language specific tool is needed, the method can be applied to other languages in a similar manner with little adaptation.

Embed

Wikipedia Quality

Nguyen, Truc-Vien T.. (2013). "[[Disambiguation to Wikipedia: a Language and Domain Independent Approach]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-45068-6_27.

English Wikipedia

{{cite journal |last1=Nguyen |first1=Truc-Vien T. |title=Disambiguation to Wikipedia: a Language and Domain Independent Approach |date=2013 |doi=10.1007/978-3-642-45068-6_27 |url=https://wikipediaquality.com/wiki/Disambiguation_to_Wikipedia:_a_Language_and_Domain_Independent_Approach |journal=Springer, Berlin, Heidelberg}}

HTML

Nguyen, Truc-Vien T.. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Disambiguation_to_Wikipedia:_a_Language_and_Domain_Independent_Approach">Disambiguation to Wikipedia: a Language and Domain Independent Approach</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-45068-6_27.