Difference between revisions of "Unsupervised Construction of a Word List on Tourism from Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(infobox)
(Embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
The demand for word lists in a specialized domain is increasing in language learning. Authors propose an unsupervised framework to extract a word list from [[Wikipedia]] data for a language learning class specialized on tourism. Authors extract topics in Wikipedia articles using non-negative matrix factorization. Each topic is classified as tourism related or not using articles in WikiVoyage. Authors choose paragraphs in Wikipedia that are classified as in-domain and rank words in such paragraphs by their frequencies. The proposed framework retrieves more than 90% of words in the gold list, but the extracted list still includes a large number of general terms.
 
The demand for word lists in a specialized domain is increasing in language learning. Authors propose an unsupervised framework to extract a word list from [[Wikipedia]] data for a language learning class specialized on tourism. Authors extract topics in Wikipedia articles using non-negative matrix factorization. Each topic is classified as tourism related or not using articles in WikiVoyage. Authors choose paragraphs in Wikipedia that are classified as in-domain and rank words in such paragraphs by their frequencies. The proposed framework retrieves more than 90% of words in the gold list, but the extracted list still includes a large number of general terms.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). "[[Unsupervised Construction of a Word List on Tourism from Wikipedia]]".DOI: 10.1109/ICSEC.2015.7401412.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Wanvarie |first1=Dittaya |last2=Ek-atchariya |first2=Sansanee |last3=Kaewwipat |first3=Thanakon |title=Unsupervised Construction of a Word List on Tourism from Wikipedia |date=2015 |doi=10.1109/ICSEC.2015.7401412 |url=https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). &amp;quot;<a href="https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia">Unsupervised Construction of a Word List on Tourism from Wikipedia</a>&amp;quot;.DOI: 10.1109/ICSEC.2015.7401412.
 +
</nowiki>
 +
</code>

Revision as of 09:47, 2 July 2020


Unsupervised Construction of a Word List on Tourism from Wikipedia
Authors
Dittaya Wanvarie
Sansanee Ek-atchariya
Thanakon Kaewwipat
Publication date
2015
DOI
10.1109/ICSEC.2015.7401412
Links
Original

Unsupervised Construction of a Word List on Tourism from Wikipedia - scientific work related to Wikipedia quality published in 2015, written by Dittaya Wanvarie, Sansanee Ek-atchariya and Thanakon Kaewwipat.

Overview

The demand for word lists in a specialized domain is increasing in language learning. Authors propose an unsupervised framework to extract a word list from Wikipedia data for a language learning class specialized on tourism. Authors extract topics in Wikipedia articles using non-negative matrix factorization. Each topic is classified as tourism related or not using articles in WikiVoyage. Authors choose paragraphs in Wikipedia that are classified as in-domain and rank words in such paragraphs by their frequencies. The proposed framework retrieves more than 90% of words in the gold list, but the extracted list still includes a large number of general terms.

Embed

Wikipedia Quality

Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). "[[Unsupervised Construction of a Word List on Tourism from Wikipedia]]".DOI: 10.1109/ICSEC.2015.7401412.

English Wikipedia

{{cite journal |last1=Wanvarie |first1=Dittaya |last2=Ek-atchariya |first2=Sansanee |last3=Kaewwipat |first3=Thanakon |title=Unsupervised Construction of a Word List on Tourism from Wikipedia |date=2015 |doi=10.1109/ICSEC.2015.7401412 |url=https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia}}

HTML

Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). &quot;<a href="https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia">Unsupervised Construction of a Word List on Tourism from Wikipedia</a>&quot;.DOI: 10.1109/ICSEC.2015.7401412.