Difference between revisions of "Unsupervised Construction of a Word List on Tourism from Wikipedia"
(infobox) |
(Embed) |
||
Line 10: | Line 10: | ||
== Overview == | == Overview == | ||
The demand for word lists in a specialized domain is increasing in language learning. Authors propose an unsupervised framework to extract a word list from [[Wikipedia]] data for a language learning class specialized on tourism. Authors extract topics in Wikipedia articles using non-negative matrix factorization. Each topic is classified as tourism related or not using articles in WikiVoyage. Authors choose paragraphs in Wikipedia that are classified as in-domain and rank words in such paragraphs by their frequencies. The proposed framework retrieves more than 90% of words in the gold list, but the extracted list still includes a large number of general terms. | The demand for word lists in a specialized domain is increasing in language learning. Authors propose an unsupervised framework to extract a word list from [[Wikipedia]] data for a language learning class specialized on tourism. Authors extract topics in Wikipedia articles using non-negative matrix factorization. Each topic is classified as tourism related or not using articles in WikiVoyage. Authors choose paragraphs in Wikipedia that are classified as in-domain and rank words in such paragraphs by their frequencies. The proposed framework retrieves more than 90% of words in the gold list, but the extracted list still includes a large number of general terms. | ||
+ | |||
+ | == Embed == | ||
+ | === Wikipedia Quality === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). "[[Unsupervised Construction of a Word List on Tourism from Wikipedia]]".DOI: 10.1109/ICSEC.2015.7401412. | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === English Wikipedia === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | {{cite journal |last1=Wanvarie |first1=Dittaya |last2=Ek-atchariya |first2=Sansanee |last3=Kaewwipat |first3=Thanakon |title=Unsupervised Construction of a Word List on Tourism from Wikipedia |date=2015 |doi=10.1109/ICSEC.2015.7401412 |url=https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia}} | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === HTML === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). &quot;<a href="https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia">Unsupervised Construction of a Word List on Tourism from Wikipedia</a>&quot;.DOI: 10.1109/ICSEC.2015.7401412. | ||
+ | </nowiki> | ||
+ | </code> |
Revision as of 09:47, 2 July 2020
Authors | Dittaya Wanvarie Sansanee Ek-atchariya Thanakon Kaewwipat |
---|---|
Publication date | 2015 |
DOI | 10.1109/ICSEC.2015.7401412 |
Links | Original |
Unsupervised Construction of a Word List on Tourism from Wikipedia - scientific work related to Wikipedia quality published in 2015, written by Dittaya Wanvarie, Sansanee Ek-atchariya and Thanakon Kaewwipat.
Overview
The demand for word lists in a specialized domain is increasing in language learning. Authors propose an unsupervised framework to extract a word list from Wikipedia data for a language learning class specialized on tourism. Authors extract topics in Wikipedia articles using non-negative matrix factorization. Each topic is classified as tourism related or not using articles in WikiVoyage. Authors choose paragraphs in Wikipedia that are classified as in-domain and rank words in such paragraphs by their frequencies. The proposed framework retrieves more than 90% of words in the gold list, but the extracted list still includes a large number of general terms.
Embed
Wikipedia Quality
Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). "[[Unsupervised Construction of a Word List on Tourism from Wikipedia]]".DOI: 10.1109/ICSEC.2015.7401412.
English Wikipedia
{{cite journal |last1=Wanvarie |first1=Dittaya |last2=Ek-atchariya |first2=Sansanee |last3=Kaewwipat |first3=Thanakon |title=Unsupervised Construction of a Word List on Tourism from Wikipedia |date=2015 |doi=10.1109/ICSEC.2015.7401412 |url=https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia}}
HTML
Wanvarie, Dittaya; Ek-atchariya, Sansanee; Kaewwipat, Thanakon. (2015). "<a href="https://wikipediaquality.com/wiki/Unsupervised_Construction_of_a_Word_List_on_Tourism_from_Wikipedia">Unsupervised Construction of a Word List on Tourism from Wikipedia</a>".DOI: 10.1109/ICSEC.2015.7401412.