Difference between revisions of "Improving Distributed Representation by Feature Selection of Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Adding infobox)
(Adding embed)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Distributed representation plays an important role in many application of [[Natural Language Processing]] (NLP). Today, Word2Vec model has been getting an attention against the backdrop of the easy access to enormous language data from the Internet such as [[Wikipedia]]. For the effective use of Word2Vec, authors have to concern not only about the improvement of the method itself but also about the process of making training data. In this paper, authors demonstrate that adequate selection of training data can make a great improvement of the performance of Word2Vec compared to existing research. Authors also confirmed that Wikipedia dump data is not a good source of training data as is.
 
Distributed representation plays an important role in many application of [[Natural Language Processing]] (NLP). Today, Word2Vec model has been getting an attention against the backdrop of the easy access to enormous language data from the Internet such as [[Wikipedia]]. For the effective use of Word2Vec, authors have to concern not only about the improvement of the method itself but also about the process of making training data. In this paper, authors demonstrate that adequate selection of training data can make a great improvement of the performance of Word2Vec compared to existing research. Authors also confirmed that Wikipedia dump data is not a good source of training data as is.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Tuan, Dao Van; Sato, Hiroshi. (2017). "[[Improving Distributed Representation by Feature Selection of Wikipedia]]".DOI: 10.1109/acdtj.2017.8259588.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Tuan |first1=Dao Van |last2=Sato |first2=Hiroshi |title=Improving Distributed Representation by Feature Selection of Wikipedia |date=2017 |doi=10.1109/acdtj.2017.8259588 |url=https://wikipediaquality.com/wiki/Improving_Distributed_Representation_by_Feature_Selection_of_Wikipedia}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Tuan, Dao Van; Sato, Hiroshi. (2017). &amp;quot;<a href="https://wikipediaquality.com/wiki/Improving_Distributed_Representation_by_Feature_Selection_of_Wikipedia">Improving Distributed Representation by Feature Selection of Wikipedia</a>&amp;quot;.DOI: 10.1109/acdtj.2017.8259588.
 +
</nowiki>
 +
</code>

Revision as of 07:54, 3 October 2020


Improving Distributed Representation by Feature Selection of Wikipedia
Authors
Dao Van Tuan
Hiroshi Sato
Publication date
2017
DOI
10.1109/acdtj.2017.8259588
Links
Original

Improving Distributed Representation by Feature Selection of Wikipedia - scientific work related to Wikipedia quality published in 2017, written by Dao Van Tuan and Hiroshi Sato.

Overview

Distributed representation plays an important role in many application of Natural Language Processing (NLP). Today, Word2Vec model has been getting an attention against the backdrop of the easy access to enormous language data from the Internet such as Wikipedia. For the effective use of Word2Vec, authors have to concern not only about the improvement of the method itself but also about the process of making training data. In this paper, authors demonstrate that adequate selection of training data can make a great improvement of the performance of Word2Vec compared to existing research. Authors also confirmed that Wikipedia dump data is not a good source of training data as is.

Embed

Wikipedia Quality

Tuan, Dao Van; Sato, Hiroshi. (2017). "[[Improving Distributed Representation by Feature Selection of Wikipedia]]".DOI: 10.1109/acdtj.2017.8259588.

English Wikipedia

{{cite journal |last1=Tuan |first1=Dao Van |last2=Sato |first2=Hiroshi |title=Improving Distributed Representation by Feature Selection of Wikipedia |date=2017 |doi=10.1109/acdtj.2017.8259588 |url=https://wikipediaquality.com/wiki/Improving_Distributed_Representation_by_Feature_Selection_of_Wikipedia}}

HTML

Tuan, Dao Van; Sato, Hiroshi. (2017). &quot;<a href="https://wikipediaquality.com/wiki/Improving_Distributed_Representation_by_Feature_Selection_of_Wikipedia">Improving Distributed Representation by Feature Selection of Wikipedia</a>&quot;.DOI: 10.1109/acdtj.2017.8259588.