LDA-Based Topic Modeling in Labeling Blog Posts with Wikipedia Entries

LDA-Based Topic Modeling in Labeling Blog Posts with Wikipedia Entries
Authors
Daisuke Yokomoto
Kensaku Makita
Hiroko Suzuki
Daichi Koike
Takehito Utsuro
Yasuhide Kawada
Tomohiro Fukuhara
Publication date
2012
DOI
10.1007/978-3-642-29426-6_15

LDA-Based Topic Modeling in Labeling Blog Posts with Wikipedia Entries is a scientific work related to Wikipedia quality, published in 2012 and written by Daisuke Yokomoto, Kensaku Makita, Hiroko Suzuki, Daichi Koike, Takehito Utsuro, Yasuhide Kawada and Tomohiro Fukuhara.

Overview

Given a search query, most existing search engines simply return a ranked list of results. Those results, however, often mix documents that cover quite different content. To help users quickly overview how content is distributed across the results, the paper proposes a framework for labeling blog posts with Wikipedia entries through topic modeling based on LDA (latent Dirichlet allocation). More specifically, it applies an LDA-based document model to the labeling task. One of the most important advantages of this model is that the collected Wikipedia entries and their LDA parameters depend heavily on the distribution of keywords across all blog posts in the search result. This dependence is what makes the LDA-based topic distribution useful for quickly overviewing the blog posts returned by a search. In the evaluation, the authors also show that the LDA-based document retrieval scheme outperforms a previous approach.
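The general idea of labeling blog posts with Wikipedia entries through LDA topics can be illustrated with a minimal sketch. This is not the procedure from the paper, whose details are not given here. It assumes that entries and posts are modeled together by one small collapsed Gibbs sampler and that each post receives the entry whose topic distribution is most similar by cosine similarity. The function names (`fit_lda`, `label_posts`), the hyperparameters and the toy texts are all hypothetical.

```python
import math
import random
from collections import Counter


def fit_lda(docs, num_topics, alpha=0.1, beta=0.01, iters=200, seed=0):
    """Collapsed Gibbs sampling for LDA; returns one topic distribution per document."""
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    doc_topic = [[0] * num_topics for _ in docs]         # topic counts per document
    topic_word = [Counter() for _ in range(num_topics)]  # word counts per topic
    topic_total = [0] * num_topics                       # token counts per topic
    assign = []
    for d, doc in enumerate(docs):
        zs = [rng.randrange(num_topics) for _ in doc]    # random initial topics
        assign.append(zs)
        for w, z in zip(doc, zs):
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                z = assign[d][i]
                # Take the token out of the counts before resampling its topic.
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    # Smoothed estimate of each document's topic distribution.
    return [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def label_posts(entries, posts, num_topics=2):
    """Assumed labeling rule: pick the entry with the most similar topic distribution."""
    names = list(entries)
    docs = [entries[n].lower().split() for n in names]
    docs += [p.lower().split() for p in posts]
    theta = fit_lda(docs, num_topics)
    entry_theta, post_theta = theta[: len(names)], theta[len(names):]
    return [
        max(names, key=lambda n: cosine(t, entry_theta[names.index(n)]))
        for t in post_theta
    ]


# Hypothetical toy data standing in for Wikipedia entries and blog posts.
entries = {
    "Cat": "cat kitten feline pet fur whiskers cat pet",
    "Compiler": "compiler code parser syntax program code compiler",
}
posts = [
    "my kitten is a playful pet with soft fur",
    "the parser in this compiler rejects my code",
]
labels = label_posts(entries, posts)
for post, label in zip(posts, labels):
    print(label, "<-", post)
```

On a corpus this small the sampled topics are noisy, so the assigned labels should be read as an illustration of the data flow, not as evidence of labeling quality.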

Embed

Wikipedia Quality

Yokomoto, Daisuke; Makita, Kensaku; Suzuki, Hiroko; Koike, Daichi; Utsuro, Takehito; Kawada, Yasuhide; Fukuhara, Tomohiro. (2012). "[[LDA-Based Topic Modeling in Labeling Blog Posts with Wikipedia Entries]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-29426-6_15.

English Wikipedia

{{cite journal |last1=Yokomoto |first1=Daisuke |last2=Makita |first2=Kensaku |last3=Suzuki |first3=Hiroko |last4=Koike |first4=Daichi |last5=Utsuro |first5=Takehito |last6=Kawada |first6=Yasuhide |last7=Fukuhara |first7=Tomohiro |title=LDA-Based Topic Modeling in Labeling Blog Posts with Wikipedia Entries |date=2012 |doi=10.1007/978-3-642-29426-6_15 |url=https://wikipediaquality.com/wiki/LDA-Based_topic_modeling_in_labeling_blog_posts_with_wikipedia_entries |journal=Springer, Berlin, Heidelberg}}

HTML

Yokomoto, Daisuke; Makita, Kensaku; Suzuki, Hiroko; Koike, Daichi; Utsuro, Takehito; Kawada, Yasuhide; Fukuhara, Tomohiro. (2012). &quot;<a href="https://wikipediaquality.com/wiki/LDA-Based_topic_modeling_in_labeling_blog_posts_with_wikipedia_entries">LDA-Based Topic Modeling in Labeling Blog Posts with Wikipedia Entries</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-29426-6_15.