Difference between revisions of "Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets"

From Wikipedia Quality
(Infobox)
(+ embed code)
Line 9: Line 9:
 
== Overview ==
 
As blog services and blog tools become more and more popular, people are increasingly able to express their own interests and opinions on the Web. Search engines are then used to access the information found in the blogosphere: given a search query, they return a ranked list of blog posts. However, a ranked list is usually not much help to a user who wants to quickly identify the blog posts that satisfy his or her information need. This is especially true when the search result is a mixture of blog posts that focus on various sub-topics. In such a situation, the framework of faceted search [8], which has been well studied in the [[information retrieval]] community, can be a solution. In this paper, the authors propose a framework for categorizing Korean blog posts according to their sub-topics, where the blog posts are collected from the Korean blogosphere for a given search query. In this framework, the sub-topic of each blog post is regarded as a facet of an initial topic keyword, and a facet is automatically assigned to each blog post. For example, Figure 1 illustrates a result of faceted search for the initial topic keyword “global warming” within the Korean blogosphere. In this result, the collected blog posts about “global warming” are categorized into facets by identifying each blogger’s interest in a blog post. Facets are assigned by using [[Wikipedia]] entries as a knowledge source, with each Wikipedia entry title serving as a facet label. In the evaluation, the authors achieve about 50 to 70% accuracy.
 
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Lim, Dongkwon; Yokomoto, Daisuke; Makita, Kensaku; Utsuro, Takehito; Fukuhara, Tomohiro. (2011). "[[Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets]]".
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Lim |first1=Dongkwon |last2=Yokomoto |first2=Daisuke |last3=Makita |first3=Kensaku |last4=Utsuro |first4=Takehito |last5=Fukuhara |first5=Tomohiro |title=Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets |date=2011 |url=https://wikipediaquality.com/wiki/Utilizing_Wikipedia_as_a_Knowledge_Source_in_Categorizing_Topic_Related_Korean_Blogs_into_Facets}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Lim, Dongkwon; Yokomoto, Daisuke; Makita, Kensaku; Utsuro, Takehito; Fukuhara, Tomohiro. (2011). &amp;quot;<a href="https://wikipediaquality.com/wiki/Utilizing_Wikipedia_as_a_Knowledge_Source_in_Categorizing_Topic_Related_Korean_Blogs_into_Facets">Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets</a>&amp;quot;.
 +
</nowiki>
 +
</code>

Revision as of 08:38, 6 June 2020


Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets
Authors: Dongkwon Lim, Daisuke Yokomoto, Kensaku Makita, Takehito Utsuro, Tomohiro Fukuhara
Publication date: 2011
Links: Original

Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets is a scientific work related to Wikipedia quality, published in 2011 and written by Dongkwon Lim, Daisuke Yokomoto, Kensaku Makita, Takehito Utsuro and Tomohiro Fukuhara.

Overview

As blog services and blog tools become more and more popular, people are increasingly able to express their own interests and opinions on the Web. Search engines are then used to access the information found in the blogosphere: given a search query, they return a ranked list of blog posts. However, a ranked list is usually not much help to a user who wants to quickly identify the blog posts that satisfy his or her information need. This is especially true when the search result is a mixture of blog posts that focus on various sub-topics. In such a situation, the framework of faceted search [8], which has been well studied in the information retrieval community, can be a solution. In this paper, the authors propose a framework for categorizing Korean blog posts according to their sub-topics, where the blog posts are collected from the Korean blogosphere for a given search query. In this framework, the sub-topic of each blog post is regarded as a facet of an initial topic keyword, and a facet is automatically assigned to each blog post. For example, Figure 1 illustrates a result of faceted search for the initial topic keyword “global warming” within the Korean blogosphere. In this result, the collected blog posts about “global warming” are categorized into facets by identifying each blogger’s interest in a blog post. Facets are assigned by using Wikipedia entries as a knowledge source, with each Wikipedia entry title serving as a facet label. In the evaluation, the authors achieve about 50 to 70% accuracy.
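The overview does not say how a Wikipedia entry is matched to a blog post, so the following is only an illustrative sketch of the general idea and not the authors' method. It assumes a simple bag-of-words cosine similarity between each post and the text of each candidate entry, and labels the post with the title of the best-matching entry. The function names, the whitespace-style tokenizer (a stand-in for a real Korean morphological analyzer), and the toy English data are all hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-word characters.

    Assumption: a real system for Korean would use a morphological analyzer.
    """
    return [t for t in re.split(r"\W+", text.lower()) if t]


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def assign_facet(post, entries):
    """Return the title of the entry most similar to the post.

    Returns None when the post shares no terms with any entry.
    """
    post_vec = Counter(tokenize(post))
    best_title, best_score = None, 0.0
    for title, body in entries.items():
        score = cosine(post_vec, Counter(tokenize(body)))
        if score > best_score:
            best_title, best_score = title, score
    return best_title


# Hypothetical toy data (in English for readability); not taken from the paper.
entries = {
    "Kyoto Protocol": "international treaty committing countries to reduce greenhouse gas emissions",
    "Sea level rise": "melting ice sheets and glaciers raise the ocean level and flood coastal cities",
}
posts = [
    "Our coastal town may flood as glaciers keep melting and the ocean rises",
    "Which countries signed the treaty to reduce emissions of greenhouse gas",
]

# Each post gets the title of its best-matching entry as its facet label.
facets = [assign_facet(p, entries) for p in posts]
```

In this sketch each entry title plays the role of a facet label for the topic keyword “global warming”, and a post with no overlapping terms is left unassigned rather than forced into a facet.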

Embed

Wikipedia Quality

Lim, Dongkwon; Yokomoto, Daisuke; Makita, Kensaku; Utsuro, Takehito; Fukuhara, Tomohiro. (2011). "[[Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets]]".

English Wikipedia

{{cite journal |last1=Lim |first1=Dongkwon |last2=Yokomoto |first2=Daisuke |last3=Makita |first3=Kensaku |last4=Utsuro |first4=Takehito |last5=Fukuhara |first5=Tomohiro |title=Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets |date=2011 |url=https://wikipediaquality.com/wiki/Utilizing_Wikipedia_as_a_Knowledge_Source_in_Categorizing_Topic_Related_Korean_Blogs_into_Facets}}

HTML

Lim, Dongkwon; Yokomoto, Daisuke; Makita, Kensaku; Utsuro, Takehito; Fukuhara, Tomohiro. (2011). &quot;<a href="https://wikipediaquality.com/wiki/Utilizing_Wikipedia_as_a_Knowledge_Source_in_Categorizing_Topic_Related_Korean_Blogs_into_Facets">Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets</a>&quot;.