Difference between revisions of "Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets"
Revision as of 08:38, 6 June 2020
Authors | Dongkwon Lim, Daisuke Yokomoto, Kensaku Makita, Takehito Utsuro, Tomohiro Fukuhara |
---|---|
Publication date | 2011 |
Links | Original |
Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets is a scientific work related to Wikipedia quality, published in 2011 and written by Dongkwon Lim, Daisuke Yokomoto, Kensaku Makita, Takehito Utsuro and Tomohiro Fukuhara.
Overview
As blog services and blog tools become more and more popular, people have been able to express their own interests and opinions on the Web. Search engines are then used to access the information found in the blogosphere: given a search query, a ranked list of blog posts is returned as the search result. However, a ranked list is usually not helpful for a user who wants to quickly identify the blog posts that satisfy his/her information need. This is especially true when the search result is a mixture of blog posts that focus on various sub-topics. In such a situation, the framework of faceted search [8], which has been well studied in the information retrieval community, can be a solution. In this paper, the authors propose a framework for categorizing Korean blog posts according to their sub-topics, where the blog posts are collected from the Korean blogosphere for a given search query. In this framework, the sub-topic of each blog post is regarded as a facet of the initial topic keyword, and a facet is automatically assigned to each blog post. For example, Figure 1 illustrates a result of faceted search for the initial topic keyword “global warming” within the Korean blogosphere. In this result, the collected blog posts regarding “global warming” are categorized into facets by identifying each blogger’s interest in a blog post. Facets are assigned by using Wikipedia entries as a knowledge source, with each Wikipedia entry title serving as a facet label. In the evaluation, the authors achieve about 50 to 70% accuracy.
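The overview does not say how a blog post is matched to a Wikipedia entry, so the following is only an illustrative sketch and not the authors' method. It assumes a simple bag-of-words cosine similarity between the post text and each entry's text, and returns the best-matching entry title as the facet label. The function names (`tokenize`, `cosine`, `assign_facet`, `group_by_facet`), the English sample data, and the whitespace-style tokenizer are all hypothetical; real Korean text would likely need a morphological analyzer.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Crude tokenizer (assumption): lowercase, split on non-word characters.
    return re.findall(r"\w+", text.lower())

def cosine(a, b):
    # Cosine similarity between two term-frequency Counters.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def assign_facet(post, entries):
    # entries maps a Wikipedia entry title (the facet label) to entry text.
    # Returns the title with the highest similarity, or None if nothing overlaps.
    post_vec = Counter(tokenize(post))
    scores = {title: cosine(post_vec, Counter(tokenize(body)))
              for title, body in entries.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None

def group_by_facet(posts, entries):
    # Group posts under their assigned facet label (None = unassigned).
    groups = {}
    for post in posts:
        groups.setdefault(assign_facet(post, entries), []).append(post)
    return groups

# Hypothetical sample data for the topic keyword "global warming".
entries = {
    "Greenhouse gas": "carbon dioxide methane emissions greenhouse gas atmosphere",
    "Sea level rise": "sea level rise glaciers melting ocean coast",
}
posts = [
    "Melting glaciers raise the sea level along the coast",
    "Factories release carbon dioxide and methane emissions",
    "My cat sleeps all day",
]

for facet, members in group_by_facet(posts, entries).items():
    print(facet, members)
```

Posts that share no terms with any entry are left unassigned (`None`) rather than forced into a facet; whether the original framework handles such posts this way is not stated in the overview.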
Embed
Wikipedia Quality
Lim, Dongkwon; Yokomoto, Daisuke; Makita, Kensaku; Utsuro, Takehito; Fukuhara, Tomohiro. (2011). "[[Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets]]".
English Wikipedia
{{cite journal |last1=Lim |first1=Dongkwon |last2=Yokomoto |first2=Daisuke |last3=Makita |first3=Kensaku |last4=Utsuro |first4=Takehito |last5=Fukuhara |first5=Tomohiro |title=Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets |date=2011 |url=https://wikipediaquality.com/wiki/Utilizing_Wikipedia_as_a_Knowledge_Source_in_Categorizing_Topic_Related_Korean_Blogs_into_Facets}}
HTML
Lim, Dongkwon; Yokomoto, Daisuke; Makita, Kensaku; Utsuro, Takehito; Fukuhara, Tomohiro. (2011). "<a href="https://wikipediaquality.com/wiki/Utilizing_Wikipedia_as_a_Knowledge_Source_in_Categorizing_Topic_Related_Korean_Blogs_into_Facets">Utilizing Wikipedia as a Knowledge Source in Categorizing Topic Related Korean Blogs into Facets</a>".