Association Thesaurus Construction Methods based on Link Co-Occurrence Analysis for Wikipedia

From Wikipedia Quality
Revision as of 11:58, 17 June 2020 by Sofia (talk | contribs) (Embed for English Wikipedia, HTML)
Jump to: navigation, search


Association Thesaurus Construction Methods based on Link Co-Occurrence Analysis for Wikipedia
Authors
Masahiro Ito
Kotaro Nakayama
Takahiro Hara
Shojiro Nishio
Publication date
2008
DOI
10.1145/1458082.1458191
Links
Original

Association Thesaurus Construction Methods based on Link Co-Occurrence Analysis for Wikipedia - scientific work related to Wikipedia quality published in 2008, written by Masahiro Ito, Kotaro Nakayama, Takahiro Hara and Shojiro Nishio.

Overview

Wikipedia, a huge scale Web based encyclopedia, attracts great attention as an invaluable corpus for knowledge extraction because it has various impressive characteristics such as a huge number of articles, live updates, a dense link structure, brief anchor texts and URL identification for concepts. Authors have already proved that authors can use Wikipedia to construct a huge scale accurate association thesaurus. The association thesaurus authors constructed covers almost 1.3 million concepts and its accuracy is proved in detailed experiments. However, authors still need scalable methods to analyze the huge number of Web pages and hyperlinks among articles in the Web based encyclopedia. In this paper, authors propose a scalable method for constructing an association thesaurus from Wikipedia based on link co-occurrences. Link co-occurrence analysis is more scalable than link structure analysis because it is a one-pass process. Authors also propose integration method of tfidf and link co-occurrence analysis. Experimental results show that both proposed methods are more accurate and scalable than conventional methods. Furthermore, the integration of tfidf achieved higher accuracy than using only link co-occurrences.

Embed

Wikipedia Quality

Ito, Masahiro; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2008). "[[Association Thesaurus Construction Methods based on Link Co-Occurrence Analysis for Wikipedia]]".DOI: 10.1145/1458082.1458191.

English Wikipedia

{{cite journal |last1=Ito |first1=Masahiro |last2=Nakayama |first2=Kotaro |last3=Hara |first3=Takahiro |last4=Nishio |first4=Shojiro |title=Association Thesaurus Construction Methods based on Link Co-Occurrence Analysis for Wikipedia |date=2008 |doi=10.1145/1458082.1458191 |url=https://wikipediaquality.com/wiki/Association_Thesaurus_Construction_Methods_based_on_Link_Co-Occurrence_Analysis_for_Wikipedia}}

HTML

Ito, Masahiro; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Association_Thesaurus_Construction_Methods_based_on_Link_Co-Occurrence_Analysis_for_Wikipedia">Association Thesaurus Construction Methods based on Link Co-Occurrence Analysis for Wikipedia</a>&quot;.DOI: 10.1145/1458082.1458191.