Concept Vector Extraction from Wikipedia Category Network

Authors
Masumi Shirakawa
Kotaro Nakayama
Takahiro Hara
Shojiro Nishio
Publication date
2009
ISBN
978-160558405-8
DOI
10.1145/1516241.1516255

Concept Vector Extraction from Wikipedia Category Network is a scientific work about Wikipedia quality, published in 2009 and written by Masumi Shirakawa, Kotaro Nakayama, Takahiro Hara and Shojiro Nishio.

Overview

Applications such as document classification and information retrieval have demonstrated the usefulness of machine-readable taxonomies. One of the main approaches in automated taxonomy extraction research is Web mining based on statistical NLP (Natural Language Processing), and a significant number of studies have been conducted. However, existing work on automatic dictionary building suffers from accuracy problems caused by the technical limitations of statistical NLP and by noisy data on the WWW. To address these problems, the authors focus on mining Wikipedia, a large-scale Web encyclopedia. Wikipedia offers a large body of high-quality articles and a category system, because users around the world edit and refine both daily. Using Wikipedia avoids the loss of accuracy that comes from NLP. However, affiliation relations cannot be extracted simply by descending the category system automatically, because the category system in Wikipedia is a network rather than a tree. The authors propose concept vectorization methods that are applicable to the category network in Wikipedia.
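To illustrate why a network-shaped category system calls for a vector rather than a single tree path, here is a minimal Python sketch. It is not the authors' method: the function name `concept_vector`, the `decay` and `max_depth` parameters, the even split of weight among parent categories, and the toy category data are all assumptions made for illustration only.

```python
from collections import defaultdict


def concept_vector(start, parents, decay=0.5, max_depth=3):
    """Spread weight upward from `start` through a category network.

    `parents` maps a node to the list of categories it belongs to.
    A node may have several parents, and the graph may contain cycles,
    so the walk is bounded by `max_depth` instead of assuming a tree.
    (Hypothetical weighting scheme, chosen only for this sketch.)
    """
    vector = defaultdict(float)
    frontier = {start: 1.0}
    for _ in range(max_depth):
        next_frontier = defaultdict(float)
        for node, weight in frontier.items():
            node_parents = parents.get(node, [])
            for parent in node_parents:
                # Each hop attenuates the weight and splits it among parents.
                next_frontier[parent] += weight * decay / len(node_parents)
        # Weights arriving over different paths add up in the same dimension.
        for category, weight in next_frontier.items():
            vector[category] += weight
        frontier = next_frontier
    return dict(vector)


# Toy data, invented for this example (not taken from Wikipedia).
toy_parents = {
    "Jaguar": ["Felines", "Car brands"],
    "Felines": ["Mammals"],
    "Car brands": ["Companies", "Vehicles"],
    "Mammals": ["Animals"],
}

vec = concept_vector("Jaguar", toy_parents)
for category, weight in sorted(vec.items(), key=lambda kv: -kv[1]):
    print(f"{category}: {weight}")
```

In this toy graph the article belongs to two unrelated branches at once, which a simple descent of a tree could not express. The resulting vector keeps both branches, with weights that shrink as the categories become more distant.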

Embed

Wikipedia Quality

Shirakawa, Masumi; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2009). "[[Concept Vector Extraction from Wikipedia Category Network]]". Journal of Documentation Volume 65, Issue 6, 16 October 2009, pp. 977-996. ISBN: 978-160558405-8. DOI: 10.1145/1516241.1516255.

English Wikipedia

{{cite journal |last1=Shirakawa |first1=Masumi |last2=Nakayama |first2=Kotaro |last3=Hara |first3=Takahiro |last4=Nishio |first4=Shojiro |title=Concept Vector Extraction from Wikipedia Category Network |date=2009 |isbn=978-160558405-8 |doi=10.1145/1516241.1516255 |url=https://wikipediaquality.com/wiki/Concept_Vector_Extraction_from_Wikipedia_Category_Network |journal=Journal of Documentation Volume 65, Issue 6, 16 October 2009, pp. 977-996}}

HTML

Shirakawa, Masumi; Nakayama, Kotaro; Hara, Takahiro; Nishio, Shojiro. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Concept_Vector_Extraction_from_Wikipedia_Category_Network">Concept Vector Extraction from Wikipedia Category Network</a>&quot;. Journal of Documentation Volume 65, Issue 6, 16 October 2009, pp. 977-996. ISBN: 978-160558405-8. DOI: 10.1145/1516241.1516255.