Difference between revisions of "Mining Concepts from Wikipedia for Ontology Construction"

From Wikipedia Quality
Jump to: navigation, search
(wikilinks)
(Infobox work)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Mining Concepts from Wikipedia for Ontology Construction
 +
| date = 2009
 +
| authors = [[Gaoying Cui]]<br />[[Qin Lu]]<br />[[Wenjie Li]]<br />[[Yirong Chen]]
 +
| doi = 10.1109/WI-IAT.2009.284
 +
| link = http://dl.acm.org/citation.cfm?id=1632313
 +
}}
 
'''Mining Concepts from Wikipedia for Ontology Construction''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Gaoying Cui]], [[Qin Lu]], [[Wenjie Li]] and [[Yirong Chen]].
 
'''Mining Concepts from Wikipedia for Ontology Construction''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Gaoying Cui]], [[Qin Lu]], [[Wenjie Li]] and [[Yirong Chen]].
  
 
== Overview ==
 
== Overview ==
 
An [[ontology]] is a structured knowledgebase of concepts organized by relations among them. But concepts are usually mixed with their instances in the corpora for knowledge extraction. Concepts and their corresponding instances share similar [[features]] and are difficult to distinguish. In this paper, a novel approach is proposed to comprehensively obtain concepts with the help of definition sentences and Category Labels in [[Wikipedia]] pages. N-gram statistics and other NLP knowledge are used to help extracting appropriate concepts. The proposed method identified nearly 50,000 concepts from about 700,000 Wiki pages. The precision reaching 78.5% makes it an effective approach to mine concepts from Wikipedia for ontology construction.
 
An [[ontology]] is a structured knowledgebase of concepts organized by relations among them. But concepts are usually mixed with their instances in the corpora for knowledge extraction. Concepts and their corresponding instances share similar [[features]] and are difficult to distinguish. In this paper, a novel approach is proposed to comprehensively obtain concepts with the help of definition sentences and Category Labels in [[Wikipedia]] pages. N-gram statistics and other NLP knowledge are used to help extracting appropriate concepts. The proposed method identified nearly 50,000 concepts from about 700,000 Wiki pages. The precision reaching 78.5% makes it an effective approach to mine concepts from Wikipedia for ontology construction.

Revision as of 12:25, 27 October 2019


Mining Concepts from Wikipedia for Ontology Construction
Authors
Gaoying Cui
Qin Lu
Wenjie Li
Yirong Chen
Publication date
2009
DOI
10.1109/WI-IAT.2009.284
Links
Original

Mining Concepts from Wikipedia for Ontology Construction - scientific work related to Wikipedia quality published in 2009, written by Gaoying Cui, Qin Lu, Wenjie Li and Yirong Chen.

Overview

An ontology is a structured knowledgebase of concepts organized by relations among them. But concepts are usually mixed with their instances in the corpora for knowledge extraction. Concepts and their corresponding instances share similar features and are difficult to distinguish. In this paper, a novel approach is proposed to comprehensively obtain concepts with the help of definition sentences and Category Labels in Wikipedia pages. N-gram statistics and other NLP knowledge are used to help extracting appropriate concepts. The proposed method identified nearly 50,000 concepts from about 700,000 Wiki pages. The precision reaching 78.5% makes it an effective approach to mine concepts from Wikipedia for ontology construction.