Learning to Discover Subsumptions Between Software Engineering Concepts in Wikipedia

From Wikipedia Quality
Revision as of 07:20, 25 January 2021 by Ecclesia (talk | contribs) (+ categories)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Learning to Discover Subsumptions Between Software Engineering Concepts in Wikipedia
Authors
Xiang Dong
Kai Chen
Jiangang Zhu
Beijun Shen
Publication date
2016
DOI
10.18293/SEKE2016-021
Links
Original

Learning to Discover Subsumptions Between Software Engineering Concepts in Wikipedia - scientific work related to Wikipedia quality published in 2016, written by Xiang Dong, Kai Chen, Jiangang Zhu and Beijun Shen.

Overview

Wikipedia contains large-scale concepts and rich semantic information. A number of knowledge base construction projects such as WikiTaxonomy, DBpedia, and YAGO have acquired data from Wikipedia. Despite the huge amount of relations in Wikipedia, the semantic relations (i.e. subsumptions) between domain concepts are rather sparse, especially in software engineering (SE) area. Hence, it is difficult to derive a software engineering knowledge base directly from Wikipedia. Meanwhile, domain knowledge base has become indispensable to a growing number of applications in software engineering. So the discov- ery of missing semantic relations between software engineering concepts in Wikipedia is essential. In this paper, authors propose an approach to automatically discovering the missing subsumption relations between software engineering concepts. Specifically, authors extract the SE domain concepts from Wikipedia firstly. And secondly, authors design a machine learning based algorithm with some novel features to calculate the semantic relevancy between concepts. Thirdly, authors offer and utilize a semi-supervised model to incorporate the features, which discovers the SE subsumptions. Experimental results show that approach can effectively find the missing subsumption relations between software engineering concepts. Finally, authors build a taxonomy which contains 193,593 concepts together with 357,662 subsumption relations. Compared with the taxonomies which are extracted from general-purpose knowledge bases such as WikiTaxonomy, YAGO and Schema.org, dataset has a larger scale in software engineering domain. Index Terms—Subsumption Extraction, Software Engineering, Wikipedia

Embed

Wikipedia Quality

Dong, Xiang; Chen, Kai; Zhu, Jiangang; Shen, Beijun. (2016). "[[Learning to Discover Subsumptions Between Software Engineering Concepts in Wikipedia]]".DOI: 10.18293/SEKE2016-021.

English Wikipedia

{{cite journal |last1=Dong |first1=Xiang |last2=Chen |first2=Kai |last3=Zhu |first3=Jiangang |last4=Shen |first4=Beijun |title=Learning to Discover Subsumptions Between Software Engineering Concepts in Wikipedia |date=2016 |doi=10.18293/SEKE2016-021 |url=https://wikipediaquality.com/wiki/Learning_to_Discover_Subsumptions_Between_Software_Engineering_Concepts_in_Wikipedia}}

HTML

Dong, Xiang; Chen, Kai; Zhu, Jiangang; Shen, Beijun. (2016). &quot;<a href="https://wikipediaquality.com/wiki/Learning_to_Discover_Subsumptions_Between_Software_Engineering_Concepts_in_Wikipedia">Learning to Discover Subsumptions Between Software Engineering Concepts in Wikipedia</a>&quot;.DOI: 10.18293/SEKE2016-021.