Difference between revisions of "An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs"
(Infobox) |
(Adding embed) |
||
Line 10: | Line 10: | ||
== Overview == | == Overview == | ||
Wikipedia is the largest online encyclopedia known to date. Its rich content and semi-structured nature has made it into a very valuable research tool used for classification, [[information extraction]], and semantic annotation, among others. Many applications can benefit from the presence of a topic hierarchy in [[Wikipedia]]. However, what Wikipedia currently offers is a category graph built through hierarchical category links the semantics of which are un-defined. Because of this lack of semantics, a sub-category in Wikipedia does not necessarily comply with the concept of a sub-category in a hierarchy. Instead, all it signifies is that there is some sort of relationship between the parent category and its sub-category. As a result, traversing the category links of any given category can often result in surprising results. For example, following the category of “Computing” down its sub-category links, the totally unrelated category of “Theology” appears. In this paper, authors introduce a novel algorithm that through measuring the semantic [[relatedness]] between any given Wikipedia category and nodes in its sub-graph is capable of extracting a category hierarchy containing only nodes that are relevant to the parent category. The algorithm has been evaluated by comparing its output with a gold standard data set. The experimental setup and results are presented. | Wikipedia is the largest online encyclopedia known to date. Its rich content and semi-structured nature has made it into a very valuable research tool used for classification, [[information extraction]], and semantic annotation, among others. Many applications can benefit from the presence of a topic hierarchy in [[Wikipedia]]. However, what Wikipedia currently offers is a category graph built through hierarchical category links the semantics of which are un-defined. Because of this lack of semantics, a sub-category in Wikipedia does not necessarily comply with the concept of a sub-category in a hierarchy. Instead, all it signifies is that there is some sort of relationship between the parent category and its sub-category. As a result, traversing the category links of any given category can often result in surprising results. For example, following the category of “Computing” down its sub-category links, the totally unrelated category of “Theology” appears. In this paper, authors introduce a novel algorithm that through measuring the semantic [[relatedness]] between any given Wikipedia category and nodes in its sub-graph is capable of extracting a category hierarchy containing only nodes that are relevant to the parent category. The algorithm has been evaluated by comparing its output with a gold standard data set. The experimental setup and results are presented. | ||
+ | |||
+ | == Embed == | ||
+ | === Wikipedia Quality === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Hejazy, Khaled A.; El-Beltagy, Samhaa R.. (2013). "[[An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-36981-0_8. | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === English Wikipedia === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | {{cite journal |last1=Hejazy |first1=Khaled A. |last2=El-Beltagy |first2=Samhaa R. |title=An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs |date=2013 |doi=10.1007/978-3-642-36981-0_8 |url=https://wikipediaquality.com/wiki/An_Approach_for_Deriving_Semantically_Related_Category_Hierarchies_from_Wikipedia_Category_Graphs |journal=Springer, Berlin, Heidelberg}} | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === HTML === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Hejazy, Khaled A.; El-Beltagy, Samhaa R.. (2013). &quot;<a href="https://wikipediaquality.com/wiki/An_Approach_for_Deriving_Semantically_Related_Category_Hierarchies_from_Wikipedia_Category_Graphs">An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-36981-0_8. | ||
+ | </nowiki> | ||
+ | </code> |
Revision as of 07:47, 19 November 2020
Authors | Khaled A. Hejazy Samhaa R. El-Beltagy |
---|---|
Publication date | 2013 |
DOI | 10.1007/978-3-642-36981-0_8 |
Links | Original |
An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs - scientific work related to Wikipedia quality published in 2013, written by Khaled A. Hejazy and Samhaa R. El-Beltagy.
Overview
Wikipedia is the largest online encyclopedia known to date. Its rich content and semi-structured nature has made it into a very valuable research tool used for classification, information extraction, and semantic annotation, among others. Many applications can benefit from the presence of a topic hierarchy in Wikipedia. However, what Wikipedia currently offers is a category graph built through hierarchical category links the semantics of which are un-defined. Because of this lack of semantics, a sub-category in Wikipedia does not necessarily comply with the concept of a sub-category in a hierarchy. Instead, all it signifies is that there is some sort of relationship between the parent category and its sub-category. As a result, traversing the category links of any given category can often result in surprising results. For example, following the category of “Computing” down its sub-category links, the totally unrelated category of “Theology” appears. In this paper, authors introduce a novel algorithm that through measuring the semantic relatedness between any given Wikipedia category and nodes in its sub-graph is capable of extracting a category hierarchy containing only nodes that are relevant to the parent category. The algorithm has been evaluated by comparing its output with a gold standard data set. The experimental setup and results are presented.
Embed
Wikipedia Quality
Hejazy, Khaled A.; El-Beltagy, Samhaa R.. (2013). "[[An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-36981-0_8.
English Wikipedia
{{cite journal |last1=Hejazy |first1=Khaled A. |last2=El-Beltagy |first2=Samhaa R. |title=An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs |date=2013 |doi=10.1007/978-3-642-36981-0_8 |url=https://wikipediaquality.com/wiki/An_Approach_for_Deriving_Semantically_Related_Category_Hierarchies_from_Wikipedia_Category_Graphs |journal=Springer, Berlin, Heidelberg}}
HTML
Hejazy, Khaled A.; El-Beltagy, Samhaa R.. (2013). "<a href="https://wikipediaquality.com/wiki/An_Approach_for_Deriving_Semantically_Related_Category_Hierarchies_from_Wikipedia_Category_Graphs">An Approach for Deriving Semantically Related Category Hierarchies from Wikipedia Category Graphs</a>". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-36981-0_8.