Catriple: Extracting Triples from Wikipedia Categories



Authors: Qiaoling Liu, Kaifeng Xu, Lei Zhang, Haofen Wang, Yong Yu, Yue Pan
Publication date: 2008
DOI: 10.1007/978-3-540-89704-0_23

Catriple: Extracting Triples from Wikipedia Categories is a scientific work related to Wikipedia quality, published in 2008 and written by Qiaoling Liu, Kaifeng Xu, Lei Zhang, Haofen Wang, Yong Yu and Yue Pan.

Overview

As an important step towards bootstrapping the Semantic Web, many efforts have been made to extract triples from Wikipedia because of its wide coverage, good organization and rich knowledge. One important kind of triple describes Wikipedia articles and their non-isa properties, e.g. (Beijing, country, China). Previous work has tried to extract such triples from Wikipedia infoboxes, article text and categories. The infobox-based and text-based extraction methods depend on infoboxes and suffer from low article coverage. In contrast, the category-based extraction methods exploit the widespread categories. However, they rely on predefined properties, which takes considerable manual effort and explores only a small part of the knowledge in the categories. This paper automatically extracts properties and triples from the less explored Wikipedia categories so as to achieve wider article coverage with less manual effort. The authors realize this goal by utilizing the syntax and semantics of super-sub category pairs in Wikipedia. Their prototype implementation outputs about 10M triples at 12 confidence levels ranging from 47.0% to 96.4%, covering 78.2% of Wikipedia articles. Among them, 1.27M triples have a confidence of 96.4%. Applications can choose the triples whose confidence suits their needs.
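The idea of reading a property and a value off a super-sub category pair can be illustrated with a small sketch. This is not the authors' implementation: the single naming pattern it handles (a supercategory of the form "X by P" with a subcategory of the form "V X"), the function names, and the example category and article names are all assumptions chosen for illustration, and it assigns no confidence levels.

```python
import re
from typing import List, Optional, Tuple

# Assumed pattern (illustrative only): supercategory "<head> by <property>",
# e.g. "Songs by artist".
BY_PATTERN = re.compile(r"^(?P<head>.+) by (?P<prop>.+)$")


def extract_property_value(super_cat: str, sub_cat: str) -> Optional[Tuple[str, str]]:
    """Return (property, value) for a super-sub category pair, or None.

    Hypothetical helper: handles only the "<head> by <property>" /
    "<value> <head>" naming pattern.
    """
    match = BY_PATTERN.match(super_cat)
    if match is None:
        return None
    head = match.group("head")
    prop = match.group("prop")
    # The subcategory must end with the head noun, compared case-insensitively.
    suffix = " " + head.lower()
    if not sub_cat.lower().endswith(suffix):
        return None
    value = sub_cat[: -len(suffix)].strip()
    if not value:
        return None
    return prop, value


def triples_for_articles(
    super_cat: str, sub_cat: str, articles: List[str]
) -> List[Tuple[str, str, str]]:
    """Attach the extracted (property, value) to every article in the subcategory."""
    pair = extract_property_value(super_cat, sub_cat)
    if pair is None:
        return []
    prop, value = pair
    return [(article, prop, value) for article in articles]


if __name__ == "__main__":
    # Illustrative inputs, not data from the paper.
    print(triples_for_articles("Songs by artist", "The Beatles songs", ["Hey Jude"]))
```

A regular expression keeps the sketch short, but it only captures the surface syntax of one pattern. Pairs that do not fit it simply yield no triples, which is one reason a real system would need further patterns and some way to score how reliable each extraction is.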

Embed

Wikipedia Quality

Liu, Qiaoling; Xu, Kaifeng; Zhang, Lei; Wang, Haofen; Yu, Yong; Pan, Yue. (2008). "[[Catriple: Extracting Triples from Wikipedia Categories]]". Springer-Verlag. DOI: 10.1007/978-3-540-89704-0_23.

English Wikipedia

{{cite journal |last1=Liu |first1=Qiaoling |last2=Xu |first2=Kaifeng |last3=Zhang |first3=Lei |last4=Wang |first4=Haofen |last5=Yu |first5=Yong |last6=Pan |first6=Yue |title=Catriple: Extracting Triples from Wikipedia Categories |date=2008 |doi=10.1007/978-3-540-89704-0_23 |url=https://wikipediaquality.com/wiki/Catriple:_Extracting_Triples_from_Wikipedia_Categories |journal=Springer-Verlag}}

HTML

Liu, Qiaoling; Xu, Kaifeng; Zhang, Lei; Wang, Haofen; Yu, Yong; Pan, Yue. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Catriple:_Extracting_Triples_from_Wikipedia_Categories">Catriple: Extracting Triples from Wikipedia Categories</a>&quot;. Springer-Verlag. DOI: 10.1007/978-3-540-89704-0_23.