Extracting Events from Wikipedia as Rdf Triples Linked to Widespread Semantic Web Datasets

From Wikipedia Quality
Revision as of 21:13, 24 May 2019 by Agnieszka (talk | contribs) (Information about: Extracting Events from Wikipedia as Rdf Triples Linked to Widespread Semantic Web Datasets)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Extracting Events from Wikipedia as Rdf Triples Linked to Widespread Semantic Web Datasets - scientific work related to Wikipedia quality published in 2011, written by Carlo Aliprandi, Francesco Ronzano, Andrea Marchetti, Maurizio Tesconi and Salvatore Minutoli.

Overview

Many attempts have been made to extract structured data from Web resources, exposing them as RDF triples and interlinking them with other RDF datasets: in this way it is possible to create clouds of highly integrated Semantic Web data collections. In this paper authors describe an approach to enhance the extraction of semantic contents from unstructured textual documents, in particular considering Wikipedia articles and focusing on event mining. Starting from the deep parsing of a set of English Wikipedia articles, authors produce a semantic annotation compliant with the Knowledge Annotation Format (KAF). Authors extract events from the KAF semantic annotation and then authors structure each event as a set of RDF triples linked to both DBpedia and WordNet. Authors point out examples of automatically mined events, providing some general evaluation of how approach may discover new events and link them to existing contents.