Acquiring Thesauri from Wikis by Exploiting Domain Models and Lexical Substitution

From Wikipedia Quality
Revision as of 23:43, 3 July 2018 by Librarian (talk | contribs) (New scientific work)
Authors
Claudio Giuliano
Alfio Massimiliano Gliozzo
Aldo Gangemi
Kateryna Tymoshenko
Publication date
2010
ISSN
03029743
ISBN
3642134882;978-364213488-3
DOI
10.1007/978-3-642-13489-0_9
Acquiring Thesauri from Wikis by Exploiting Domain Models and Lexical Substitution is a scientific work about Wikipedia quality, published in 2010 and written by Claudio Giuliano, Alfio Massimiliano Gliozzo, Aldo Gangemi and Kateryna Tymoshenko.

Overview

Acquiring structured data from wikis is a problem of increasing interest in knowledge engineering and the Semantic Web. Collaboratively developed resources keep growing over time, are of high quality, and are constantly updated. One problem of particular interest in this area is extracting thesauri from wikis. A thesaurus is a resource that lists words grouped together according to similarity of meaning, generally organized into sets of synonyms. Thesauri are useful for a large variety of applications, including information retrieval and knowledge engineering. Most information in wikis is expressed by means of natural language texts and internal links among Web pages, the so-called wikilinks. In this paper, an innovative method for inducing thesauri from Wikipedia is presented. It leverages the structure of Wikipedia to extract concepts and the terms denoting them, obtaining a thesaurus that can be profitably used in applications. The method considerably improves precision and recall when it is applied to re-rank the output of a state-of-the-art baseline approach. Finally, the authors discuss how to represent the extracted results in RDF/OWL, in line with existing good practices.
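As an illustration of what a thesaurus organized into sets of synonyms might look like once expressed in RDF, the following minimal sketch serializes a small concept-to-terms mapping as N-Triples. The overview does not say which vocabulary the authors use, so the choice of SKOS labels here is an assumption, and the `example.org` namespace, the `to_ntriples` function and the sample data are hypothetical.

```python
from typing import Dict, List

# Hypothetical namespace for illustration; not taken from the paper.
BASE = "http://example.org/thesaurus/"
# Assumption: SKOS is used for labels. The paper's actual vocabulary is not stated here.
SKOS = "http://www.w3.org/2004/02/skos/core#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(text: str) -> str:
    """Escape backslashes and double quotes for an N-Triples string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_ntriples(thesaurus: Dict[str, List[str]], lang: str = "en") -> List[str]:
    """Serialize {concept_id: [preferred term, synonym, ...]} as N-Triples lines."""
    lines: List[str] = []
    for concept_id, terms in thesaurus.items():
        if not terms:
            continue  # a concept without terms has nothing to label
        subject = f"<{BASE}{concept_id}>"
        lines.append(f"{subject} <{RDF_TYPE}> <{SKOS}Concept> .")
        # The first term is treated as the preferred label, the rest as synonyms.
        lines.append(f'{subject} <{SKOS}prefLabel> "{escape_literal(terms[0])}"@{lang} .')
        for term in terms[1:]:
            lines.append(f'{subject} <{SKOS}altLabel> "{escape_literal(term)}"@{lang} .')
    return lines


if __name__ == "__main__":
    # Made-up sample entry: one concept with three terms denoting it.
    example = {"Car": ["car", "automobile", "motorcar"]}
    for line in to_ntriples(example):
        print(line)
```

Each concept becomes one resource carrying a preferred label and any number of alternative labels, which mirrors the synonym-set view of a thesaurus described above. A real pipeline would more likely rely on an RDF library than on hand-built strings.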

Embed

Wikipedia Quality

Giuliano, Claudio; Gliozzo, Alfio Massimiliano; Gangemi, Aldo; Tymoshenko, Kateryna. (2010). "[[Acquiring Thesauri from Wikis by Exploiting Domain Models and Lexical Substitution]]". Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 6089 LNCS, Issue PART 2, 2010, pp. 121-135. ISBN: 3642134882;978-364213488-3. ISSN: 03029743. DOI: 10.1007/978-3-642-13489-0_9.

English Wikipedia

{{cite journal |last1=Giuliano |first1=Claudio |last2=Gliozzo |first2=Alfio Massimiliano |last3=Gangemi |first3=Aldo |last4=Tymoshenko |first4=Kateryna |title=Acquiring Thesauri from Wikis by Exploiting Domain Models and Lexical Substitution |date=2010 |isbn=3642134882;978-364213488-3 |issn=03029743 |doi=10.1007/978-3-642-13489-0_9 |url=https://wikipediaquality.com/wiki/Acquiring_Thesauri_from_Wikis_by_Exploiting_Domain_Models_and_Lexical_Substitution |journal=Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 6089 LNCS, Issue PART 2, 2010, pp. 121-135}}

HTML

Giuliano, Claudio; Gliozzo, Alfio Massimiliano; Gangemi, Aldo; Tymoshenko, Kateryna. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Acquiring_Thesauri_from_Wikis_by_Exploiting_Domain_Models_and_Lexical_Substitution">Acquiring Thesauri from Wikis by Exploiting Domain Models and Lexical Substitution</a>&quot;. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 6089 LNCS, Issue PART 2, 2010, pp. 121-135. ISBN: 3642134882;978-364213488-3. ISSN: 03029743. DOI: 10.1007/978-3-642-13489-0_9.