A Hybrid Model for Learning Semantic Relatedness Using Wikipedia-Based Features

From Wikipedia Quality
Revision as of 09:33, 26 November 2019 by Agnieszka

A Hybrid Model for Learning Semantic Relatedness Using Wikipedia-Based Features is a scientific work related to Wikipedia quality, published in 2014 and written by Shahida Jabeen, Xiaoying Gao and Peter Andreae.

Overview

Semantic relatedness computation is the task of quantifying the degree of relatedness of two concepts. The performance of existing approaches to computing semantic relatedness is highly dependent on particular aspects of relatedness. For instance, taxonomy-based approaches aim at computing similarity, which is a special case of semantic relatedness. On the other hand, corpus-based approaches focus on the associative relations of words by taking their distributional features into account. Based on the assumption that different aspects of knowledge sources cover different kinds of semantic relations, this paper presents a hybrid model for computing semantic relatedness of words using new features extracted from various aspects of Wikipedia. The focus of this paper is on finding the optimal feature combination(s) that enhance the performance of the hybrid model. The empirical evaluation on benchmark datasets has shown that hybrid features perform better than single features by providing a complementary coverage of semantic relations, leading to improved correlation with human judgments.
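The search for a feature combination that correlates best with human judgments can be illustrated with a minimal sketch. It is not the authors' model: the feature names, the toy scores, and the choice to combine features by plain averaging are all assumptions made for illustration, and the paper itself learns the combination rather than averaging. The sketch scores every non-empty subset of features by its Spearman rank correlation with human ratings.

```python
from itertools import combinations
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / (var_x * var_y) ** 0.5


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


def best_feature_subset(features, human_scores):
    """Score every non-empty feature subset against human judgments.

    Assumption: a subset's relatedness score for a word pair is the
    unweighted mean of its member features (a stand-in for a learned model).
    """
    results = {}
    names = sorted(features)
    for size in range(1, len(names) + 1):
        for subset in combinations(names, size):
            combined = [
                mean(features[name][i] for name in subset)
                for i in range(len(human_scores))
            ]
            results[subset] = spearman(combined, human_scores)
    best = max(results, key=results.get)
    return best, results


# Toy data, invented for illustration: five word pairs, two hypothetical features.
human = [9.0, 7.5, 3.0, 1.0, 5.0]
toy_features = {
    "taxonomy": [0.9, 0.2, 0.4, 0.1, 0.5],
    "distributional": [0.3, 0.8, 0.1, 0.2, 0.6],
}
best, scores = best_feature_subset(toy_features, human)
for subset, rho in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(subset, round(rho, 3))
```

Exhaustive enumeration is only practical for a handful of features, since the number of subsets grows as 2^n; with a larger feature set, a greedy or learned selection would be the more realistic choice.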