Expanding Textual Entailment Corpora from Wikipedia Using Co-Training


Expanding Textual Entailment Corpora from Wikipedia Using Co-Training is a scientific work related to Wikipedia quality, published in 2010 and written by Fabio Massimo Zanzotto and Marco Pennacchiotti.

Overview

In this paper, the authors propose a novel method to automatically extract large textual entailment datasets that are homogeneous to existing ones. The key idea combines two intuitions: (1) using Wikipedia to extract a large set of textual entailment pairs; (2) applying semi-supervised machine learning methods to make the extracted dataset homogeneous to the existing ones. The authors report empirical evidence that the method successfully expands existing textual entailment corpora.
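
The title names co-training as the semi-supervised method. As a rough illustration of the general technique only (not the authors' implementation), the sketch below shows a generic co-training loop: two classifiers, each seeing a different "view" of a candidate pair, take turns labeling the unlabeled pairs they are most confident about and adding them to the shared training set. The two numeric views, the nearest-centroid classifier, and all function names are assumptions made for this example; the paper's actual features and learners are not described in this summary.

```python
from statistics import mean


def fit_centroids(xs, ys):
    """Return {label: centroid} for 1-D feature values xs with labels ys."""
    return {lab: mean(x for x, y in zip(xs, ys) if y == lab) for lab in set(ys)}


def predict(centroids, x):
    """Return (label, confidence) for a 1-D feature value x.

    Confidence is the margin between the distance to the second-nearest
    centroid and the distance to the nearest one.
    """
    ranked = sorted(centroids, key=lambda lab: abs(x - centroids[lab]))
    best = ranked[0]
    if len(ranked) < 2:
        return best, 0.0
    margin = abs(x - centroids[ranked[1]]) - abs(x - centroids[best])
    return best, margin


def co_train(labeled, unlabeled, rounds=5, per_round=1):
    """Expand a labeled set using two views (hypothetical features).

    labeled:   list of ((view_a, view_b), label) seed examples
    unlabeled: list of (view_a, view_b) candidate pairs
    """
    labeled = list(labeled)
    pool = list(unlabeled)
    for _ in range(rounds):
        for view in (0, 1):
            if not pool:
                return labeled
            # Train this view's classifier on everything labeled so far,
            # including examples labeled by the other view's classifier.
            xs = [x[view] for x, _ in labeled]
            ys = [y for _, y in labeled]
            model = fit_centroids(xs, ys)
            # Move the most confidently classified candidates into the labeled set.
            ranked = sorted(pool, key=lambda x: predict(model, x[view])[1], reverse=True)
            for x in ranked[:per_round]:
                labeled.append((x, predict(model, x[view])[0]))
                pool.remove(x)
    return labeled


# Toy data: True = entailment holds, False = it does not (values are made up).
seed = [((0.9, 0.8), True), ((0.1, 0.2), False)]
candidates = [(0.85, 0.9), (0.15, 0.1), (0.8, 0.3)]
expanded = co_train(seed, candidates)
```

In this toy setting the seed pairs stand in for an existing entailment corpus and the candidates for pairs extracted from Wikipedia; a real system would replace the single-number views with richer feature sets and stronger classifiers.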