Entity Extraction via Ensemble Semantics

From Wikipedia Quality
Jump to: navigation, search


Entity Extraction via Ensemble Semantics
Authors
Marco Pennacchiotti
Patrick Pantel
Publication date
2009
Links

Entity Extraction via Ensemble Semantics - scientific work about Wikipedia quality published in 2009, written by Marco Pennacchiotti and Patrick Pantel.

Overview

Combining information extraction systems yields significantly higher quality resources than each system in isolation. In this paper, authors generalize such a mixing of sources and features in a framework called Ensemble Semantics. Authors show very large gains in entity extraction by combining state-of-the-art distributional and patternbased systems with a large set of features from a webcrawl, query logs, and Wikipedia. Experimental results on a webscale extraction of actors, athletes and musicians show significantly higher mean average precision scores (29% gain) compared with the current state of the art.

Embed

Wikipedia Quality

Pennacchiotti, Marco; Pantel, Patrick. (2009). "[[Entity Extraction via Ensemble Semantics]]". International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224.

English Wikipedia

{{cite journal |last1=Pennacchiotti |first1=Marco |last2=Pantel |first2=Patrick |title=Entity Extraction via Ensemble Semantics |date=2009 |url=https://wikipediaquality.com/wiki/Entity_Extraction_via_Ensemble_Semantics |journal=International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224}}

HTML

Pennacchiotti, Marco; Pantel, Patrick. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Entity_Extraction_via_Ensemble_Semantics">Entity Extraction via Ensemble Semantics</a>&quot;. International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224.