Entity Extraction via Ensemble Semantics
Authors | Marco Pennacchiotti, Patrick Pantel |
---|---|
Publication date | 2009 |
Entity Extraction via Ensemble Semantics is a scientific work about Wikipedia quality, published in 2009 and written by Marco Pennacchiotti and Patrick Pantel.
Overview
Combining information extraction systems yields significantly higher-quality resources than each system in isolation. In this paper, the authors generalize such a mixing of sources and features in a framework called Ensemble Semantics. They show very large gains in entity extraction by combining state-of-the-art distributional and pattern-based systems with a large set of features from a web crawl, query logs, and Wikipedia. Experimental results on a web-scale extraction of actors, athletes, and musicians show significantly higher mean average precision scores (a 29% gain) compared with the current state of the art.
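The reported metric, mean average precision, can be illustrated with a minimal sketch. This is not the authors' evaluation code. It assumes each entity class yields one ranked list of extracted candidates, each judged correct or incorrect, and that average precision is normalized by the number of correct items found in the list (other normalizations exist). The function names and the toy judgments below are hypothetical and are not results from the paper.

```python
from typing import Mapping, Sequence


def average_precision(ranked_labels: Sequence[bool]) -> float:
    """Average precision of one ranked list.

    ranked_labels[i] is True when the candidate at rank i + 1 is correct.
    Assumption: normalize by the number of correct items in the list.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_correct in enumerate(ranked_labels, start=1):
        if is_correct:
            hits += 1
            # Precision at this rank, counted only at correct positions.
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(runs: Mapping[str, Sequence[bool]]) -> float:
    """Mean of the per-class average precision values."""
    if not runs:
        return 0.0
    return sum(average_precision(labels) for labels in runs.values()) / len(runs)


# Hypothetical judgments for the three entity classes named in the overview.
toy_runs = {
    "actors": [True, True, False, True],
    "athletes": [True, False, True, False],
    "musicians": [False, True, True, True],
}

if __name__ == "__main__":
    for entity_class, labels in toy_runs.items():
        print(entity_class, round(average_precision(labels), 3))
    print("MAP", round(mean_average_precision(toy_runs), 3))
```

Because precision is accumulated only at ranks holding a correct candidate, a system that places correct entities earlier in the list scores higher than one that returns the same entities in a worse order.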
Embed
Wikipedia Quality
Pennacchiotti, Marco; Pantel, Patrick. (2009). "[[Entity Extraction via Ensemble Semantics]]". International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224.
English Wikipedia
{{cite journal |last1=Pennacchiotti |first1=Marco |last2=Pantel |first2=Patrick |title=Entity Extraction via Ensemble Semantics |date=2009 |url=https://wikipediaquality.com/wiki/Entity_Extraction_via_Ensemble_Semantics |journal=International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224}}
HTML
Pennacchiotti, Marco; Pantel, Patrick. (2009). "<a href="https://wikipediaquality.com/wiki/Entity_Extraction_via_Ensemble_Semantics">Entity Extraction via Ensemble Semantics</a>". International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224.