Entity Classification by Bag of Wikipedia Articles

From Wikipedia Quality
Jump to: navigation, search


Entity Classification by Bag of Wikipedia Articles
Authors
Tomáš Kliegr
Publication date
2010
DOI
10.1145/1871902.1871914
Links
Original

Entity Classification by Bag of Wikipedia Articles - scientific work related to Wikipedia quality published in 2010, written by Tomáš Kliegr.

Overview

The input for a Bag-of-Articles (BOA) classifier is a set of unlabeled entities - noun chunks and a set of target labeled entities - Wikipedia articles. The classifier locates Wikipedia articles that might define the unlabeled entity and performs disambiguation selecting one. Both unlabeled and labeled entity is represented with the proposed BOA term weight vector, which is created by aggregating term weight vectors of articles related to the Wikipedia article defining it. The label is assigned by choosing the closest labeled entity, also a BOA term weight vector, with cosine similarity. The paper formally defines the BOA entity representation and BOA-based entity classification and presents a partial software implementation. A BOA-based disambiguation algorithm is presented as a planned extension.

Embed

Wikipedia Quality

Kliegr, Tomáš. (2010). "[[Entity Classification by Bag of Wikipedia Articles]]".DOI: 10.1145/1871902.1871914.

English Wikipedia

{{cite journal |last1=Kliegr |first1=Tomáš |title=Entity Classification by Bag of Wikipedia Articles |date=2010 |doi=10.1145/1871902.1871914 |url=https://wikipediaquality.com/wiki/Entity_Classification_by_Bag_of_Wikipedia_Articles}}

HTML

Kliegr, Tomáš. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Entity_Classification_by_Bag_of_Wikipedia_Articles">Entity Classification by Bag of Wikipedia Articles</a>&quot;.DOI: 10.1145/1871902.1871914.