Entity Classification by Bag of Wikipedia Articles
Authors | Tomáš Kliegr |
---|---|
Publication date | 2010 |
DOI | 10.1145/1871902.1871914 |
Entity Classification by Bag of Wikipedia Articles is a scientific work related to Wikipedia quality, published in 2010 and written by Tomáš Kliegr.
Overview
The input for a Bag-of-Articles (BOA) classifier is a set of unlabeled entities (noun chunks) and a set of target labeled entities (Wikipedia articles). For each unlabeled entity, the classifier locates Wikipedia articles that might define it and performs disambiguation to select one. Both unlabeled and labeled entities are represented by the proposed BOA term weight vector, which is created by aggregating the term weight vectors of articles related to the Wikipedia article that defines the entity. The label is assigned by choosing the labeled entity whose BOA term weight vector is closest to that of the unlabeled entity under cosine similarity. The paper formally defines the BOA entity representation and BOA-based entity classification, and presents a partial software implementation. A BOA-based disambiguation algorithm is presented as a planned extension.
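The classification step described above can be illustrated with a minimal Python sketch. It is not the paper's implementation. The function names (`aggregate_boa`, `cosine`, `classify`), the toy term weights, and the choice to aggregate by plain summation are all assumptions made for illustration, since the summary does not specify how term weights are computed or combined.

```python
import math
from collections import Counter


def aggregate_boa(article_vectors):
    """Combine sparse term weight vectors of related articles into one BOA vector.

    Assumption: aggregation is a plain sum of weights per term.
    """
    boa = Counter()
    for vec in article_vectors:
        for term, weight in vec.items():
            boa[term] += weight
    return dict(boa)


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(entity_boa, labeled_boas):
    """Return the label whose BOA vector is closest to the entity's BOA vector."""
    return max(labeled_boas, key=lambda label: cosine(entity_boa, labeled_boas[label]))


# Hypothetical toy data: each labeled entity is built from the term weight
# vectors of articles related to its defining Wikipedia article.
labeled = {
    "Camera": aggregate_boa([{"lens": 2.0, "photo": 1.0}, {"sensor": 1.5, "lens": 0.5}]),
    "Bird": aggregate_boa([{"wing": 2.0, "feather": 1.0}, {"nest": 1.0, "wing": 0.5}]),
}

# The unlabeled entity gets the same representation after an article
# has been selected for it (the disambiguation step is not sketched here).
entity = aggregate_boa([{"lens": 1.0, "zoom": 1.0}, {"photo": 0.5}])

print(classify(entity, labeled))
```

Sparse dictionaries are used here because most terms have zero weight in any single article. A production system would more likely rely on a sparse matrix library and a proper weighting scheme such as TF-IDF, which is also an assumption rather than something the summary states.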