Wikipedia Entity Expansion and Attribute Extraction from the Web Using Semi-Supervised Learning



Authors: Lidong Bing, Wai Lam, Tak-Lam Wong
Publication date: 2013
DOI: 10.1145/2433396.2433468

Wikipedia Entity Expansion and Attribute Extraction from the Web Using Semi-Supervised Learning is a scientific work related to Wikipedia quality, published in 2013 and written by Lidong Bing, Wai Lam and Tak-Lam Wong.

Overview

The authors develop a new framework for Wikipedia entity expansion and attribute extraction from the Web. The framework takes as seed input a few existing entities that are automatically collected from a particular Wikipedia category, and explores their attribute infoboxes to obtain clues for discovering more entities in that category, together with the attribute content of the newly discovered entities. One characteristic of the framework is that it conducts discovery and extraction on semi-structured data record sets that are automatically collected from the Web. A semi-supervised learning model based on Conditional Random Fields is developed to handle extraction learning when only a limited number of labeled examples can be derived from the seed entities. The authors use a proximate record graph, which captures alignment similarity among data records, to guide the semi-supervised learning process. The learning process can then leverage the unlabeled data in the record set by controlling label regularization under the guidance of this graph. Extensive experiments on different domains have been conducted to demonstrate the framework's superiority in discovering new entities and extracting attribute content.
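The idea of letting a record similarity graph guide labels on unlabeled records can be illustrated with a small sketch. This is not the authors' model: it replaces the Conditional Random Fields learner with plain label propagation, and it uses token-set Jaccard overlap as a crude stand-in for alignment similarity. The function names (`jaccard`, `build_graph`, `propagate`), the toy records, and the labels `entity` and `noise` are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence

Graph = Dict[int, Dict[int, float]]


def jaccard(a: Sequence[str], b: Sequence[str]) -> float:
    """Token-set overlap; an assumed stand-in for alignment similarity."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def build_graph(records: List[List[str]], k: int = 2) -> Graph:
    """Link each record to its k most similar records with symmetric edges."""
    graph: Graph = {i: {} for i in range(len(records))}
    for i, rec in enumerate(records):
        scored = [(jaccard(rec, other), j)
                  for j, other in enumerate(records) if j != i]
        scored.sort(reverse=True)
        for sim, j in scored[:k]:
            if sim > 0.0:  # skip records that share no tokens
                graph[i][j] = sim
                graph[j][i] = sim
    return graph


def propagate(graph: Graph, seeds: Dict[int, str], labels: List[str],
              iterations: int = 20) -> Dict[int, Dict[str, float]]:
    """Smooth label distributions over the graph; seed nodes stay clamped."""
    uniform = 1.0 / len(labels)
    dist: Dict[int, Dict[str, float]] = {}
    for node in graph:
        if node in seeds:
            dist[node] = {lab: 1.0 if lab == seeds[node] else 0.0
                          for lab in labels}
        else:
            dist[node] = {lab: uniform for lab in labels}

    for _ in range(iterations):
        updated: Dict[int, Dict[str, float]] = {}
        for node, neighbours in graph.items():
            total = sum(neighbours.values())
            if node in seeds or total == 0.0:
                # Seeds are fixed; isolated nodes keep their current guess.
                updated[node] = dist[node]
                continue
            # Weighted average of the neighbours' label distributions.
            updated[node] = {
                lab: sum(w * dist[n][lab] for n, w in neighbours.items()) / total
                for lab in labels
            }
        dist = updated
    return dist


# Toy data: records 0 and 2 are labeled seeds, records 1 and 3 are unlabeled.
records = [
    ["name", "alan", "turing", "born", "1912"],
    ["name", "ada", "lovelace", "born", "1815"],
    ["buy", "cheap", "tickets", "now"],
    ["buy", "cheap", "flights", "now"],
]
seeds = {0: "entity", 2: "noise"}
labels = ["entity", "noise"]

dist = propagate(build_graph(records), seeds, labels)
for node in (1, 3):
    print(node, max(dist[node], key=dist[node].get))
```

In this toy setting each unlabeled record inherits the label of the seed record it most resembles. The framework described above applies a comparable smoothness constraint inside CRF training rather than as a separate propagation step.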

Embed

Wikipedia Quality

Bing, Lidong; Lam, Wai; Wong, Tak-Lam. (2013). "[[Wikipedia Entity Expansion and Attribute Extraction from the Web Using Semi-Supervised Learning]]". DOI: 10.1145/2433396.2433468.

English Wikipedia

{{cite journal |last1=Bing |first1=Lidong |last2=Lam |first2=Wai |last3=Wong |first3=Tak-Lam |title=Wikipedia Entity Expansion and Attribute Extraction from the Web Using Semi-Supervised Learning |date=2013 |doi=10.1145/2433396.2433468 |url=https://wikipediaquality.com/wiki/Wikipedia_Entity_Expansion_and_Attribute_Extraction_from_the_Web_Using_Semi-Supervised_Learning}}

HTML

Bing, Lidong; Lam, Wai; Wong, Tak-Lam. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Wikipedia_Entity_Expansion_and_Attribute_Extraction_from_the_Web_Using_Semi-Supervised_Learning">Wikipedia Entity Expansion and Attribute Extraction from the Web Using Semi-Supervised Learning</a>&quot;. DOI: 10.1145/2433396.2433468.