A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields

From Wikipedia Quality
Jump to: navigation, search


A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields
Authors
Yotaro Watanabe
Masayuki Asahara
Yuji Matsumoto
Publication date
2007
Links
Original Preprint

A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields - scientific work related to Wikipedia quality published in 2007, written by Yotaro Watanabe, Masayuki Asahara and Yuji Matsumoto.

Overview

This paper presents a method for categorizing named entities in Wikipedia. In Wikipedia, an anchor text is glossed in a linked HTML text. Authors formalize named entity categorization as a task of categorizing anchor texts with linked HTML texts which glosses a named entity. Using this representation, authors introduce a graph structure in which anchor texts are regarded as nodes. In order to incorporate HTML structure on the graph, three types of cliques are defined based on the HTML tree structure. Authors propose a method with Conditional Random Fields (CRFs) to categorize the nodes on the graph. Since the defined graph may include cycles, the exact inference of CRFs is computationally expensive. Authors introduce an approximate inference method using Treebased Reparameterization (TRP) to reduce computational cost. In experiments, proposed model obtained significant improvements compare to baseline models that use Support Vector Machine.

Embed

Wikipedia Quality

Yotaro, Watanabe; Masayuki, Asahara; Yuji, Matsumoto. (2007). "[[A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields]]".

English Wikipedia

{{cite journal |last1=Yotaro |first1=Watanabe |last2=Masayuki |first2=Asahara |last3=Yuji |first3=Matsumoto |title=A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields |date=2007 |url=https://wikipediaquality.com/wiki/A_Graph-Based_Approach_to_Named_Entity_Categorization_in_Wikipedia_Using_Conditional_Random_Fields}}

HTML

Yotaro, Watanabe; Masayuki, Asahara; Yuji, Matsumoto. (2007). &quot;<a href="https://wikipediaquality.com/wiki/A_Graph-Based_Approach_to_Named_Entity_Categorization_in_Wikipedia_Using_Conditional_Random_Fields">A Graph-Based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields</a>&quot;.