Harvesting Domain-Specific Terms Using Wikipedia

Harvesting Domain-Specific Terms Using Wikipedia - scientific work related to Wikipedia quality published in 2011, written by Su Nam Kim, Lawrence Cavedon and Timothy Baldwin.

Overview

Authors present a simple but effective method of automatically extracting domain-specific terms using Wikipedia as training data (i.e. self-supervised learning). Authors first goal is to show, using human judgments, that Wikipedia categories are domainspecific and thus can replace manually annotated terms. Second, authors show that identifying such terms using harvested Wikipedia categories and entities as seeds is reliable when compared to the use of dictionary terms. Authors technique facilitates the construction of large semantic resources in multiple domains without requiring manually annotated training data.

Harvesting Domain-Specific Terms Using Wikipedia

Overview

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools