Difference between revisions of "Recognizing Biographical Sections in Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Adding wikilinks)
(infobox)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Recognizing Biographical Sections in Wikipedia
 +
| date = 2015
 +
| authors = [[Alessio Palmero Aprosio]]<br />[[Sara Tonelli]]
 +
| doi = 10.18653/v1/D15-1095
 +
| link = http://aclweb.org/anthology/D15-1095
 +
}}
 
'''Recognizing Biographical Sections in Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2015, written by [[Alessio Palmero Aprosio]] and [[Sara Tonelli]].
 
'''Recognizing Biographical Sections in Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2015, written by [[Alessio Palmero Aprosio]] and [[Sara Tonelli]].
  
 
== Overview ==
 
== Overview ==
 
Wikipedia is the largest collection of encyclopedic data ever written in the history of humanity. Thanks to its coverage and its availability in machine-readable format, it has become a primary resource for largescale research in historical and cultural studies. In this work, authors focus on the subset of pages describing persons, and authors investigate the task of recognizing biographical sections from them: given a person’s page, authors identify the list of sections where information about her/his life is present. Authors model this as a sequence classification problem, and propose a supervised setting, in which the training data are acquired automatically. Besides, authors show that six simple [[features]] extracted only from the section titles are very informative and yield good results well above a strong baseline.
 
Wikipedia is the largest collection of encyclopedic data ever written in the history of humanity. Thanks to its coverage and its availability in machine-readable format, it has become a primary resource for largescale research in historical and cultural studies. In this work, authors focus on the subset of pages describing persons, and authors investigate the task of recognizing biographical sections from them: given a person’s page, authors identify the list of sections where information about her/his life is present. Authors model this as a sequence classification problem, and propose a supervised setting, in which the training data are acquired automatically. Besides, authors show that six simple [[features]] extracted only from the section titles are very informative and yield good results well above a strong baseline.

Revision as of 08:30, 21 May 2020


Recognizing Biographical Sections in Wikipedia
Authors
Alessio Palmero Aprosio
Sara Tonelli
Publication date
2015
DOI
10.18653/v1/D15-1095
Links
Original

Recognizing Biographical Sections in Wikipedia - scientific work related to Wikipedia quality published in 2015, written by Alessio Palmero Aprosio and Sara Tonelli.

Overview

Wikipedia is the largest collection of encyclopedic data ever written in the history of humanity. Thanks to its coverage and its availability in machine-readable format, it has become a primary resource for largescale research in historical and cultural studies. In this work, authors focus on the subset of pages describing persons, and authors investigate the task of recognizing biographical sections from them: given a person’s page, authors identify the list of sections where information about her/his life is present. Authors model this as a sequence classification problem, and propose a supervised setting, in which the training data are acquired automatically. Besides, authors show that six simple features extracted only from the section titles are very informative and yield good results well above a strong baseline.