Difference between revisions of "An Unsupervised Approach to Biography Production Using Wikipedia"

From Wikipedia Quality
(An Unsupervised Approach to Biography Production Using Wikipedia - basic info)
 
(Adding wikilinks)
Line 1:
- '''An Unsupervised Approach to Biography Production Using Wikipedia''' - scientific work related to Wikipedia quality published in 2008, written by Fadi Biadsy, Julia Hirschberg and Elena Filatova.
+ '''An Unsupervised Approach to Biography Production Using Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2008, written by [[Fadi Biadsy]], [[Julia Hirschberg]] and [[Elena Filatova]].
  
  == Overview ==
- Authors describe an unsupervised approach to multi-document sentence-extraction based summarization for the task of producing biographies. Authors utilize Wikipedia to automatically construct a corpus of biographical sentences and TDT4 to construct a corpus of non-biographical sentences. Authors build a biographical-sentence classifier from these corpora and an SVM regression model for sentence ordering from the Wikipedia corpus. Authors evaluate work on the DUC2004 evaluation data and with human judges. Overall, system significantly outperforms all systems that participated in DUC2004, according to the ROUGE-L metric, and is preferred by human subjects.
+ Authors describe an unsupervised approach to multi-document sentence-extraction based summarization for the task of producing biographies. Authors utilize [[Wikipedia]] to automatically construct a corpus of biographical sentences and TDT4 to construct a corpus of non-biographical sentences. Authors build a biographical-sentence classifier from these corpora and an SVM regression model for sentence ordering from the Wikipedia corpus. Authors evaluate work on the DUC2004 evaluation data and with human judges. Overall, system significantly outperforms all systems that participated in DUC2004, according to the ROUGE-L metric, and is preferred by human subjects.

Revision as of 23:27, 16 September 2019

An Unsupervised Approach to Biography Production Using Wikipedia is a scientific work related to Wikipedia quality, published in 2008 and written by Fadi Biadsy, Julia Hirschberg and Elena Filatova.

Overview

The authors describe an unsupervised approach to multi-document, sentence-extraction-based summarization for the task of producing biographies. They use Wikipedia to automatically construct a corpus of biographical sentences and TDT4 to construct a corpus of non-biographical sentences. From these corpora they build a biographical-sentence classifier, and from the Wikipedia corpus they build an SVM regression model for sentence ordering. They evaluate the work on the DUC2004 evaluation data and with human judges. Overall, the system significantly outperforms all systems that participated in DUC2004, according to the ROUGE-L metric, and is preferred by human subjects.
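
The ROUGE-L metric mentioned above scores a candidate summary by the longest common subsequence (LCS) it shares with a reference summary. The following is a minimal sketch of the sentence-level F-measure, not the official DUC2004 evaluation setup: the lowercased whitespace tokenization, the default `beta=1.0`, and the function names `lcs_length` and `rouge_l` are assumptions made for illustration only.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)  # LCS lengths for the previous row of the DP table
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference, beta=1.0):
    """Sentence-level ROUGE-L F-measure between two strings.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)  # share of candidate tokens on the LCS
    recall = lcs / len(ref)      # share of reference tokens on the LCS
    return (1 + beta**2) * precision * recall / (recall + beta**2 * precision)
```

For example, `rouge_l("he was born in Paris", "he was born in 1950 in Paris")` compares a short candidate against a longer reference; because the LCS preserves word order, shuffled sentences score lower than in-order ones even when they share the same words.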