From French Wikipedia to Erudit: a Test Case for Cross-Domain Open Information Extraction

From Wikipedia Quality
Jump to: navigation, search


From French Wikipedia to Erudit: a Test Case for Cross-Domain Open Information Extraction
Authors
Fabrizio Gotti
Philippe Langlais
Publication date
2018
DOI
10.1111/coin.12120
Links
Original

From French Wikipedia to Erudit: a Test Case for Cross-Domain Open Information Extraction - scientific work related to Wikipedia quality published in 2018, written by Fabrizio Gotti and Philippe Langlais.

Overview

In this paper, authors describe an open information extraction pipeline based on ReVerb for extracting knowledge from French text. Authors put it to the test by using the information triples extracted to build an entity classifier, ie, a system able to label a given instance with its type (for instance, Michel Foucault is a philosopher). The classifier requires little supervision. One novel aspect of this study is that authors show how general domain information triples (extracted from French Wikipedia) can be used for deriving new knowledge from domain-specific documents unrelated to Wikipedia, in case scholarly articles focusing on the humanities. Authors believe that the present study is the first that focuses on such a cross-domain, recall-oriented approach in open information extraction. While system's performance shows room for improvement, manual assessments show that the task is quite hard, even for a human, in part because of the cross-domain aspect of the problem authors tackle.

Embed

Wikipedia Quality

Gotti, Fabrizio; Langlais, Philippe. (2018). "[[From French Wikipedia to Erudit: a Test Case for Cross-Domain Open Information Extraction]]". Wiley/Blackwell (10.1111). DOI: 10.1111/coin.12120.

English Wikipedia

{{cite journal |last1=Gotti |first1=Fabrizio |last2=Langlais |first2=Philippe |title=From French Wikipedia to Erudit: a Test Case for Cross-Domain Open Information Extraction |date=2018 |doi=10.1111/coin.12120 |url=https://wikipediaquality.com/wiki/From_French_Wikipedia_to_Erudit:_a_Test_Case_for_Cross-Domain_Open_Information_Extraction |journal=Wiley/Blackwell (10.1111)}}

HTML

Gotti, Fabrizio; Langlais, Philippe. (2018). &quot;<a href="https://wikipediaquality.com/wiki/From_French_Wikipedia_to_Erudit:_a_Test_Case_for_Cross-Domain_Open_Information_Extraction">From French Wikipedia to Erudit: a Test Case for Cross-Domain Open Information Extraction</a>&quot;. Wiley/Blackwell (10.1111). DOI: 10.1111/coin.12120.