Web Corpus Mining by Instance of Wikipedia

From Wikipedia Quality
Revision as of 08:52, 6 November 2020 by Allison (talk | contribs) (+ Embed)
Jump to: navigation, search


Web Corpus Mining by Instance of Wikipedia
Authors
Rüdiger Gleim
Alexander Mehler
Matthias Dehmer
Publication date
2006
DOI
10.3115/1628297.1628307
Links
Original

Web Corpus Mining by Instance of Wikipedia - scientific work related to Wikipedia quality published in 2006, written by Rüdiger Gleim, Alexander Mehler and Matthias Dehmer.

Overview

In this paper authors present an approach to structure learning in the area of web documents. This is done in order to approach the goal of webgenre tagging in the area of web corpus linguistics. A central outcome of the paper is that purely structure oriented approaches to web document classification provide an information gain which may be utilized in combined approaches of web content and structure analysis.

Embed

Wikipedia Quality

Gleim, Rüdiger; Mehler, Alexander; Dehmer, Matthias. (2006). "[[Web Corpus Mining by Instance of Wikipedia]]". Association for Computational Linguistics. DOI: 10.3115/1628297.1628307.

English Wikipedia

{{cite journal |last1=Gleim |first1=Rüdiger |last2=Mehler |first2=Alexander |last3=Dehmer |first3=Matthias |title=Web Corpus Mining by Instance of Wikipedia |date=2006 |doi=10.3115/1628297.1628307 |url=https://wikipediaquality.com/wiki/Web_Corpus_Mining_by_Instance_of_Wikipedia |journal=Association for Computational Linguistics}}

HTML

Gleim, Rüdiger; Mehler, Alexander; Dehmer, Matthias. (2006). &quot;<a href="https://wikipediaquality.com/wiki/Web_Corpus_Mining_by_Instance_of_Wikipedia">Web Corpus Mining by Instance of Wikipedia</a>&quot;. Association for Computational Linguistics. DOI: 10.3115/1628297.1628307.