Web Corpus Mining by Instance of Wikipedia
Authors | Rüdiger Gleim, Alexander Mehler, Matthias Dehmer |
---|---|
Publication date | 2006 |
DOI | 10.3115/1628297.1628307 |
Web Corpus Mining by Instance of Wikipedia is a scientific work related to Wikipedia quality, published in 2006 and written by Rüdiger Gleim, Alexander Mehler and Matthias Dehmer.
Overview
In this paper, the authors present an approach to structure learning for web documents, with the aim of webgenre tagging in web corpus linguistics. A central outcome of the paper is that purely structure-oriented approaches to web document classification provide an information gain that may be utilized in combined approaches to web content and structure analysis.
Embed
Wikipedia Quality
Gleim, Rüdiger; Mehler, Alexander; Dehmer, Matthias. (2006). "[[Web Corpus Mining by Instance of Wikipedia]]". Association for Computational Linguistics. DOI: 10.3115/1628297.1628307.
English Wikipedia
{{cite journal |last1=Gleim |first1=Rüdiger |last2=Mehler |first2=Alexander |last3=Dehmer |first3=Matthias |title=Web Corpus Mining by Instance of Wikipedia |date=2006 |doi=10.3115/1628297.1628307 |url=https://wikipediaquality.com/wiki/Web_Corpus_Mining_by_Instance_of_Wikipedia |journal=Association for Computational Linguistics}}
HTML
Gleim, Rüdiger; Mehler, Alexander; Dehmer, Matthias. (2006). "<a href="https://wikipediaquality.com/wiki/Web_Corpus_Mining_by_Instance_of_Wikipedia">Web Corpus Mining by Instance of Wikipedia</a>". Association for Computational Linguistics. DOI: 10.3115/1628297.1628307.