Difference between revisions of "A Generic Method for Multi Word Extraction from Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Embed for English Wikipedia, HTML)
(+ cat.)
 
Line 32: Line 32:
 
</nowiki>
 
</nowiki>
 
</code>
 
</code>
 +
 +
 +
 +
[[Category:Scientific works]]
 +
[[Category:Croatian Wikipedia]]

Latest revision as of 12:52, 2 November 2020


A Generic Method for Multi Word Extraction from Wikipedia
Authors
Bozo Bekavac
Marko Tadić
Publication date
2008
DOI
10.1109/ITI.2008.4588490
Links
Original

A Generic Method for Multi Word Extraction from Wikipedia - scientific work related to Wikipedia quality published in 2008, written by Bozo Bekavac and Marko Tadić.

Overview

This paper presents the generic method for multiword expression extraction from Wikipedia. The method is using the properties of this specific encyclopedic genre in its HTML format and it relies on the intention of the authors of articles to link to other articles. The relevant links were processed by applying local regular grammars within the NooJ development environment. Authors tested the method on a Croatian version of Wikipedia and authors present the results obtained.

Embed

Wikipedia Quality

Bekavac, Bozo; Tadić, Marko. (2008). "[[A Generic Method for Multi Word Extraction from Wikipedia]]".DOI: 10.1109/ITI.2008.4588490.

English Wikipedia

{{cite journal |last1=Bekavac |first1=Bozo |last2=Tadić |first2=Marko |title=A Generic Method for Multi Word Extraction from Wikipedia |date=2008 |doi=10.1109/ITI.2008.4588490 |url=https://wikipediaquality.com/wiki/A_Generic_Method_for_Multi_Word_Extraction_from_Wikipedia}}

HTML

Bekavac, Bozo; Tadić, Marko. (2008). &quot;<a href="https://wikipediaquality.com/wiki/A_Generic_Method_for_Multi_Word_Extraction_from_Wikipedia">A Generic Method for Multi Word Extraction from Wikipedia</a>&quot;.DOI: 10.1109/ITI.2008.4588490.