Revision as of 06:52, 28 November 2019
Authors | Bozo Bekavac, Marko Tadić |
---|---|
Publication date | 2008 |
DOI | 10.1109/ITI.2008.4588490 |
Links | Original (http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4588490) |
A Generic Method for Multi Word Extraction from Wikipedia is a scientific work related to Wikipedia quality, published in 2008 and written by Bozo Bekavac and Marko Tadić.
Overview
This paper presents a generic method for multiword expression extraction from Wikipedia. The method uses the properties of this specific encyclopedic genre in its HTML format and relies on the intention of article authors to link to other articles. The relevant links were processed by applying local regular grammars within the NooJ development environment. The authors tested the method on the Croatian version of Wikipedia and present the results obtained.
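The link-based idea described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the paper processed links with local regular grammars in NooJ, whereas this sketch substitutes a simple word-count filter. The class `LinkTextCollector`, the function `multiword_candidates`, the `/wiki/` link prefix, and the sample HTML are all assumptions made for illustration.

```python
from collections import Counter
from html.parser import HTMLParser


class LinkTextCollector(HTMLParser):
    """Collect the anchor text of links that point to other articles (hypothetical helper)."""

    def __init__(self, internal_prefix="/wiki/"):
        super().__init__()
        self.internal_prefix = internal_prefix  # assumed URL scheme for article links
        self._in_link = False
        self._buffer = []
        self.anchors = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        target = href[len(self.internal_prefix):]
        # Keep article links only; skip external links and namespaced pages (e.g. files).
        if href.startswith(self.internal_prefix) and ":" not in target:
            self._in_link = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_link:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._in_link:
            # Normalize whitespace inside the anchor text.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.anchors.append(text)
            self._in_link = False


def multiword_candidates(html, min_words=2):
    """Count anchor texts with at least `min_words` whitespace-separated words."""
    parser = LinkTextCollector()
    parser.feed(html)
    parser.close()
    return Counter(a for a in parser.anchors if len(a.split()) >= min_words)


# Invented sample markup, loosely modeled on a Croatian Wikipedia page.
sample = (
    '<p><a href="/wiki/Hrvatska">Hrvatska</a> je članica '
    '<a href="/wiki/Europska_unija">Europske unije</a>. '
    '<a href="/wiki/Datoteka:Karta.png">Karta Hrvatske</a> '
    '<a href="https://example.org/">vanjska poveznica</a></p>'
)
candidates = multiword_candidates(sample)
```

In this sample only "Europske unije" survives the filter: the single-word link, the file link, and the external link are discarded. A real pipeline would replace the word-count test with grammar-based filtering of the kind the paper describes.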