Difference between revisions of "Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia"
(Infobox work) |
(Embed) |
||
Line 10: | Line 10: | ||
== Overview == | == Overview == | ||
Finding domain specific key terms/phrases from a given set of documents is a challenging task. A domain may be defined as an area of interest over a collection of documents which may not be explicitly defined but implicitly observable in those documents. When considering a collection of documents related to academic research, examples of key terms/phrases may be Information Retrieval", "Marine Biology", etc. In this paper a technique for extracting important key terms/phrases in a considered topical domain is proposed using external evidence from the titles of [[Wikipedia]] articles and the Wikipedia category graph. Authors performed some experiments over the document collection of Web sites of different post-graduate schools. Authors preliminary evaluations show promising results for the detection of domain specific key terms/phrases from the given set of domain focused Web pages. | Finding domain specific key terms/phrases from a given set of documents is a challenging task. A domain may be defined as an area of interest over a collection of documents which may not be explicitly defined but implicitly observable in those documents. When considering a collection of documents related to academic research, examples of key terms/phrases may be Information Retrieval", "Marine Biology", etc. In this paper a technique for extracting important key terms/phrases in a considered topical domain is proposed using external evidence from the titles of [[Wikipedia]] articles and the Wikipedia category graph. Authors performed some experiments over the document collection of Web sites of different post-graduate schools. Authors preliminary evaluations show promising results for the detection of domain specific key terms/phrases from the given set of domain focused Web pages. | ||
+ | |||
+ | == Embed == | ||
+ | === Wikipedia Quality === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Qureshi, M. Atif; O'Riordan, Colm; Pasi, Gabriella. (2012). "[[Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia]]".DOI: 10.1145/2396761.2398680. | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === English Wikipedia === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | {{cite journal |last1=Qureshi |first1=M. Atif |last2=O'Riordan |first2=Colm |last3=Pasi |first3=Gabriella |title=Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia |date=2012 |doi=10.1145/2396761.2398680 |url=https://wikipediaquality.com/wiki/Short-Text_Domain_Specific_Key_Terms/Phrases_Extraction_Using_an_N-Gram_Model_with_Wikipedia}} | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === HTML === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Qureshi, M. Atif; O'Riordan, Colm; Pasi, Gabriella. (2012). &quot;<a href="https://wikipediaquality.com/wiki/Short-Text_Domain_Specific_Key_Terms/Phrases_Extraction_Using_an_N-Gram_Model_with_Wikipedia">Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia</a>&quot;.DOI: 10.1145/2396761.2398680. | ||
+ | </nowiki> | ||
+ | </code> |
Revision as of 06:36, 22 April 2021
Authors | M. Atif Qureshi Colm O'Riordan Gabriella Pasi |
---|---|
Publication date | 2012 |
DOI | 10.1145/2396761.2398680 |
Links | Original |
Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia - scientific work related to Wikipedia quality published in 2012, written by M. Atif Qureshi, Colm O'Riordan and Gabriella Pasi.
Overview
Finding domain specific key terms/phrases from a given set of documents is a challenging task. A domain may be defined as an area of interest over a collection of documents which may not be explicitly defined but implicitly observable in those documents. When considering a collection of documents related to academic research, examples of key terms/phrases may be Information Retrieval", "Marine Biology", etc. In this paper a technique for extracting important key terms/phrases in a considered topical domain is proposed using external evidence from the titles of Wikipedia articles and the Wikipedia category graph. Authors performed some experiments over the document collection of Web sites of different post-graduate schools. Authors preliminary evaluations show promising results for the detection of domain specific key terms/phrases from the given set of domain focused Web pages.
Embed
Wikipedia Quality
Qureshi, M. Atif; O'Riordan, Colm; Pasi, Gabriella. (2012). "[[Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia]]".DOI: 10.1145/2396761.2398680.
English Wikipedia
{{cite journal |last1=Qureshi |first1=M. Atif |last2=O'Riordan |first2=Colm |last3=Pasi |first3=Gabriella |title=Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia |date=2012 |doi=10.1145/2396761.2398680 |url=https://wikipediaquality.com/wiki/Short-Text_Domain_Specific_Key_Terms/Phrases_Extraction_Using_an_N-Gram_Model_with_Wikipedia}}
HTML
Qureshi, M. Atif; O'Riordan, Colm; Pasi, Gabriella. (2012). "<a href="https://wikipediaquality.com/wiki/Short-Text_Domain_Specific_Key_Terms/Phrases_Extraction_Using_an_N-Gram_Model_with_Wikipedia">Short-Text Domain Specific Key Terms/Phrases Extraction Using an N-Gram Model with Wikipedia</a>".DOI: 10.1145/2396761.2398680.