Difference between revisions of "Token Level Code-Switching Detection Using Wikipedia as a Lexical Resource"

Revision as of 00:42, 27 October 2019

Token Level Code-Switching Detection Using Wikipedia as a Lexical Resource
Authors	Daniel Claeser Dennis Felske Samantha Kent
Publication date	2017
DOI	10.1007/978-3-319-73706-5_16
Links	Original

Token Level Code-Switching Detection Using Wikipedia as a Lexical Resource - scientific work related to Wikipedia quality published in 2017, written by Daniel Claeser, Dennis Felske and Samantha Kent.

Overview

Authors present a novel lexicon-based classification approach for code-switching detection on Twitter. The main aim is to develop a simple lexical look-up classifier based on frequency information retrieved from Wikipedia. Authors evaluate the classifier using three different language pairs: Spanish-English, Dutch-English, and German-Turkish. The results indicate that figures for Spanish-English are competitive with current state of the art classifiers, even though the approach is simplistic and based solely on word frequency information.

@@ Line 1: / Line 1: @@
+{{Infobox work
+| title = Token Level Code-Switching Detection Using Wikipedia as a Lexical Resource
+| date = 2017
+| authors = [[Daniel Claeser]]<br />[[Dennis Felske]]<br />[[Samantha Kent]]
+| doi = 10.1007/978-3-319-73706-5_16
+| link = https://link.springer.com/content/pdf/10.1007%2F978-3-319-73706-5_16.pdf
+}}
 '''Token Level Code-Switching Detection Using Wikipedia as a Lexical Resource''' - scientific work related to [[Wikipedia quality]] published in 2017, written by [[Daniel Claeser]], [[Dennis Felske]] and [[Samantha Kent]].
 == Overview ==
 Authors present a novel lexicon-based classification approach for code-switching detection on [[Twitter]]. The main aim is to develop a simple lexical look-up classifier based on frequency information retrieved from [[Wikipedia]]. Authors evaluate the classifier using three [[different language]] pairs: Spanish-English, Dutch-English, and German-Turkish. The results indicate that figures for Spanish-English are competitive with current state of the art classifiers, even though the approach is simplistic and based solely on word frequency information.

Difference between revisions of "Token Level Code-Switching Detection Using Wikipedia as a Lexical Resource"

Revision as of 00:42, 27 October 2019

Overview

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools