Building Indonesian Local Language Detection Tools Using Wikipedia Data



Authors: Puji Martadinata, Bayu Distiawan Trisedya, Hisar Maruli Manurung, Mirna Adriani
Publication date: 2015
DOI: 10.1007/978-3-319-31468-6_8

Building Indonesian Local Language Detection Tools Using Wikipedia Data is a scientific work related to Wikipedia quality, published in 2015 and written by Puji Martadinata, Bayu Distiawan Trisedya, Hisar Maruli Manurung and Mirna Adriani.

Overview

The widespread use of social media has generated considerable research interest in information retrieval, natural language processing, and machine learning. The diversity of languages used on social media creates a need for accurate automated language identification tools. In this research, the authors develop a language identification tool that can automatically identify social media posts written in Indonesian, Javanese, Sundanese, and Minangkabau. The latter three are among the most widely spoken regional languages in Indonesia. The authors conducted experiments comparing three popular methods for building language identification tools: N-grams, statistical models, and the Small Words technique. The experiments, trained on articles from the internet and tested on a social media dataset constructed by the authors, show that the statistical method obtains the best result among the methods compared.
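
To make one of the compared approaches concrete, the following is a minimal sketch of the general Small Words idea: each language is represented by a list of short, very frequent function words, and a text is assigned to the language whose list matches the most tokens. The word lists, the function names (`tokenize`, `score`, `identify`) and the tie-handling rule are illustrative assumptions, not the lists or the implementation used in the paper.

```python
import re

# Toy lists of short, frequent function words per language.
# ASSUMPTION: these lists are illustrative only and are not taken from the paper.
SMALL_WORDS = {
    "indonesian": {"yang", "dan", "tidak", "ini", "dengan"},
    "javanese": {"sing", "lan", "ora", "iki", "karo"},
    "sundanese": {"anu", "jeung", "teu", "ieu", "sareng"},
    "minangkabau": {"nan", "jo", "indak", "iko", "dek"},
}


def tokenize(text):
    """Lowercase the text and split it into runs of ASCII letters."""
    return re.findall(r"[a-z]+", text.lower())


def score(text):
    """Count, for each language, how many tokens appear in its small-word list."""
    tokens = tokenize(text)
    return {
        lang: sum(1 for token in tokens if token in words)
        for lang, words in SMALL_WORDS.items()
    }


def identify(text):
    """Return the language with the most matches, or None if there is
    no match at all or the top score is shared by several languages."""
    scores = score(text)
    best = max(scores.values())
    if best == 0:
        return None
    winners = [lang for lang, value in scores.items() if value == best]
    return winners[0] if len(winners) == 1 else None


if __name__ == "__main__":
    for post in ["saya tidak tahu yang ini", "aku ora ngerti sing iki"]:
        print(post, "->", identify(post))
```

Returning `None` on ties or on texts with no matching words is one possible design choice for short social media posts, where a few tokens often carry too little evidence; a real system would need much larger word lists and an evaluation on held-out data.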

Embed

Wikipedia Quality

Martadinata, Puji; Trisedya, Bayu Distiawan; Manurung, Hisar Maruli; Adriani, Mirna. (2015). "[[Building Indonesian Local Language Detection Tools Using Wikipedia Data]]". Springer, Cham. DOI: 10.1007/978-3-319-31468-6_8.

English Wikipedia

{{cite journal |last1=Martadinata |first1=Puji |last2=Trisedya |first2=Bayu Distiawan |last3=Manurung |first3=Hisar Maruli |last4=Adriani |first4=Mirna |title=Building Indonesian Local Language Detection Tools Using Wikipedia Data |date=2015 |doi=10.1007/978-3-319-31468-6_8 |url=https://wikipediaquality.com/wiki/Building_Indonesian_Local_Language_Detection_Tools_Using_Wikipedia_Data |journal=Springer, Cham}}

HTML

Martadinata, Puji; Trisedya, Bayu Distiawan; Manurung, Hisar Maruli; Adriani, Mirna. (2015). &quot;<a href="https://wikipediaquality.com/wiki/Building_Indonesian_Local_Language_Detection_Tools_Using_Wikipedia_Data">Building Indonesian Local Language Detection Tools Using Wikipedia Data</a>&quot;. Springer, Cham. DOI: 10.1007/978-3-319-31468-6_8.