Compilation of Malay criminological terms from online news

A Malay language corpus has been established by the Institute of Language and Literature (Dewan Bahasa dan Pustaka, DBP) in Malaysia. Most past research on this corpus has focused on the description, lexicography and translation of the Malay language. However, in the existing li...

Bibliographic Details
Main Authors: Ling Lee, Joanna Chiew (Author), Lee Teh, Phoey (Author), Lun Lau, Sian (Author), Pak, Irina (Author)
Format: EJournal Article
Published: Institute of Advanced Engineering and Science, 2019-07-01.
LEADER 02446 am a22003253u 4500
001 ijeecs18480_12573
042 |a dc 
100 1 0 |a Ling Lee, Joanna Chiew  |e author 
700 1 0 |a Lee Teh, Phoey  |e author 
700 1 0 |a Lun Lau, Sian  |e author 
700 1 0 |a Pak, Irina  |e author 
245 0 0 |a Compilation of Malay criminological terms from online news 
260 |b Institute of Advanced Engineering and Science,   |c 2019-07-01. 
500 |a https://ijeecs.iaescore.com/index.php/IJEECS/article/view/18480 
520 |a A Malay language corpus has been established by the Institute of Language and Literature (Dewan Bahasa dan Pustaka, DBP) in Malaysia. Most past research on this corpus has focused on the description, lexicography and translation of the Malay language. However, the existing literature contains no list of Malay words that categorizes crime terminology. This study aims to fill that linguistic gap. First, we aggregated the most frequently used crime terms from Malaysian online news sources. Five hundred crime-related words were compiled. No automated tools were used in the initial process, but they were subsequently used to verify the data. Four human coders validated the data to ensure that the original semantic meaning of the Malay text was preserved. Finally, major crime terms were outlined from a set of keywords to serve as taggers in our solution. The ultimate goal of this study is to provide a corpus for forensic linguistics, police investigations, and general crime research. This study has established the first corpus of criminological text in the Malay language. 
540 |a Copyright (c) 2019 Institute of Advanced Engineering and Science 
540 |a http://creativecommons.org/licenses/by-nc/4.0 
546 |a eng 
690 |a Criminological text; Malay language; Part-of-speech; Semantic tagging 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
786 0 |n Indonesian Journal of Electrical Engineering and Computer Science; Vol 15, No 1: July 2019; 355-364 
786 0 |n 2502-4760 
786 0 |n 2502-4752 
786 0 |n 10.11591/ijeecs.v15.i1 
787 0 |n https://ijeecs.iaescore.com/index.php/IJEECS/article/view/18480/12573 
856 4 1 |u https://ijeecs.iaescore.com/index.php/IJEECS/article/view/18480/12573  |z Get fulltext