Hate Speech Detection in Indonesian Twitter using Contextual Embedding Approach

Hate speech develops along with the rapid development of social media. Hate speech is often issued due to a lack of public awareness of the difference between criticism and statements that might contribute to this crime. Therefore, it is very important to do early detection of sentences that will be...

Full description

Saved in:
Bibliographic Details
Main Authors: Herwanto, Guntur Budi (Author), Ningtyas, Annisa Maulida (Author), Mujiyatna, I Gede (Author), Nugraha, Kurniawan Eka (Author), Prayana Trisna, I Nyoman (Author)
Format: EJournal Article
Published: IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia., 2021-04-30.
Subjects:
Online Access:Get Fulltext
Get Fulltext
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 02463 am a22003373u 4500
001 IJCSS_64916
042 |a dc 
100 1 0 |a Herwanto, Guntur Budi  |e author 
100 1 0 |e contributor 
700 1 0 |a Ningtyas, Annisa Maulida  |e author 
700 1 0 |a Mujiyatna, I Gede  |e author 
700 1 0 |a Nugraha, Kurniawan Eka  |e author 
700 1 0 |a Prayana Trisna, I Nyoman  |e author 
245 0 0 |a Hate Speech Detection in Indonesian Twitter using Contextual Embedding Approach 
260 |b IndoCEISS in colaboration with Universitas Gadjah Mada, Indonesia.,   |c 2021-04-30. 
500 |a https://jurnal.ugm.ac.id/ijccs/article/view/64916 
520 |a Hate speech develops along with the rapid development of social media. Hate speech is often issued due to a lack of public awareness of the difference between criticism and statements that might contribute to this crime. Therefore, it is very important to do early detection of sentences that will be written before causing a criminal act due to public ignorance. In this paper, we use the advancement of deep neural networks to predict whether a sentence contains a hate speech and abusive tone. We demonstrate the robustness of different word and contextual embedding to represent the semantic of hate speech words. In addition, we use a document embedding representation via a recurrent neural networks with gated recurrent unit as the main architecture to provide richer representation. Compared to syntactic representation of the previous approach, the contextual embedding in our model proved to give a significant boost on the performance by a significant margin. 
540 |a Copyright (c) 2021 IJCCS (Indonesian Journal of Computing and Cybernetics Systems) 
540 |a http://creativecommons.org/licenses/by-sa/4.0 
546 |a eng 
690 |a Computer Science 
690 |a hate speech; natural language processing; deep neural network; contextual embedding; recurrent neural network 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n IJCCS (Indonesian Journal of Computing and Cybernetics Systems); Vol 15, No 2 (2021): April; 177-188 
786 0 |n 2460-7258 
786 0 |n 1978-1520 
787 0 |n https://jurnal.ugm.ac.id/ijccs/article/view/64916/31157 
856 4 1 |u https://jurnal.ugm.ac.id/ijccs/article/view/64916  |z Get Fulltext 
856 4 1 |u https://jurnal.ugm.ac.id/ijccs/article/view/64916/31157  |z Get Fulltext