Integration of synthetic minority oversampling technique for imbalanced class

In the data mining, a class imbalance is a problematic issue to look for the solutions. It probably because machine learning is constructed by using algorithms with assuming the number of instances in each balanced class, so when using a class imbalance, it is possible that the prediction results ar...

Full description

Saved in:
Bibliographic Details
Main Authors: Santoso, Noviyanti (Author), Wibowo, Wahyu (Author), Hikmawati, Hilda (Author)
Format: EJournal Article
Published: Institute of Advanced Engineering and Science, 2019-01-01.
Subjects:
Online Access:Get fulltext
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 02274 am a22003133u 4500
001 ijeecs14796_10261
042 |a dc 
100 1 0 |a Santoso, Noviyanti  |e author 
100 1 0 |e contributor 
700 1 0 |a Wibowo, Wahyu  |e author 
700 1 0 |a Hikmawati, Hilda  |e author 
245 0 0 |a Integration of synthetic minority oversampling technique for imbalanced class 
260 |b Institute of Advanced Engineering and Science,   |c 2019-01-01. 
500 |a https://ijeecs.iaescore.com/index.php/IJEECS/article/view/14796 
520 |a In the data mining, a class imbalance is a problematic issue to look for the solutions. It probably because machine learning is constructed by using algorithms with assuming the number of instances in each balanced class, so when using a class imbalance, it is possible that the prediction results are not appropriate. They are solutions offered to solve class imbalance issues, including oversampling, undersampling, and synthetic minority oversampling technique (SMOTE). Both oversampling and undersampling have its disadvantages, so SMOTE is an alternative to overcome it. By integrating SMOTE in the data mining classification method such as Naive Bayes, Support Vector Machine (SVM), and Random Forest (RF) is expected to improve the performance of accuracy. In this research, it was found that the data of SMOTE gave better accuracy than the original data. In addition to the three classification methods used, RF gives the highest average AUC, F-measure, and G-means score. 
540 |a Copyright (c) 2018 Institute of Advanced Engineering and Science 
540 |a http://creativecommons.org/licenses/by-nc/4.0 
546 |a eng 
690 |a Statistics, data mining 
690 |a imbalanced class, smote, data mining, accuracy 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n Indonesian Journal of Electrical Engineering and Computer Science; Vol 13, No 1: January 2019; 102-108 
786 0 |n 2502-4760 
786 0 |n 2502-4752 
786 0 |n 10.11591/ijeecs.v13.i1 
787 0 |n https://ijeecs.iaescore.com/index.php/IJEECS/article/view/14796/10261 
856 4 1 |u https://ijeecs.iaescore.com/index.php/IJEECS/article/view/14796/10261  |z Get fulltext