Improved point center algorithm for K-Means clustering to increase software defect prediction

The k-means is a clustering algorithm that is often and easy to use. This algorithm is susceptible to randomly chosen centroid points so that it cannot produce optimal results. This research aimed to improve the k-means algorithm's performance by applying a proposed algorithm called point cente...

Full description

Saved in:
Bibliographic Details
Main Authors: Annisa, Riski (Author), Rosiyadi, Didi (Author), Riana, Dwiza (Author)
Other Authors: Bina Sarana Informatika University (Contributor), Master Program of Computer Science STMIK Nusa Mandiri (Contributor)
Format: EJournal Article
Published: Universitas Ahmad Dahlan, 2020-11-06.
Subjects:
Online Access:Get Fulltext
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 02480 am a22003013u 4500
001 IJAIN_484_ijain_v6i3_p328-339
042 |a dc 
100 1 0 |a Annisa, Riski  |e author 
100 1 0 |a Bina Sarana Informatika University  |e contributor 
100 1 0 |a Master Program of Computer Science STMIK Nusa Mandiri  |e contributor 
700 1 0 |a Rosiyadi, Didi  |e author 
700 1 0 |a Riana, Dwiza  |e author 
245 0 0 |a Improved point center algorithm for K-Means clustering to increase software defect prediction 
260 |b Universitas Ahmad Dahlan,   |c 2020-11-06. 
500 |a https://ijain.org/index.php/IJAIN/article/view/484 
520 |a The k-means is a clustering algorithm that is often and easy to use. This algorithm is susceptible to randomly chosen centroid points so that it cannot produce optimal results. This research aimed to improve the k-means algorithm's performance by applying a proposed algorithm called point center. The proposed algorithm overcame the random centroid value in k-means and then applied it to predict software defects modules' errors. The point center algorithm was proposed to determine the initial centroid value for the k-means algorithm optimization. Then, the selection of X and Y variables determined the cluster center members. The ten datasets were used to perform the testing, of which nine datasets were used for predicting software defects. The proposed center point algorithm showed the lowest errors. It also improved the k-means algorithm's performance by an average of 12.82% cluster errors in the software compared to the centroid value obtained randomly on the simple k-means algorithm. The findings are beneficial and contribute to developing a clustering model to handle data, such as to predict software defect modules more accurately. 
540 |a Copyright (c) 2020 Riski Annisa, Didi Rosiyadi, Dwiza Riana 
540 |a https://creativecommons.org/licenses/by-sa/4.0 
546 |a eng 
690 |a Algorithm, K-Means, Cluster, Centroid, Software defect 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n International Journal of Advances in Intelligent Informatics; Vol 6, No 3 (2020): November 2020; 328-339 
786 0 |n 2548-3161 
786 0 |n 2442-6571 
787 0 |n https://ijain.org/index.php/IJAIN/article/view/484/ijain_v6i3_p328-339 
856 4 1 |u https://ijain.org/index.php/IJAIN/article/view/484/ijain_v6i3_p328-339  |z Get Fulltext