Comparison of Distance Methods in K-Means Algorithm for Determining Village Status in Bekasi District

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE) in 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT)

p. 270-276
https://doi.org/10.1109/icaiit.2019.8834604

Abstract

The Bekasi regency government reveals that there are around 21 slummy villages that are spread across seven sub-districts in Bekasi Regency. This indicates the need for funding and development assistance. Regarding to the village development, the government and regional government must provide information about which villages should be prioritized for development. The Village Potential or “Potensi Desa” statistics dataset in 2014 (Podes 2014) at Bekasi regency are released by the Central Bureau of Statistics in the form of unsupervised data consisting of 182 villages and 41 indicators. The Podes 2014 data is collected based on village specific levels in Indonesia by making the village a unit of analysis. By using the k-means algorithm, village status can be determined in Bekasi Regency. The data clustering using k-means is done by calculating the closest distance from a data to a centroid point. This study comparison of distance calculation methods on k-means using Manhattan, Euclidean and Chebychev will be made. Tests will be carried out using Davies Bouldin index and execution time. Based on the tests result, Euclidean metric has most optimum value of Davies Index and efficient execution time compared to Manhattan and Chebyshev metrics.

Keywords

This publication has 7 references indexed in Scilit:

Error Evaluation on K- Means and Hierarchical Clustering with Effect of Distance Functions for Iris Dataset
International Journal of Computer Applications, 2014
Comparison of data mining clustering algorithms
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
K-means with Three different Distance Metrics
International Journal of Computer Applications, 2013
Rapid development of applications in data mining
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2013
Comparative Analysis of K-Means and Fuzzy C-Means Algorithms
International Journal of Advanced Computer Science and Applications, 2013
Far efficient K-means clustering algorithm
Published by Association for Computing Machinery (ACM) ,2012
Performance Comparison of Incremental Kmeans and Incremental DBSCAN Algorithms
International Journal of Computer Applications, 2011

Cited by 3 articles