Improving Density Peak Clustering by Automatic Peak Selection and Single Linkage Clustering
Open Access
- 14 July 2020
- Vol. 12 (7), 1168
- https://doi.org/10.3390/sym12071168
Abstract
Density peak clustering (DPC) is a density-based clustering method that has attracted much attention in the academic community. DPC works by first searching density peaks in the dataset, and then assigning each data point to the same cluster as its nearest higher-density point. One problem with DPC is the determination of the density peaks, where poor selection of the density peaks could yield poor clustering results. Another problem with DPC is its cluster assignment strategy, which often makes incorrect cluster assignments for data points that are far from their nearest higher-density points. This study modifies DPC and proposes a new clustering algorithm to resolve the above problems. The proposed algorithm uses the radius of the neighborhood to automatically select a set of the likely density peaks, which are far from their nearest higher-density points. Using the potential density peaks as the density peaks, it then applies DPC to yield the preliminary clustering results. Finally, it uses single-linkage clustering on the preliminary clustering results to reduce the number of clusters, if necessary. The proposed algorithm avoids the cluster assignment problem in DPC because the cluster assignments for the potential density peaks are based on single-linkage clustering, not based on DPC. Our performance study shows that the proposed algorithm outperforms DPC for datasets with irregularly shaped clusters.Keywords
Funding Information
- Ministry of Science and Technology, Taiwan (MOST 108-2221-E-155-013)
This publication has 29 references indexed in Scilit:
- Kernel methods for point symmetry-based clusteringPattern Recognition, 2015
- A Comprehensive Survey of Clustering AlgorithmsAnnals of Data Science, 2015
- Clustering by fast search and find of density peaksScience, 2014
- Semi-supervised cluster analysis of imaging dataNeuroImage, 2011
- Robust path-based spectral clusteringPattern Recognition, 2008
- Clustering aggregationACM Transactions on Knowledge Discovery From Data, 2007
- Survey of Clustering AlgorithmsIEEE Transactions on Neural Networks, 2005
- Computing Persistent HomologyDiscrete & Computational Geometry, 2004
- A maximum variance cluster algorithmIEEE Transactions on Pattern Analysis and Machine Intelligence, 2002
- Cluster Analysis and Stock Price ComovementCFA Magazine, 1980