Estimating and Controlling Overlap in Gaussian Mixtures for Clustering Methods Evaluation
Published: 8 April 2020
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems , Volume 28, pp 183-211; https://doi.org/10.1142/s0218488520500087
Abstract: The ad hoc nature of the clustering methods makes simulated data paramount in assessing the performance of clustering methods. Real datasets could be used in the evaluation of clustering methods with the major drawback of missing the assessment of many test scenarios. In this paper, we propose a formal quantification of component overlap. This quantification is derived from a set of theorems which allow us to derive an automatic method for artificial data generation. We also derive a method to estimate parameters of existing models and to evaluate the results of other approaches. Automatic estimation of the overlap rate can also be used as an unsupervised learning approach in data mining to determine the parameters of mixture models from actual observations.
Keywords: models / estimation / Quantification / Overlap / Artificial Data / Clustering Methods Evaluation
Scifeed alert for new publicationsNever miss any articles matching your research from any publisher
- Get alerts for new papers matching your research
- Find out the new papers from selected authors
- Updated daily for 49'000+ journals and 6000+ publishers
- Define your Scifeed now
Click here to see the statistics on "International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems" .