Cluster analysis and related techniques in medical research

1 March 1992

journal article
review article
Published by SAGE Publications in Statistical Methods in Medical Research

Vol. 1 (1), 27-48
https://doi.org/10.1177/096228029200100103

Abstract

In this paper we review methods of cluster analysis in the context of classifying patients on the basis of clinical and/or laboratory type observations. Both hierarchical and non-hierarchical methods of clustering are considered, although the emphasis is on the latter type, with particular attention devoted to the mixture likelihood-based approach. For the purposes of dividing a given data set into g clusters, this approach fits a mixture model of g components, using the method of maximum likelihood. It thus provides a sound statistical basis for clustering. The important but difficult question of how many clusters are there in the data can be addressed within the framework of standard statistical theory, although theoretical and computational difficulties still remain. Two case studies, involving the cluster analysis of some haemophilia and diabetes data respectively, are reported to demonstrate the mixture likelihood-based approach to clustering.

Keywords

This publication has 28 references indexed in Scilit:

On the Choice of Starting Values for the EM Algorithm in Fitting Mixture Models
Journal of the Royal Statistical Society: Series D (The Statistician), 1988
Classification of Parasiticide by Cluster Analysis
The British Journal of Psychiatry, 1987
Exploratory Projection Pursuit
Journal of the American Statistical Association, 1987
Projection Pursuit
The Annals of Statistics, 1985
Estimation of Allocation Rates in a Cluster Analysis Context
Journal of the American Statistical Association, 1985
On the Convergence Properties of the EM Algorithm
The Annals of Statistics, 1983
A New Test for Multivariate Normality and Homoscedasticity
Technometrics, 1981
Bootstrap Methods: Another Look at the Jackknife
The Annals of Statistics, 1979
Numerical classification applied to certain Jamaican eocene nummulitids
Mathematical Geology, 1971
Hierarchical Grouping to Optimize an Objective Function
Journal of the American Statistical Association, 1963

Cited by 138 articles