You are working with the text-only light edition of "H.Lohninger: Teach/Me Data Analysis, Springer-Verlag, Berlin-New York-Tokyo, 1999. ISBN 3-540-14743-8".

Cluster Analysis

 

"Cluster analysis" is the generic term for multivariate methods that attempt to find structures ("clusters") in the data. These methods are mostly based on calculating the distances between observations in multidimensional data space. Basically, cluster analysis answers one of the following three questions:
 

The plot at right shows 150 observations of three different kinds of flowers (50 each), which clearly form two clusters. Cluster analysis can help to find such clusters automatically.
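As a rough illustration of how distances between observations can be turned into clusters automatically, here is a minimal Python sketch. It is not the method used for the plot: the six points are made-up values rather than the flower data, and the helper names `euclidean` and `threshold_clusters` as well as the threshold `max_dist=1.0` are assumptions chosen for this example. The rule used here (link any two observations closer than a threshold, then collect the linked groups) is only one simple distance-based approach among many.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two observations of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def threshold_clusters(points, max_dist):
    """Assign a cluster label to every point.

    Two points closer than max_dist are linked, and links are followed
    transitively, so each cluster is a connected group of neighbours.
    """
    labels = [None] * len(points)
    current = 0
    for start in range(len(points)):
        if labels[start] is not None:
            continue  # already assigned to an earlier cluster
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(len(points)):
                if labels[j] is None and euclidean(points[i], points[j]) < max_dist:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

# Made-up two-dimensional observations (hypothetical, not the flower data).
points = [(1.0, 1.1), (1.2, 0.9), (0.9, 1.0), (5.0, 5.2), (5.1, 4.9), (4.8, 5.0)]
labels = threshold_clusters(points, max_dist=1.0)
print(labels)
```

With these points the first three observations receive one label and the last three another. The outcome depends strongly on the chosen threshold, which is one reason why practical methods often avoid a single fixed cut-off.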


The results of a cluster analysis are often displayed as dendrograms, which show the multidimensional relationships as a two-dimensional line plot. In general, cluster analysis methods can be grouped into several categories:
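To make the dendrogram idea more concrete, the following Python sketch records the sequence of merges that a dendrogram would draw. It assumes single-linkage agglomerative clustering, which is only one of several possible methods and is not necessarily the one the text has in mind. The function name `single_linkage` and the four sample points are hypothetical. `math.dist` requires Python 3.8 or later.

```python
import math

def single_linkage(points):
    """Agglomerative clustering with single linkage.

    Returns the merges as (members_a, members_b, distance) in the order
    they occur; each merge corresponds to one join in a dendrogram, and
    the distance is the height at which the join is drawn.
    """
    clusters = [[i] for i in range(len(points))]
    merges = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                d = min(math.dist(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((clusters[a], clusters[b], d))
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)] + [merged]
    return merges

# Made-up observations (hypothetical): two close pairs that lie far apart.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 2.0)]
merges = single_linkage(points)
for left, right, height in merges:
    print(left, "+", right, "at distance", round(height, 3))
```

For these four points, observations 0 and 1 are joined first, then 2 and 3, and finally the two pairs are joined at a much larger distance. A large jump in merge distance of this kind is what shows up in a dendrogram as a long vertical gap between well-separated clusters.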
 


Last Update: 2005-Jan-25