K-means cluster analysis, Advanced Statistics

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 

Posted Date: 7/30/2012 1:31:04 AM | Location : United States







Related Discussions:- K-means cluster analysis, Assignment Help, Ask Question on K-means cluster analysis, Get Answer, Expert's Help, K-means cluster analysis Discussions

Write discussion on K-means cluster analysis
Your posts are moderated
Related Questions
Generalized principal components analysis: The non-linear version of the principal components analysis in which the goal is to determine the non-linear coordinate system which is

difference between histogram and historigram

Meta-analysis is the collection of techniques whereby the results of two or more independent studies are statistically combined to yield the overall answer to a question of intere

2 jobs n machines,graphical method,how to determine which job should proceed first on each machine

Jonckheere Terpstra test  is the test for detecting particular types of departures from the independence in a contingency table in which both the row and column categories contain

Log-linear models is the models for count data in which the logarithm of expected value of a count variable is modelled as the linear function of parameters; the latter represent

Data which occur when failure period is recorded which are dependent. Such type of data can arise in number contexts, for instance, in epidemiological cohort studies in which th

Mauchly test is a test which a variance-covariance matrix of pair wise differences of responses in the set of longitudinal data is the scalar multiple of identity matrix, a proper

Categorizing continuous variables : A practice which involves the conversion of the continuous variables into the series of the categories, which is common in the field of medical

importance of mathamatical expection in business