K-means cluster analysis, Advanced Statistics

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 

Posted Date: 7/30/2012 1:31:04 AM | Location : United States







Related Discussions:- K-means cluster analysis, Assignment Help, Ask Question on K-means cluster analysis, Get Answer, Expert's Help, K-means cluster analysis Discussions

Write discussion on K-means cluster analysis
Your posts are moderated
Related Questions
Occam's razor  is an early statement of the parsimony principle, which was given by William of Occam (1280-1349) namely 'entia non sunt multiplicanda praeter necessitatem'; which m

Observational study   is the study in which the objective is to discover cause-and-effect relationships but in which it is not feasible to use the controlled experimentation, in th

Least significant difference test is an approach to comparing a set of means which controls the family wise error rate at some specific level, let's assume it to be α. The hypothe

Zero-inflated Poisson regression is  the model for count data with the excess zeros. It supposes that with probability p the only possible observation is 0 and with the probabilit

You have probably noticed by now that some of the statements of necessary and sufficient conditions sound more natural than others. For example it seems more natural to express "We

The measure of the degree to which the particular model differs from the saturated model for the data set. Explicitly in terms of the likelihoods of the two models can be defined a

Misspecification  is the term is applied to describe the assumed statistical models which are incorrect for one of the several of reasons, for instance, using the wrong probability

The analysis of data which are the functions observed continuously, for instance, functions of time. Basically a collection of statistical techniques or methods for answering quest

Data theory is anxious with how observations are transformed into data which can be analyzed. Data are thus viewed as the theory laden in the sense that the observations can be giv

Response surface methodology (RSM): The collection of the statistical and mathematical methods useful for improving, developing, and optimizing processes with significant applicat