K-means cluster analysis, Advanced Statistics

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 

Posted Date: 7/30/2012 1:31:04 AM | Location : United States







Related Discussions:- K-means cluster analysis, Assignment Help, Ask Question on K-means cluster analysis, Get Answer, Expert's Help, K-means cluster analysis Discussions

Write discussion on K-means cluster analysis
Your posts are moderated
Related Questions
when there is tie in sequencing then what we do

The Null Hypothesis - H0:  There is no heteroscedasticity i.e. β 1 = 0 The Alternative Hypothesis - H1:  There is heteroscedasticity i.e. β 1 0 Reject H0 if |t | > t = 1.96

This graph for Cross Correlation Function for RES1, RES1 shows that there is possibly negative autocorrelation as there are alternating spikes; also the first spike is negative whi

A name sometimes given to the type of diagram generally used in meta-analysis, in which point estimates and confidence intervals are displayed for all the studies included in the a

a shop is selling laptops at regular price and at half price.If the laptops are regular price a day they will be at regular price tha day after with proba 2/3, if the laptops are a

wat iz z difference b/n logistic regression and multiple regression analysis /

Information theory: This is the branch of applied probability theory applicable to various communication and signal processing problems in the field of engineering and biology. In

Tracking is the term sometimes used in the discussions of data from the longitudinal study, to describe the ability to predict the subsequent observations from previous values. In

Population pyramid : The diagram designed to show the comparison of the human population by sex and age at a given instant time, consisting of a pair of the histograms, one for eve

Geometric distribution: The probability distribution of the number of trials (N) before the first success in the sequence of Bernoulli trials. Specifically the distribution is can