K-means cluster analysis, Advanced Statistics

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 

Posted Date: 7/30/2012 1:31:04 AM | Location : United States







Related Discussions:- K-means cluster analysis, Assignment Help, Ask Question on K-means cluster analysis, Get Answer, Expert's Help, K-means cluster analysis Discussions

Write discussion on K-means cluster analysis
Your posts are moderated
Related Questions
Leaps-and-bounds algorithm is an algorithm which is used to ?nd the optimal solution in problems which might have a large number of possible solutions. Begins by dividing the poss

The Null Hypothesis - H0: Model does not fit the data i.e. all slopes are equal to zero β 1 =β 2 =...=β k =  0 The Alternative Hypothesis - H1:  Model does fit the data i.e. at

The particular projection which an investigator believes is most likely to give an accurate prediction of the future value of some process. Commonly used in the context of the anal

need answers to questions in book advanced and multivariate statistical methods

importance of mathamatical expection in business

Reasons for screening data     Garbage in-garbage out     Missing data          a. Amount of missing data is less crucial than the pattern of it. If randomly

There are two periods. You observe that Jack consumes 100 apples in period t = 0, and 120 apples in period t = 1. That is, (c 0 ; c 1 ) = (100; 120) Suppose Jack has the util

Consolidated Standards for Reporting Trials (CONSORT) statement : The protocol for reporting the results of the clinical trials. The core contribution of the statement comprises of

Matching distribution is  a probability distribution which arises in the following manner. Suppose that the set of n subjects, numbered 1; . . . ; n respectively, are arranged in

Partial least squares is an alternative to the multiple regressions which, in spite of using the original q explanatory variables directly, constructs the new set of k regressor v