K-means cluster analysis, Advanced Statistics

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned, if suitable, to a different cluster in an attempt to optimize some predefined numerical criterion that measures in some sense the 'quality' of cluster solution. Several such clustering criteria have been suggested, but the most usually used arise from considering the features of the within groups, between groups and whole matrices of sums of squares and the cross products (W, B, T) which can be described for every partition of the observations into the particular number of groups. The two most ordinary of the clustering criteria developing from these matrices are given as follows

minimization of trace W

minimization of determinant W

The first of these has tendency to produce the 'spherical' clusters, the second to produce clusters that all have same shape, though this will not necessarily be spherical in shape. 

 

Posted Date: 7/30/2012 1:31:04 AM | Location : United States







Related Discussions:- K-means cluster analysis, Assignment Help, Ask Question on K-means cluster analysis, Get Answer, Expert's Help, K-means cluster analysis Discussions

Write discussion on K-means cluster analysis
Your posts are moderated
Related Questions
Profile plots  is a technique of representing the multivariate data graphically. Each of the observation is represented by a diagram comprising of a sequence of equispaced vertical

Collective risk models : The models applied to insurance portfolios which do not create direct reference to the risk characteristics of individual members of the portfolio when des

Computer-assisted interviews : A method or technique of interviewing subjects in which the interviewer reads the question from the computer screen instead of the printed page, and

Literature controls : The patients with the disease of interest who have received, in the past, one of two treatments under the investigation, and for whom the results have been pu

Mardia's multivariate normality test is a test that a set of the multivariate data arise from the multivariate normal distribution against departures due to the kurtosis. The test

Consolidated Standards for Reporting Trials (CONSORT) statement : The protocol for reporting the results of the clinical trials. The core contribution of the statement comprises of

Can I use ICC for this kind of data? Wind Month Day Temp(DV) 7.4 5 1 67 8 5 2 72 12.6 5 3 74 11.5 5 4 62 I am taking temp as the dependent variable. There are many more values.

Yate s' continuity correction : When the testing for independence in contingency table, a continuous probability distribution, known as chi-squared distribution, is used as the app

Hanging rootogram is   he diagram comparing the observed rootogram with the ?tted curve, in which dissimilarities between the two are displayed in relation to the horizontal axis,

Hello , I have a business statistic HW that is due after 23 hours exactly for now . I need full and details answers please , plus they must be in a done and typed in a word or exce