K-means cluster analysis, Advanced Statistics


K-means cluster analysis is a method of cluster analysis in which, starting from an initial partition of the observations into K clusters, each observation is examined in turn and reassigned, if appropriate, to a different cluster in an attempt to optimize some predefined numerical criterion that measures, in some sense, the 'quality' of the cluster solution. Many such clustering criteria have been suggested, but the most commonly used arise from considering features of the within-groups, between-groups and total matrices of sums of squares and cross products (W, B, T), which can be defined for every partition of the observations into a particular number of groups. The two most common clustering criteria arising from these matrices are:

minimization of trace W

minimization of determinant W

The first of these tends to produce 'spherical' clusters; the second tends to produce clusters that all have the same shape, although that shape will not necessarily be spherical.
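The two criteria can be made concrete by computing W, B and T for a given partition. The following is a minimal sketch, not a full K-means procedure: it only evaluates trace W and determinant W for a fixed assignment of observations to groups, and leaves out the reassignment loop. The function names `scatter_matrices` and `criteria`, the toy data `X`, and the two label vectors are assumptions made up for illustration.

```python
import numpy as np

def scatter_matrices(X, labels):
    """Return (W, B, T) for the partition of the rows of X given by labels.

    T is the total sums-of-squares-and-cross-products matrix,
    W the pooled within-groups matrix, B the between-groups matrix.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)
    centered = X - grand_mean
    T = centered.T @ centered

    p = X.shape[1]
    W = np.zeros((p, p))
    B = np.zeros((p, p))
    for g in np.unique(labels):
        Xg = X[labels == g]
        mean_g = Xg.mean(axis=0)
        dev = Xg - mean_g                      # deviations from the group mean
        W += dev.T @ dev
        d = (mean_g - grand_mean)[:, None]     # column vector
        B += len(Xg) * (d @ d.T)
    return W, B, T

def criteria(X, labels):
    """Return (trace W, determinant W) for the given partition."""
    W, _, _ = scatter_matrices(X, labels)
    return np.trace(W), np.linalg.det(W)

# Hypothetical toy data: two visually obvious groups in the plane.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
              [5.0, 5.0], [5.0, 6.0], [6.0, 5.0]])
good = np.array([0, 0, 0, 1, 1, 1])   # partition matching the two groups
bad = np.array([0, 1, 0, 1, 0, 1])    # partition mixing the two groups

trace_good, det_good = criteria(X, good)
trace_bad, det_bad = criteria(X, bad)
print("good partition:", trace_good, det_good)
print("bad partition: ", trace_bad, det_bad)
```

Because T = W + B and T does not depend on the partition, minimizing trace W is the same as maximizing trace B. In this toy example both criteria are smaller for the partition that matches the two visible groups, which is the direction a reassignment step would try to move in.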

 



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd