Optimal number of cluster, Applied Statistics

Try different numbers of clusters in your program (K=2...15) and build a plot that shows the dependency between number K and value of RSS function on the last iteration. What is the optimal number of clusters K for a given data set? Did you get any empty clusters? What is the possible solution for this problem? Present output of your program in the report and give explanations.

Posted Date: 4/1/2013 6:04:55 AM | Location : United States







Related Discussions:- Optimal number of cluster, Assignment Help, Ask Question on Optimal number of cluster, Get Answer, Expert's Help, Optimal number of cluster Discussions

Write discussion on Optimal number of cluster
Your posts are moderated
Related Questions
Find the minimum constant workforce: ABC Company, a manufacturer of roofing supplies, has developed monthly forecasts for roofing tiles. The forecasted demand and the expected

In simple regression the dependent variable Y was assumed to be linearly related to a single variable X. In real life, however, we often find that a dependent variable may depend o

Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ...

The Case Study included information about the price for a full meal before and after the law change (in dollars).  Of interest is whether the differences in price for a full meal b

Test for Equality of Proportions For example, we may want to test whether the percentage of smokers (p 1 ) among the males equals the percentage of female smokers (p 2 ). W

Analysis of variance allows us to test whether the differences among more than two sample means are significant or not. This technique overcomes the drawback of the method used in

Simulation When decisions are to be taken under conditions of uncertainty, simulation can be used. Simulation as a quantitative method requires the setting up of a mathematical

Steps in ANOVA The three steps which constitute the analysis of variance are as follows: To determine an estimate of the population variance from the variance that exi

Weighted Harmonic Mean Weighted Harmonic Mean is calculated with the help of the following formula: WHM Case

Disadvantages The value of mode cannot always be determined. In some cases we may have a bimodal series. It is not capable of algebraic manipulations. For example, from t