Implement a simple k-means method, Applied Statistics

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.

Posted Date: 4/1/2013 5:55:54 AM | Location : United States







Related Discussions:- Implement a simple k-means method, Assignment Help, Ask Question on Implement a simple k-means method, Get Answer, Expert's Help, Implement a simple k-means method Discussions

Write discussion on Implement a simple k-means method
Your posts are moderated
Related Questions
Suppose that before the minimum wage law change, the underlying mean number of part-time employees per Burger King Restaurant in New Jersey was 20.3. It was thought that the increa

Of the 6,325 kindergarten students who participated in the study, almost half or 3,052 were eligible for a free lunch program. The categorical variable sesk (1 == free lunch, 2 = n

for this proportion, use the +-2 rule of thumb to determine the 95 percent confidence interval. when asked if they are satisfied with their financial situation, .29 said "very sat

When the number of farmers growing wheat in Russia increases, the increase in world supply lowers the world price of wheat. Draw an appropriate diagram to analyze how this chang

The file Midterm Data.xls has a tab labeled "Income Data 2009". This data is collected income data from a sample of 400 people in 2009. Use a hypothesis test to see whether the av

what are the challenges affecting population census in developing countries

The data le for this assignment is brain-body-wts.txt, which lists the averages brain weights (gm) and body weights (kg) for a number of animal species. Your task is to t an appr

Variance The term variance was used to describe the square of the standard deviation by R.A.Fisher. The concept of variance is highly important in areas where it is possible to

a 100 squash balls are bounce from height of 100 inches with average height 30 inch with standard deviation 3/4 inch. a ball is fast if bounce above 32 inch. what is chance of gett

#regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual