Implement a simple k-means method, Applied Statistics

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.

Posted Date: 4/1/2013 5:55:54 AM | Location : United States







Related Discussions:- Implement a simple k-means method, Assignment Help, Ask Question on Implement a simple k-means method, Get Answer, Expert's Help, Implement a simple k-means method Discussions

Write discussion on Implement a simple k-means method
Your posts are moderated
Related Questions
Grid is the set of pairs {1, 2, 3, 4} x {1, 2, 3, 4}. Image is the power set of Grid. An element of Image is a subset of Grid and can be represented by a diagram on a 4 by 4

#questionMaximize Z= 3x1 + 2X2 Subject to the constraints: X1+ X2 = 4 X1 - X2 = 2 X1, X2 = 0..

who invented the chi square test and why? what is central chi square and non central chi square test? what is distribution free statistics? what are the conditions when the chi squ

discuss the mathematical test of adequacy of index number of formulae. prove algebraically that the laspeyre, paasche and fisher price index formulae satisfies this test. What is

Definition of Central Tendency The central tendency of a variable means a typical value around which other values tend to concentrate which can be measured. Such concentration

Make a decision about the given claim. Do not use any formal procedures and exact calculation. Use only the rare event rule. Claim: A coin favors head when tossed, and there

Old Faithful Geyser in Yellowstone National Park derives its names and fame from the regularity (and beauty) of its eruptions. Rangers usually post the predicted times of eruptions

find the average rate of increase in population which in the first decade has increased 20%.in the second 25% and in the third 44%

Lifts usually have signs indicating their maximum capacity. Consider a sign in a lift that reads "maximum capacity 1400kg or 20 persons". Suppose that the weights of lift-users are

The Truly Canadian Restaurant stocks a private red table wine that it purchases from a local winery in the Niagara Falls region. The daily demand for the wine at the restaurant is