Implement a simple k-means method, Applied Statistics

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.

Posted Date: 4/1/2013 5:55:54 AM | Location : United States







Related Discussions:- Implement a simple k-means method, Assignment Help, Ask Question on Implement a simple k-means method, Get Answer, Expert's Help, Implement a simple k-means method Discussions

Write discussion on Implement a simple k-means method
Your posts are moderated
Related Questions
the president of a certain firm concerned about the safety record of the firms employee sets aside $50 million a year for safety education. the firms accountant believes that more

Lorenz Curve   It is a graphic method of measuring dispersion. This curve was devised by Dr. Max o Lorenz a famous statistician.  He used this technique for wealth it i

give me question on mean is the aimplest average to understand and easy to compute

Your organization purchases bottles of a popular commercial solvent for resale.  Each bottle is labeled as containing 32 fluid ounces of the solvent.  Your cont

Make a decision about the given claim. Use only the rare event rule, and make subjective estimates to determine whether events are likely. For example, if the claim is that a coi


The range of actuator design parameters have been provisionally assessed and are presented in Table (3). You are required to determine the following parameters: The circumfer

it is said that management is equivalent to decision making? do you agree? explain

For the data analysis project, you will address some questions that interest you with the statistical methodology we are learning in class.   You choose the questions; you decide h

Zinc is a trace element and it is important in wound healing, building up the immune system and DNA synthesis. The data in Table 1 represents the zinc intake (in milligrams) for a