Implement a simple k-means method, Applied Statistics

Assignment Help:

You are given an unclassified data set that contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis that reveals this structure and groups similar data instances together.

1. Implement a simple K-means method that can handle real-valued attributes. Your program must also support the Euclidean, City Block, Squared Euclidean and Chebyshev distances. You are free to use any kind of weights (for features or data instances) in the program if necessary.
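A minimal sketch of what task 1 asks for, written in plain Python, might look like the following. It is an illustration under stated assumptions, not a reference solution: the names `DISTANCES`, `kmeans`, `metric` and `max_iter` are hypothetical, no weights are applied, and the centroid update is the arithmetic mean for every metric. The mean is the natural centre only for the Squared Euclidean objective; for City Block, for example, a coordinate-wise median is the usual alternative.

```python
import math

# The four distances named in the assignment. Each takes two
# equal-length sequences of real values.
def euclidean_squared(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def euclidean(a, b):
    return math.sqrt(euclidean_squared(a, b))

def city_block(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

# Hypothetical lookup table so the metric can be chosen by name.
DISTANCES = {
    "euclidean": euclidean,
    "city_block": city_block,
    "euclidean_squared": euclidean_squared,
    "chebyshev": chebyshev,
}

def kmeans(points, centroids, metric="euclidean", max_iter=100):
    """Assign points to the nearest centroid, then recompute centroids.

    Assumption: centroids are updated with the arithmetic mean
    regardless of the chosen metric.
    """
    dist = DISTANCES[metric]
    centroids = [list(c) for c in centroids]  # do not mutate the input
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the closest centroid for each point.
        new_labels = [
            min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        # Update step: mean of the members of each cluster.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

Calling `kmeans(points, centroids, metric="chebyshev")` switches the distance without touching the loop. Keeping an empty cluster's old centroid is one design choice among several; re-seeding it from a distant point is another.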

2. Find the unlabeled data set test.txt and the initial centroids data set centroids.txt in the archive. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set contains 350 samples and the initial centroids set contains 15 samples. Each data instance in both files has 90 attributes.
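Reading the two files could be sketched as below. The exact delimiter is not stated in the assignment, so this sketch assumes one sample per line with values separated by whitespace or commas, and it tolerates literal square brackets around a line. The function names `parse_rows` and `load_rows` and the parameter `n_attributes` are hypothetical.

```python
def parse_rows(text, n_attributes=90):
    """Parse one sample per line into a list of float lists.

    Assumptions: values are separated by whitespace or commas,
    and a line may be wrapped in square brackets.
    """
    rows = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        cleaned = line.strip().strip("[]").replace(",", " ")
        if not cleaned.strip():
            continue  # skip blank lines
        values = [float(token) for token in cleaned.split()]
        if len(values) != n_attributes:
            raise ValueError(
                f"line {line_no}: expected {n_attributes} values, got {len(values)}"
            )
        rows.append(values)
    return rows

def load_rows(path, n_attributes=90):
    """Read a file and parse it with parse_rows."""
    with open(path) as handle:
        return parse_rows(handle.read(), n_attributes)

# Example usage with the file names from the assignment:
# samples = load_rows("test.txt")         # expected: 350 rows
# centroids = load_rows("centroids.txt")  # expected: 15 rows
```

Checking `len(samples) == 350` and `len(centroids) == 15` after loading is a cheap way to confirm that the files were parsed as the assignment describes.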




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd