Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Weighted arithmetic mean, Weighted Arithmetic Mean Another aspect...

Weighted Arithmetic Mean Another aspect to be considered is the importance we assign to each observation. The arithmetic mean as we calculated it so far gives equal

Descriptive statistics, find the average rate of increase in population whi...

find the average rate of increase in population which in the first decade has increased 20%.in the second 25% and in the third 44%

Financial payments technology, Suppose the money supply process is now repr...

Suppose the money supply process is now represented by the following function: where m measures the sensitivity of money supply with respect to the interest rate. (i) Us

Liner programming , Solve the following Linear Programming Problem using S...

Solve the following Linear Programming Problem using Simple method. Maximize Z= 3x1 + 2X2 Subject to the constraints: X1+ X2 = 4 X1 - X2 = 2 X1, X2 = 0

Use of calculators in statistics, In recent years a number of calculators a...

In recent years a number of calculators are available for doing statistical calculations over and above the usual addition, subtraction, multiplication and division. The fx-82 mode

Find the optimal adaptive meshes for a skewed beta density, Show that the I...

Show that the ISB in a bin containing the origin of the double exponen-tial density, f(x) = exp(-|x|)/2, is O(h 3 ); hence, the discontinuity in the derivative of f does not have a

Atmospheric circulation and precipitation, (a) Elevation (m)...

(a) Elevation (m) 0 400 800 1200 1600 2000 2400 2800 3200 4000 480

Job application, .what job can you after offering that course

.what job can you after offering that course

Muti linear regression model problem, Muti linear regression model problem ...

Muti linear regression model problem An investigator is studying the relationship between weight (in pounds) and height (in inches) using data from a sample of 126 high school

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd