Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Problem set for logistic regression, (1) What values can the response varia...

(1) What values can the response variable Y take in logistic regression, and hence what statistical distribution does Y follow? The response variable can take the value of either

Cartogram or mapograph, Cartogram or Mapograph:   Statistical maps are a...

Cartogram or Mapograph:   Statistical maps are also used to represent data like density of population indifferent states in the country or different countries in the world or th

Measures of dispersion, Other Measures of Dispersion In this section, ...

Other Measures of Dispersion In this section, we look at relatively less used measures of dispersion like fractiles, deciles, percentiles, quartiles, interquartile range and f

Rank correlation, Rank Correlation Sometimes the characteristics whose ...

Rank Correlation Sometimes the characteristics whose possible correlation is being investigated, cannot be measured but individuals can only be ranked on the basis of the chara

Find the mean and standard deviation, Problem : A company supplying ele...

Problem : A company supplying electrical products, places a rush order for electric wires. Consignments of wires are to be sent immediately when they are available. Previous

Data project, Choose any published database from the internet or Bethel lib...

Choose any published database from the internet or Bethel library (such as those from the Census Bureau or any financial sites). You may opt to use one of the data files provided b

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd