Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Schedule, Schedule Schedule is also used for the collection of primary ...

Schedule Schedule is also used for the collection of primary data. A schedule is a list of question. it is a device of obtaining answer to the questions in a form which is fill

Find the rank correlation coefficient, 1. Calculate the mean and mode of: ...

1. Calculate the mean and mode of: Central size 15 25 35 45 55 65 75 85 Frequencies 5 9 13 21 20 15 8 3 The following data shows the monthly expenditure of 80 students of

Pneumatic actuator design matrix, Pneumatic Actuator Design Matrix: The ra...

Pneumatic Actuator Design Matrix: The range of actuator design parameters have been provisionally assessed and are presented in Table. You are required to determine the following

JET Copies Case Problem, Read the “JET Copies” Case Problem on pages 678-67...

Read the “JET Copies” Case Problem on pages 678-679 of the text. Using simulation estimate the loss of revenue due to copier breakdown for one year, as follows: 1. In Excel, use a

HLT 362, What is an interaction? Describe an example and identify the varia...

What is an interaction? Describe an example and identify the variables within your population (work, social, academic, etc.) for which you might expect interactions?

..National Account- Descriptive Statistics, A country''s national accounts ...

A country''s national accounts are assumed to look as follows: GDP 1180 VAT and taxes 140 Commodity subsidies 60 Raw material and consumables 530 1. Calculate GVA 2. Calculate t

Lorenz curve , Lorenz Curve   It is a graphic method of measur...

Lorenz Curve   It is a graphic method of measuring dispersion. This curve was devised by Dr. Max o Lorenz a famous statistician.  He used this technique for wealth it i

Level of significance, Level of Significance: α The main purpose of hyp...

Level of Significance: α The main purpose of hypothesis testing is not to question the computed value of the sample statistic, but to make judgment about the difference between

Package design ratings, Consider the sample of 60 package design ratings gi...

Consider the sample of 60 package design ratings given in the table below.                                    A Sample of Package Design Ratings                 (Composite S

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd