Implement a simple k-means method, Applied Statistics

Assignment Help:

There exists an unclassified data set with hidden data structures in it. The task in this assignment is to perform comprehensive Cluster Analysis in order to reveal the structures and similar data groups.

1. Implement a simple K-means method, which is able to handle real values data in attributes. Also you need to add functionality in your program that allows utilization of Euclidean, City Block, Euclidean Squared and Chebyshev distances. You are free to use any kind of weights (for feature or data instance) in the program if necessary.

2. Find unlabeled data set test.txt and initial centroids data set centroids.txt in the archive, both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set includes 350 samples and the initial centroids set consists of 15 samples. Data instances in both files have 90 attributes.


Related Discussions:- Implement a simple k-means method

Estimate the values of the dependent variable, 1. Suppose you are estimatin...

1. Suppose you are estimating the imports (from both the U.S. mainland and foreign countries) of fuels and petroleum products in Hawaii (the dependent variable). The values of the

Calculate the current ratio and quick ratio, You were recently hired by E&T...

You were recently hired by E&T Boats, Inc. to assist the company with its financial planning and to evaluate the company's performance.  E&T Boats, Inc. builds and sells boats to o

Simple linear regression model, A study was conducted to determine the amou...

A study was conducted to determine the amount of heat loss for a certain brand of thermal pane window. Three different windows were randomly subjected to each of three different ou

Multiple correspondence analysis, Correspondence analysis is an exploratory...

Correspondence analysis is an exploratory technique used to analyze simple two-way and multi-way tables containing measures of correspondence between the rows and colulnns of an

Regression analysis , The data used is from a statistical software Minitab;...

The data used is from a statistical software Minitab; London.MPJ is the file that consists of 1519 households drawn from 1980 - 1982 British Family Expenditure Surveys. Data that i

Harmonic mean, The Harmonic Mean is based on the reciprocals of numbers ave...

The Harmonic Mean is based on the reciprocals of numbers averaged. It is defined as the reciprocal of the arithmetic mean of the reciprocal of the given individual observations. Th

Calculate the normal loss and abnormal loss, Chemical processors manufactur...

Chemical processors manufacture wondercool using two processes- mixing and distillation. The following details relate to the distillation process for a period. No opening work i

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd