Implement a simple k-means method, Applied Statistics

Assignment Help:

An unclassified data set contains hidden structure. The task in this assignment is to perform a comprehensive cluster analysis that reveals this structure and the groups of similar data.

1. Implement a simple K-means method that can handle real-valued attributes. Your program must also support the Euclidean, City Block, Squared Euclidean and Chebyshev distances. You are free to use any kind of weights (for features or data instances) in the program if necessary.

2. Find the unlabeled data set test.txt and the initial centroids data set centroids.txt in the archive. Both files have the following format: [attribute1_value attribute2_value ... attribute90_value]. The unlabeled data set contains 350 samples and the initial centroids set contains 15 samples. Data instances in both files have 90 attributes.
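
The two requirements above could be approached along the following lines. This is only a minimal sketch in plain Python, not a reference solution. The names `DISTANCES`, `load_points` and `kmeans` are made up for illustration. The loader assumes one data instance per line with whitespace-separated values, and it strips square brackets in case they appear literally in the files, which the format description does not make clear. The centroid update uses the arithmetic mean for every metric. That is the standard choice for Squared Euclidean distance and a simplification for the other three.

```python
import math


def euclidean_squared(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def euclidean(a, b):
    return math.sqrt(euclidean_squared(a, b))


def city_block(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical lookup table so the metric can be chosen by name.
DISTANCES = {
    "euclidean": euclidean,
    "euclidean_squared": euclidean_squared,
    "city_block": city_block,
    "chebyshev": chebyshev,
}


def load_points(path):
    """Read one data instance per line (assumed whitespace-separated reals)."""
    points = []
    with open(path) as handle:
        for line in handle:
            # Assumption: brackets, if present, only wrap the whole line.
            fields = line.strip().strip("[]").split()
            if fields:
                points.append([float(value) for value in fields])
    return points


def kmeans(points, centroids, metric="euclidean", max_iter=100):
    """Return (labels, centroids) after iterating until labels stop changing."""
    dist = DISTANCES[metric]
    centroids = [list(c) for c in centroids]  # work on a copy
    labels = None
    for _ in range(max_iter):
        # Assignment step: index of the nearest centroid for each point.
        new_labels = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: mean of the members; an empty cluster keeps its centroid.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

With the files from the archive, a call might look like `kmeans(load_points("test.txt"), load_points("centroids.txt"), metric="chebyshev")`. Feature or instance weights are not included here and could be added inside the distance functions if needed.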




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd