Sizes of the clusters influence-k-means program, Programming Languages

Assignment Help:

Objective: to show if the sizes of the clusters influence the communication cost among a set on nodes

Write a K-Means  program  for a set of M points randomly distributed on an NxN  plane and a randomly distributed  of K cluster centers. The program should be completely documented.

The points on the plane are identified with integers 1, 2, 3, . . . M

The frequency of communication among pair of points (i, j) is f(i, j) =floor (abs(i - j))/2  for 0< i , j < M + 1.

f(i, j) =f(j, i) and the path of communication from i to j is the same with the path of communication from  j  to i.

The transmission distance of a center point is D = sqr (5)N/2, therefore any two center points that  are In distance  less or equal to D should  be connected and thus forming a 'backbone network'.  Construct an all short path table to be used for communication of center points by using a shortest path between them.

There are three types of communications that can affect the workload of a cluster center.

a. Inter-cluster communications (two points i, j that belong to the same cluster communicate via their  center point)

b. Intra -cluster communication (two points i, j that belong to two different clusters  communicate  as follows: point i with its cluster center , to the cluster center that j belongs to following a shortest path in the backbone network and from this cluster center to the point i)

c.  The 'door matt effect' on a cluster center when it participates as a stepping stone in a communication of points which neither one of the points belong to the cluster of the door matt center.

Run your program for each one of the K values below;

K=3, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24,25, 30

       N = 25x25                                           

      M = 100 the 100 points should be randomly distributed and remain the same for all runs.

      For each run calculate the communication cost for each one of the K centers. Tabulate the results for each run and plot then , i.e. 14 pots, one per run

Use the tables or the plots to compare the workloads of the K centers

What I am looking for is the correlation (if any) between the workloads of the centers and sizes of the clusters

Submit your program , the input to it, the various backbone networks (14 0f them), the tables (14 of them) and the plots (14 of them)

Also write a comprehensive report with your observations, justifications and comments.


Related Discussions:- Sizes of the clusters influence-k-means program

Linear time algorithm to find longest repeat prefix, A non-empty string β i...

A non-empty string β is called a repeat pre x of a string s if ββ is a pre x of s. Give a linear time algorithm to find the longest repeat prefix of s. Hint: Think of using lca

Computes the least squares fit of the data, Your solution to the problem sh...

Your solution to the problem should be housed within a while loop, which allows the grader to test your solution repeatedly without having to re-execute the script, as shown in cla

Arithmetic involving matrices, We next require taking a look at arithmetic ...

We next require taking a look at arithmetic involving matrices. We'll begin with addition and subtraction of two matrices. Therefore, assume that we have two n x m matrices, which

Shell script display time for user which is idle, Normal 0 fals...

Normal 0 false false false EN-US X-NONE X-NONE MicrosoftInternetExplorer4

.net class libraries in .net, .NET class libraries in .NET Whether you c...

.NET class libraries in .NET Whether you create value in C# or VB.NET, you are getting in touch with techniques in the same .NET category your local library. The same .NET str

Double roots, This case will lead to similar problem that we've had all oth...

This case will lead to similar problem that we've had all other time we've run in double roots or double eigenvalues. We only find a single solution and will require a second solut

Program for read minimum three command line arguments, 1. Write a program t...

1. Write a program that reads a minimum of three command line arguments and displays the program's results. a. Command Line arguments:  i. The first argument will be the program

Data normalization and schema refinement, What is Data Normalization and Sc...

What is Data Normalization and Schema Refinement? Define briefly..

C language, write a program to find the output 1 ...

write a program to find the output 1 1 2 1 1 2 3 2 1 1 2 3 4 3 2 1 1 2 3 2 1 1 2 1 1

Need help for crowdsporting website development, CrowdSporting Website Deve...

CrowdSporting Website Development I want an advanced web developer to spend 1-2 days coding our newly designed website. It will be a single page scrolling site, fully designed a

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd