Determine the ideal number of clusters

Assignment Help Computer Engineering
Reference no: EM131221035

Cluster Analysis

Attached Files:

• Week 4 Cluster Data.xlsx

Included with this assignment is an Excel spreadsheet that contains data with two dimension values.

The purpose of this assignment is to demonstrate steps performed in a K-Means Cluster analysis.

Review the "k-MEANS CLUSTERING ALGORITHM" section in Chapter 4 of the Sharda et. al. textbook for additional background.

Use Excel to perform the following data analysis.

1. Plot the data on a scatter plot.
2. Determine the ideal number of clusters.
3. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)
4. Using a standard distance formula measure the distance from each data point to each center point.
5. Assign each data point to an initial cluster region based on closeness.
6. For each cluster calculate new center points.
7. Repeat steps 4 through 6.

You will use Excel to help with calculations, but only standard functions should be used (i.e. don't use a plug-in to perform the analysis for you.) You need to show your work doing this analysis the long way. If you were to repeat steps 4 through 6, what will likely happen with the cluster centroids? The rubric for this assignment can be viewed when clicking on the assignment link.

Here is a link to an example spreadsheet using a smaller data set. It contains two tabs. The first tab is the raw data. The second tab contains the analysis that was performed. Make sure that you use a different starting center points from the example.

Attachment:- cluster_analysis-week4.xlsx

Reference no: EM131221035

Questions Cloud

What is net income for a merchandising company : What is net income for a merchandising company with the following data? Sales revenue $55000. Utilities $1100 . Inventory on Dec. 31st $9838. Inventory on Jan 1 $12000. Rent for shop $3049. Sales commissions $4300 . Purchases of Merchandise $36474 . ..
Latest annual income statement : A company reports sales of $2,245,000 in its latest annual income statement. The sales, however, are reported for a year that consists of 53 weeks. You want to project sales for the coming year based on the most recently reported year but you now nee..
Which is not a direct labor cost : Which of the following is not a direct labor cost? Which of the following is an indirect cost?
Banks composition of assets and liabilities : How does your bank's composition of assets and liabilities differ from averages for U.S. commercial banks? What explains these differences?
Determine the ideal number of clusters : Determine the ideal number of clusters. Choose random center points for each cluster. (Note: Each student will select a different random set of centroids.) Using a standard distance formula measure the distance from each data point to each center ..
Research paper on change in human resource development : For the final assignment of this course, you will write a research paper on change in a human resource development (HRD) organization that you work for, or would like to work for.
Write an evaluator for the language of sums and products : We can use structures to represent syntax trees in Prolog. For example, the expression, (3 * 4) + (5 + 6) can be represented by the syntax tree: sum(prod(num(3), num(4)), sum(num(5), num(6))) Write an evaluator for the language of sums and products
Country-el salvador : 1. The relationship over the last 5 years between your country's trade picture and the country's currency exchange rate. 2. The relationship over the last 5 years between your country's Foreign Direct Investment picture and the country's currency e..
Behaviorin extended organization : ApplyingFigure 14.2to what the video tells us about Numi'spolitical behaviorin this extended organization, summarize the role and nature of ethical considerations in this behavior.

Reviews

Write a Review

Computer Engineering Questions & Answers

  Computing roots of the function f

The best known iterative method in order compute the roots of the function f (that is, the x-values for which f(x) is 0) is Newton-Raphson approximation.

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Define smaller industry-specific software developers

Does Microsoft's entry into industry-specific applications signal the end for smaller industry-specific software developers? What changes in strategy by such developers are necessary to compete with Microsoft.

  Inheritance is a method in object-oriented programming

Inheritance is a method in object-oriented programming that you derive new classes from existing classes in your code.

  Which jobs require specific hardware knowledge

Which jobs require specific hardware knowledge? Which jobs imply knowledge of computer hardware? Is there any correlation between the required hardware knowledge and the company or its location?

  Write a statement of scope

Write a statement of scope that describes the software. Be sure your statement of scope is bounded. If you're unfamiliar with robots, do a bit of research before you begin writing.

  Your boss has just heard regarding some nefarious computer

your boss has just heard about some nefarious computer activities called ping sweeps and port scans. he wants to know

  Educating about computer viruses and malware

The University of Calgary provides a senior-level computer science course known as, “Computer Viruses and Malware.” The course taught the students how to write the viruses, worms, and Trojan Horses. It also describes the history of the computer vi..

  In brief explain the given options for ending a computing

briefly describe the following options for ending a computing session log off option switch user option sleep option

  Questionq1 assume that the ith operation on a data

questionq1 assume that the ith operation on a data structure takes thetaui time where ui is the number of units in the

  From your knowledge and experience how are computer

from your knowledge and experience how are computer forensic investigators in todays world of complex technology are

  What is big-o running time of subsequent code fragment

What is the Big-O running time of the subsequent code fragment - If an ArrayList is passed.  Describe your answer.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd