Compute the ideal number of clusters

Assignment Help Computer Engineering
Reference no: EM131289255

Cluster Anaylsis

Included with this assignment is an Excel spreadsheet that contains data with two dimension values.

The purpose of this assignment is to demonstrate steps performed in a K-Means Cluster analysis.

Review the "k-MEANS CLUSTERING ALGORITHM" section in Chapter 4 of the Sharda et. al. textbook for additional background.

Use Excel to perform the following data analysis.

1. Plot the data on a scatter plot.
2. Determine the ideal number of clusters.
3. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)
4. Using a standard distance formula measure the distance from each data point to each center point.
5. Assign each data point to an initial cluster region based on closeness.
6. For each cluster calculate new center points.
7. Repeat steps 4 through 6.

You will use Excel to help with calculations, but only standard functions should be used (i.e. don't use a plug-in to perform the analysis for you.) You need to show your work doing this analysis the long way. If you were to repeat steps 4 through 6, what will likely happen with the cluster centroids? The rubric for this assignment can be viewed when clicking on the assignment link.

Here is a link to an example spreadsheet using a smaller data set. It contains two tabs. The first tab is the raw data. The second tab contains the analysis that was performed. Make sure that you use a different starting center points from the example.

Attachment:- Cluster_Data.xlsx

Reference no: EM131289255

Questions Cloud

What would you say about the potential benefit of encryption : If you were part of a government entity, what would you say about the potential benefits and drawbacks of encryption? Be sure to cite examples from credible sources to support you answers.
Work related internet usage policies : Most companies today have some type of restriction on Internet use and have policies in place, such as a Responsible Use of Technology policy.
Explain job analysis by hr manager : Explain job analysis by HR manager
Wi-fi upstream vs downstream : On wireless WANs, downstream data transmission is typically faster than upstream transmission. What types of limitations does this bring to an office environment? Why do you think this should be a consideration in WAN configuration?
Compute the ideal number of clusters : Determine the ideal number of clusters. Choose random center points (centroids) for each cluster. Using a standard distance formula measure the distance from each data point to each center point.
How can networking be used to increase personal effectivenes : How can networking be used to increase personal or organizational effectiveness? How can it work against personal or organizational effectiveness? In your opinion, is business networking ethical? Why or why not?
Integer together without doing proper type conversion : Part 1: Create a function that takes a string as a parameter and attempts to open a file with the string that is passed. Part 2: Create a function that takes a string as a parameter and attempts to use python's eval() function to evaluate that stri..
Develop five major goals for information technology plan : Develop at least five major goals for the information technology strategic plan. Speculate on three positive organizational impacts of implementing the strategic plan. Justify your response.
Determine the power dissipated in r1 and the current in r2 : For the circuit shown below, using an appropriate analysis method, determine the power dissipated in R1.- Using a suitable analysis method, determine the current in R2.

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd