Outline the k-means clustering algorithm

Assignment Help Management Information Sys
Reference no: EM13960766

(a) The K nearest neighbour (KNN) algorithm uses a distance metric to order the training data in relation to a given test example. Given a problem with data in the form (x1,.....,xn,Y), where are independent variables, and y is the dependent variable for prediction, describe and explain an approach to weighting the k nearest neighbours so that nearer neighbours are more important when producing the final predicted y value for a test example.

(b) Outline the k-means clustering algorithm for a set of data defined as vectors xi. Include a diagram to support your algorithm description.

(c) Explain why the k-means clustering algorithm does not guarantee finding the optimal cluster locations for any given application of the algorithm. Given this non-optimal clustering, what does this imply in terms of how k-means should be used in practice to ensure a good clustering?

Reference no: EM13960766

Questions Cloud

Factor analysis project : Factor analysis project, Prepare a report of the results of 2 and a half double-spaced pages along with tables associated with the results. Also include a log stating the steps used in the research, and any pertinent SPSS printouts.
Refer to the template spreadsheet provided : Refer to the template spreadsheet provided.  Stock A has an annualized volatility equal to 18% for which you have just written an out-of-the-money 26 week call option.  The risk free rate is 2% per annum and the strike price is $100.  There is anothe..
Revenue recognition : Revenue Recognition -  Suppose for purposes of this question that Cisco closes its books quarterly.  What journal entry or entries did Cisco make on October 31, 2011?
Prepare a tax memo on these issues : Prepare a tax memo on these issues (no more than four pages), to the tax partner on this engagement, Robert Holder.  You need to read Sections 382 and 108 and the related regulations to develop your solution.
Outline the k-means clustering algorithm : Outline the k-means clustering algorithm for a set of data defined as vectors xi. Include a diagram to support your algorithm description.
Explain the role of sensitivity analysis : Explain the role of sensitivity analysis in terms of understanding the properties of a model. In particular, address the issue of how variation in model inputs can be assessed, and why this is important.
Venture capital and private equity : Venture Capital and Private Equity. You have decided to begin a new venture and are armed with an understanding of the market for your products or services. How do you figure out what resources (financial and nonfinancial) you will need to bring that..
Principal technologies and standards for wireless networking : What are the principal technologies and standards for wireless networking, communications, and Internet access? Define Bluetooth, wi-fi, WiMax, and 3G and 4G networks. Will these standards last until 2025? How often should they be updated?
What is the consumer product safety database : What is the Consumer Product Safety Database (CPSC) What problems are raised by this database? Why is it so controversial? Why is data quality an issue? Name two entities in the CPSC database and describe some of their attributes.

Reviews

Write a Review

Management Information Sys Questions & Answers

  Purchasing and supply managementbased on your experience or

purchasing and supply managementbased on your experience or readings discuss the interaction between purchasing and

  Discuss the technology behind the system

Consider again the telemedicine system discuss the technology behind the system, and how it will be updated to keep pace with emerging technology

  How the emergence of agile methodologies has changed

Discuss how the emergence of agile methodologies has changed the IT system building model

  Features about windows 7 and windows 8

Read the article titled "The Windows XP upgrade question: Windows 7 or Windows 8?" You can also use the Internet or Strayer Library to research articles on features about Windows 7 and Windows 8

  Annual statement of cash laws

Almondine compny sold a computer for $50,000. the computer's original cost was $250,000, and the accumulated depreciation at the date of sale was $180,000. the sale of the computer should appear on almondine's annual statement of cash laws(indirec..

  What is your decision-making style

Read the Harvard Business Review article "Good Data Won't Guarantee Good Decisions." Then, complete the "What's Your Decision-Making Style"

  Hyper-social organization and erp systemsi need help in

hyper-social organization and erp systemsi need help in answering these questions about hyper-social organization and

  Communicate with a relational database

SQL Joins and Typical Query Usage - Communicate with a relational database to create tables, and query and manipulate data.

  Software application failuredo you agree with the notion

software application failuredo you agree with the notion the bigger the software application and the larger the cost

  Part 1- social media strategy designfor this phase of the

part 1- social media strategy designfor this phase of the project you are required to formulate a social media strategy

  Advantages and disadvantages of restricting user interfaces

Explain the advantages and disadvantages of restricting user interfaces. (User interfaces can often be restricted, limiting the user's ability to navigate to other areas of the system, or out of the system.

  Describe the main tasks performed by a web server

Describe the main task(s) performed by a Web server. Define the term "static Web page" and outline the disadvantages of building a Web site using such pages.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd