Describe your implementation and experiment setup

Assignment Help Computer Engineering
Reference no: EM131919047

Assignment: Clustering

Your task for this assignment is to implement and evaluate the k-means clustering algorithm.

1. Implement the k-means clustering algorithm.

a. You can use any programming language that you are familiar with.

b. The program should be executable with at least 3 parameters: the name of the dataset file, k, and the name of the output file.

c. The output file should contain numerical class labels (formatted as one number per row) for all the records in the test dataset and report the sum squared error (SSE) in the last row.

d. You only need to handle numerical attributes (categorical attributes are not required).

2. Select two datasets from the UCI repository and evaluate the algorithm using SSE and another metric of your choice (e.g. BCubed precision and recall or Jaccard score if you have the class labels) with varying k. (I intend to run your implementation on the fisher iris dataset without the labels.

3. Write a brief report to:

a. Describe the datasets.

b. Describe your implementation and experiment setup, e.g. any preprocessing you performed on the dataset such as normalizing the attributes, distance metrics you used, etc.

c. Present the experiment results with varying k.

d. Discuss the insights and conclusions from your experiments.

4. This is an individual assignment.

5. Submission. You will upload two items to Canvas: your PDF report and a zip or tar file.

This zip/tar file must contain:

Your source files (include your name(s) in commented form at the top of all source files), the executable, a README file explaining how to compile/run your program, the output files for your test datasets.

Reference no: EM131919047

Questions Cloud

Design and operation of the new management accounting system : Write a report to senior management explaining how you, as the management accountant, may contribute to the design and operation of management accounting system
What is the probability that at least : If 7 people appear at random to give blood, what is the probability that at least one of them is Rh-Negative?
Calculate the expected average flow time : New Time Videos (NTV) is a new online video rental service. In the field, it is trying to compete by offering its customers access to all of the major.
Examine the number of parking spaces needed : Pizza Time Restaurants is building a new pizza place and needs to determine how big to make the various parts of its facility.
Describe your implementation and experiment setup : Describe your implementation and experiment setup, eg. any preprocessing you performed on dataset such as normalizing attributes, distance metrics you used etc.
How many customers on average can mike salon process : Assuming that the waiting area always has at least one customer in it, how many customers on average can Mike's salon process in a day.
Mean life of compact fluorescent light bulbs : If a light bulb manufacturing company wants to? estimate, with 95?% ?confidence, the mean life of compact fluorescent light bulbs to within ±175 hours
Identify and give examples of the steps that can be taken : Identify and give examples of the steps that can be taken at the functional level to improve Post's efficiency, product quality, an ability to innovate.
What was the average miles per hour per trip : What was the average miles per hour per trip? (Show work)

Reviews

Write a Review

Computer Engineering Questions & Answers

  Java program that allows a user to enter three words

design a Java program that allows a user to enter three words, and displays the appropriate three-letter acronym (constructed from the input) in all uppercase letters. If user enters more than three words, ignore the extra words.

  Emphasizing values in a column that satisfy some criterion

1. a presents specific data in an organized format this can be printed.formreportcontroldatabase2. the report creates

  Calculate the stress and strain in a steel rod of diameter d

Write down a main function and the following functions to compute the stress and strain in a steel rod of diameter D (inches) and length L (inches) subject to the compression loads p of 10,000 to 1,000,000 pounds in increments of 100,000 pounds. T..

  Write down or copy or paste the complete problem narrative

Write down or copy/paste the complete problem narrative, including tables. Show your Lindo model. Show your Lindo output. Answer all questions pertaining to the problem.

  Describe three potential risks associated with viruses

Name and describe three potential risks associated with viruses. Name and describe three potential risks areas associated with EDI applications.

  Visualize total bytes count of programming languages

Visualize the total bytes count of different programming languages used across ALL of specific author e.g. Torvalds' repositories

  Using microsoft visio powerpoint or word create a network

using microsoft visio powerpoint or word create a network diagram containing the following components workstation

  Create a new record for the player

Create a new record for the player. After the file is updated, display all the best scores stored in the file.

  Object oriented design of a class

Object oriented design to develop a parent class known as Book which will receive the ISBN, author, title and price of book and select and print the details of all books with the price of more than $50.00.

  Knowledge revolution and industrial revolution

What are your thought on this that would support notion that Knowledge Revolution was born from the Industrial Revolution?

  What is the definition of the term baud rate

What does it mean when a signal is self-clocking? What is the definition of the term baud rate? How does baud rate differ from bits per second?

  Define the structure of the graph

For each counterexample, define the structure of the graph and the cardinality of the variables, and show the ordering produced by each member of the pair.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd