What steps did you take to resolve the challenge

Assignment Help Management Information Sys
Reference no: EM132260309

Week 8 Individual Exercise

Deliverables: Two Files: (1) Submit this lab report with answers to all questions including output screenshots into the ‘Individual Exercises Week 8' assignment folder. (2) Submit an R script that contains all commands with comments that briefly describe each commands purpose.

Part 2 - Run an exercise on the Vehicle Solhouettes dataset from vehicle.csv, completing this report and providing the commands, output screenshots, and discussion/interpretation as requested. Ensure that all commands are saved in this report AND in an R script.

For Reference: UCI Machine Learning Repository: Vehicle Silhouettes

a. Introduction:

i. Based on what you have learned this week about k-means clustering, provide a one-paragraph masters-level response describing what you anticipate that the kmeans method will accomplish for the Vehicle Silhouettes data? Be specific about the behavior and output structure of k-means models. (80-120 words)

b. Data Pre-Processing: Load the Vehicle Silhouettes data into R Studio using the read.csv command (do not use File > Import Dataset > From CSV in the R Studio GUI as this uses read_csv() resulting in significant different variable types!!!).

i. Make a copy of the loaded Vehicle Silhouettes data you just imported and name the copy ‘myvehicle'. Keep the original import as you will need both the original and copy to complete this report. Include the command demonstrating this step below.

Command: >

ii. Remove the variable class from ‘myvehicle'. Include the command and answer to the question below.

Command: >

Why do we need to remove the class variable as part of the data preprocessing steps for k-means clustering?

iii. Run the scale() function on ‘myvehicle'. Include the command and answer to the question below. (Note: This command is NOT part of your tutorial. Consult the function help and use the default arguments. Hint: scale() is a function that outputs its results. You MUST save the scaled output back to the original ‘myvehicle'.

Command: >

Why must we scale data as part of the data preprocessing steps for k-means clustering?

iv. What additional data preprocessing steps (if any) did you need to execute? Include the command(s) and output screenshot below.

Command(s): >

Output:

c. K-Means Clustering - Running the Method (Hint: Record your results with k=4 in the table in part f):

i. Run ‘set.seed(12345)' and then run the kmeans method with k=4 and store the output to a variable named ‘kc'. Include the command, output screenshot, and discuss the input parameters you used.

Command: >

Output:

Discussion:

ii. Enter ‘kc' at the prompt. Provide the output below and then answer the following questions:

Output:

How many instances are in each cluster?

What information does the cluster means section provide and how were those numbers obtained?

What is the clustering vector?

What is the sum of squares by clusters and what does it mean?

iii. Run the ‘kc$iter' command. Include the command, output screenshot, and explain what the output shows.

Command: >

Output:

Discussion:

d. K-Means Clustering-Evaluate the Model:

i. Build the cross-tabulation to compare how the method clustered the vehicles from ‘myvehicle' to the actual vehicle class from your original import. Include the command, output screenshot, and answer the following questions:

Command: >

Output:

What is the dominant vehicle class in each cluster?

What is the dominant cluster for each vehicle class?

What percentage of vehicles were clustered in agreement with the actual class?

e. K-Means Clustering - Cluster Visualization:

i. Run the ‘clusplot(kc)' function to visualize your model. Modify the plot appearance to make your visualization clear and easy to interpret. Unlike previous exercises, your visualization will now be evaluated on clarity and aesthetics in addition to the standard command, output, and interpretation evaluation. Include the full command, output screenshot (zoomed in), and a one-paragraph, masters-level response with your interpretation of your plot.

(Hint: Your interpretation should discuss all of the visualized clusters and should begin to address specific observations (data points) within each that warrant discussion.)

Command: >

Output:

Interpretation: (80-120 words)

f. K-Means Clustering - Experiment with Different K Values (3 Runs Summarized):

i. Completely fill in the table below documenting the results of your experimentation with modifying the k value. You may use any k value other than 4 that is greater than 0. You do not need to provide any commands or output screenshots in this report. However, you will be evaluated on these commands being present in your R script!

k= Number of Instances in Each Cluster Between Clusters Sum of Squares Within Clusters Sum of Squares Number of Iterations
4

ii. What effect do you observe that modifying the k values has on the method results? Provide a one-paragraph, masters-level response below:

iii. What is an ideal value of k for the Vehicle Silhouettes data? This is a subjective and open-ended question. Challenge yourself and come up with a creative and well-supported answer for which value you believe is ideal. Provide a one-paragraph, masters-level response below: (80-120 words)

g. Summary:

i. What differences between k-means clustering and classification methods did you observe? Provide a one-paragraph, masters-level response. (80-120 words)

ii. Which part of this exercise did you find the most challenging and what steps did you take to resolve the challenge?

References

Reference no: EM132260309

Questions Cloud

Develop a new erm for their current organization : Would you recommend that the base their new ERM on PM2 Risk Scorecard or ISO 31000? Explain why you would choose one over the other.
Determine the requirements of the new internet-accessible : Determine the requirements of the new Internet-accessible SRS software system.
How was the organization impacted : Conduct a web search on organizations that were affected by Hurricane Katrina. How was the organization impacted? What losses did it suffer?
Define the pathophysiology of the disease in detail : Write a 300- 500 words discussion that describes the pathophysiology of the disease in detail, listing at least two (2) nursing diagnoses and current treatment.
What steps did you take to resolve the challenge : What differences between k-means clustering and classification methods did you observe? Provide a one-paragraph, masters-level response.
Develop a records inventory survey : State which of the three major areas or functional areas of the City General Hospital that you intend to focus on for your records inventory.
Describe the pathophysiology of the disease in detail : Students are to select a topic from your readings addressing a specific disease (see list below) and pharmacological treatment used to manage the disease.
Explain how the influence of the case study : MGT 522 Leadership and Communication - Abu Dhabi University - Explain the reasons behind the success or the failure of the case study
Does one or the other provide more ethical care : Two major models of care exist, for-profit and not-for-profit? Does one or the other provide more ethical care? Why or why not?

Reviews

Write a Review

Management Information Sys Questions & Answers

  Information technology and the changing fabric

Illustrations of concepts from organizational structure, organizational power and politics and organizational culture.

  Case study: software-as-a-service goes mainstream

Explain the questions based on case study. case study - salesforce.com: software-as-a-service goes mainstream

  Research proposal on cloud computing

The usage and influence of outsourcing and cloud computing on Management Information Systems is the proposed topic of the research project.

  Host an e-commerce site for a small start-up company

This paper will help develop internet skills in commercial services for hosting an e-commerce site for a small start-up company.

  How are internet technologies affecting the structure

How are Internet technologies affecting the structure and work roles of modern organizations?

  Segregation of duties in the personal computing environment

Why is inadequate segregation of duties a problem in the personal computing environment?

  Social media strategy implementation and evaluation

Social media strategy implementation and evaluation

  Problems in the personal computing environment

What is the basic purpose behind segregation of duties a problem in the personal computing environment?

  Role of it/is in an organisation

Prepare a presentation on Information Systems and Organizational changes

  Perky pies

Information systems to adequately manage supply both up and down stream.

  Mark the equilibrium price and quantity

The demand schedule for computer chips.

  Visit and analyze the company-specific web-site

Visit and analyze the Company-specific web-site with respect to E-Commerce issues

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd