What is the average distance in the least dense cluster

Assignment Help Other Subject
Reference no: EM132269084

Assignment 1 -

Answer the following questions based on the attached web file named FBS.

Please submit both the word document containing the relevant screenshots (tables generated after executing k-means clustering), as well as the Excel worksheets containing the generated tables.

1. The Football Bowl Subdivision (FBS) level of the National Collegiate Athletic Association (NCAA) consists of over 100 schools. Most of these schools belong to one of several conferences, or collections of schools, that compete with each other on a regular basis in collegiate sports. Suppose the NCAA has commissioned a study that will propose the formation of conferences based on the similarities of the constituent schools. The file FBS contains data on schools belong to the Football Bowl Subdivision (FBS). Each row in this file contains information on a school. The variables include football stadium capacity, latitude, longitude, athletic department revenue, endowment, and undergraduate enrollment.

a. Apply k-means clustering with k = 10 using football stadium capacity, latitude, longitude, endowment, and enrollment as variables. Be sure to Normalize Input Data and specify 10 iterations and 10 random starts in Step 2 of the k-Means Clustering procedure. Take screenshots of the three tables generated in the KMC_Output worksheet: Cluster Centers, Inter-Cluster Distances, and Cluster Summary. Analyze the resultant clusters. What is the size of the smallest cluster? What is the average distance in the least dense cluster? What makes the least dense cluster so diverse?

(Tips: The least dense cluster in k means is the one with highest average distance in the cluster. For the question "What makes the least dense cluster so diverse", you need to 1) describe the most unique characteristic of the least dense cluster, by referring to the table of Cluster Centers in the KMC_Output worksheet; 2) compare the inter-cluster distances, by referring to the table of Inter-Cluster Distances in the KMC_Output worksheet. What is the nearest distance between this cluster and the others?

b. What problems do you see with the plan with defining the school membership of the 10 conferences directly with the 10 clusters? (Tip: Consider the sizes of clusters)

c. Repeat part a, but this time do not Normalize Input Data in Step 2 of the k-Means Clustering procedure. Take screenshots of the three tables generated in the KMC_Output1 worksheet: Cluster Centers, Inter-Cluster Distances, and Cluster Summary. Analyze the resultant clusters. Do they look quite differ from those in part a? Identify the dominating factor(s) in the formation of these new clusters.

(Tips: Dominating factor is the variable which makes the non-normalized clustering different than the normalized clustering. You can confirm it by clustering the schools solely on the basis of the dominating factor and then noting the similarity of the resulting clusters to the clusters based on all (non-normalized) variables.)

Assignment 2 -

Answer the following questions based on the web file named FBS.

Please submit both the word document containing your answers, as well as the Excel worksheets containing the relevant tables, and Dendrogram generated after executing hierarchical clustering.

Refer to the clustering problem involving the file FBS described in Problem 1. Apply hierarchical clustering with 10 clusters using football stadium capacity, latitude, longitude, endowment, and enrollment as variables. Be sure to Normalize input data in Step 2 of the Hierarchical Clustering procedure. Use Ward's method as the clustering method. Please create the dendrogram on the HC_Dendrogram worksheet. Copy the dendrogram from Excel to Word. And

a) Draw a horizontal line at the distance 22 and indicate the composition of the clusters segment.

b) Draw a horizontal line at the distance 14 and indicate the composition of the clusters segment.

Tips: Please read the textbook (complete version) on the page 260 to 262, you need to draw a horizontal line and indicate the composition of the clusters segment, like the example provided on the page 261 of your textbook. If you use a customized textbook version, please read the pages 77-79.

Steps to draw a horizontal line on dendrogram: after the dendrogram was generated in Excel, right click the dendrogram and select Copy. And then paste it to your word document. Insert a line from the menu "Shapes" from the "Insert" tab.

Note - Attached are excel assignment and the other attachment is supporting doc like how to download the add on.

Attachment:- Assignment Files.rar

Reference no: EM132269084

Questions Cloud

Why you think the declaration has become the revered : Discuss whether you think the Declaration of Independence is relevant in your life today and why.
Industrial organization model and the resource-based model : How can Apple earn above-average returns if they used both the industrial organization model and the resource-based model?
Explain saras views of these issues : REL331: People need rules; that's why we have them in the first place. If we don't have rules, people will do whatever they want, and then where will we be?"
What would you teach a patient about the drug : Discuss the implications of a Respiratory Medication (Zafirlukast) and potential interactions when taken with foods and home remedies.
What is the average distance in the least dense cluster : MISY 5370 Assignment 1 - Apply k-means clustering with k = 10 using football stadium capacity, What is the average distance in the least dense cluster
Talk about why at such young age being president made : Talk about why at such young age being president made such an impact. Talk about the characteristics of the audience shaped the choices of the speaker.
Discuss one of those measures for change in your practice : Review one of the tutorials on quality measures from the AHRQ: National Quality Measures Clearinghouse website. Provide an overview of what you reviewed.
Potential to effectively engage target audience : Describes three marketing tools that have the potential to effectively engage the target audience: event sponsorship, product placements
What factors helped to bring on the american civil war : In what ways did the Civil War answer some problems even as it created new, unresolved issues for the future?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd