Modify the script to perform the hierarchical clustering

Assignment Help Other Subject
Reference no: EM131947515

Article : Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring By T. R. Golub and D. K. Slonim.

BIFS614 Data Structures and Algorithms

Assignment

K-Means Clustering in R (using R-Studio)

To complete this homework you will first have to download R and R-Studio.

Once you have installed R and R-Studio, make sure that you have the corresponding R script (BIFS614 Homework 4.R) on your desktop. Right click on it and open it in RStudio. The interface will open and should look like this:

If you are not familiar with R, the easiest way to execute the script will be one line at a time. Place your cursor at the end of the first line of the script and press the "RUN" button at the top of the upper left window. This will execute ONE LINE at a time. The bottom left window (the Console) will display the output results of the execution, while the upper right window will display any data objects created. The bottom right window will display any visuals (graphs/etc) created.

Step through the script one line at a time by pressing RUN. When you get to the line that loads the Bioconductor source, you should wait and be sure that the package is fully loaded before pressing RUN again. The next line, for biocLite, will also take some time to complete - just be patient.

You should also pay attention to the console output because it may ask you to update a package - if you are asked to update, you should go ahead and say yes. You'll have to place your cursor in the console window and type the letter it wants, which is a "y" for yes (but without the quotes) or an "a" for all if more than one package needs to be updated.

Be sure to read the script - there are many comments and instructions in there as well.

QUESTIONS TO ANSWER:

1. When you performed hierarchical clustering with the defaults set to 8, what did you see? What does this mean? Include an image as well as a description.

2. Modify the script to perform the hierarchical clustering for a different number of clusters (your choice but values between 5 and 12 are probably the most useful). Include an image of the new clustering - compare it to the original settings. What does this tell you?

3. The Golub et. al. (1999) paper describes this dataset. How does your clustering compare to the results that they found?

Reference no: EM131947515

Questions Cloud

Difference between training and performance consulting : Comment on the critical difference between training and performance consulting. Why is the Collaborative form of consulting most preferred (and successful)?
Monopolistic competition-perfect competition : If I open a new restaurant that specializes in soup (its a soup and salad bar) in a city with no restaurants that have soup and salad bars
Process of lending and relending : The process of lending and relending creates money throughout the banking system. As a result of Ellen's deposit, how much money, in the form of deposits
How much of delta airlines stock was outstanding at year end : How much of Delta Airlines stock was outstanding at year end. Did Delta Airlines pay any cash dividends during 2016?
Modify the script to perform the hierarchical clustering : When you performed hierarchical clustering with the defaults set to 8, what did you see? What does this mean? Include an image as well as a description.
What is your account margin in percent : The value of the stocks held short rises to $250,000. What is your account margin in percent?
South africa socio-economic problems : A market economy and a democratic elected government is the ideal solution for South Africa's socio-economic problems. Discuss this statement critically.
What is karla debt -to-equity ratio : Karla, a recent college graduate, is earning $43,000 per year with $36,000 in take-home pay. what is Karla’s debt -to-equity ratio?
Uses to report the net cash flows from operating activities : What method does Delta Airlines uses to report the net cash flows from operating activities?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd