Compute pairwise distances between sequences

Reference no: EM131011075

Please assist with the following problems on the Higgins method.

Problem 1
Derive weights for the sequences

ACTA
ACTT
CGTT
AGAT

using the Thompson, Higgins, and Gibson method.

Use the outline below (a-d) to solve this problem:

a) compute pairwise distances between the sequences

b) apply the UPGMA method to join sequences and, subsequently, clusters

c) build the phylogenetic tree

d) derive the sequence weights
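Steps a-d can be sketched in base R as follows. This is only an illustration under stated assumptions, not the expected solution: the distance is taken to be the raw number of mismatched positions (the problem does not say which measure to use), UPGMA is done with `hclust(..., method = "average")`, node heights are taken as half the merge distance, and each branch length is split equally among the sequences below it, as in the Thompson, Higgins, and Gibson weighting scheme. The labels S1-S4 are made up for the example.

```r
# Sequences from Problem 1; the labels S1..S4 are illustrative only.
seqs <- c(S1 = "ACTA", S2 = "ACTT", S3 = "CGTT", S4 = "AGAT")
n <- length(seqs)

# (a) Pairwise distances. Assumption: distance = number of mismatched positions.
chars <- strsplit(seqs, "")
d <- matrix(0, n, n, dimnames = list(names(seqs), names(seqs)))
for (i in seq_len(n)) {
  for (j in seq_len(n)) {
    d[i, j] <- sum(chars[[i]] != chars[[j]])
  }
}

# (b, c) UPGMA is average-linkage clustering. hclust() reports the distance
# at which two clusters merge; the UPGMA node height is half of that.
hc <- hclust(as.dist(d), method = "average")
node_height <- hc$height / 2

# (d) Weights: walk the merges bottom-up. Every branch contributes
# (branch length) / (number of leaves below it) to each of those leaves.
members <- vector("list", n - 1)           # leaves under each internal node
w <- setNames(numeric(n), names(seqs))
for (k in seq_len(n - 1)) {
  leaves <- integer(0)
  for (child in hc$merge[k, ]) {
    if (child < 0) {                        # negative entry = single sequence
      idx <- -child
      below <- 0
    } else {                                # positive entry = earlier cluster
      idx <- members[[child]]
      below <- node_height[child]
    }
    w[idx] <- w[idx] + (node_height[k] - below) / length(idx)
    leaves <- c(leaves, idx)
  }
  members[[k]] <- leaves
}
w_norm <- w / sum(w)                        # weights rescaled to sum to 1

print(d)
print(w)
print(w_norm)
```

Under these assumptions ACTA and ACTT join first (distance 1), then CGTT and AGAT (distance 2), and the two clusters join at an average distance of 2.5. A different distance measure, such as the proportion of mismatches, rescales the branch lengths but leaves the normalized weights unchanged.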

Problem 2

We assumed the additive property when constructing the UPGMA tree in Problem 1.

What is the limitation of this assumption (if any)?

Problem 3

The protein sequence of bacterial species "B3" was used in a BLAST search against the Swiss-Prot protein database. The query returned significant hits to four other bacterial proteins (B1, B2, B4, B5) and one protein in the human genome (H). No other mammalian species has been shown to have a protein similar to B3. Phylogenetic tree construction by several methods resulted in the tree shown below. Explain the presence of this gene in humans.

[Figure: 2358_Protein sequence of bacterial.jpg (the phylogenetic tree referred to above)]

Problem 4

Describe technical and theoretical challenges associated with building phylogenetic trees.

Problem 5

Compare and contrast the parsimony, maximum likelihood, UPGMA, and neighbor-joining methods.

Problem 6

Create a multiple sequence alignment and a phylogenetic tree in R using ape and ClustalW by following the steps below:

1. Install ClustalW (the build for your OS) on your computer from https://www.clustal.org/clustal2/

2. Open R. (all of the following steps will be implemented in R)

3. Set a working directory

4. Install the "ape" package from your R session by typing:

install.packages("ape")

5. Load the "ape" package by typing:

library("ape")

6. Read the accession numbers of the sequences you downloaded for Homework 2 from GenBank; this step is mainly for practice, since you have already downloaded these sequences.

7. Save the result from step 6 as the file <new.fas>

8. Run ClustalW by typing:
system(paste('"path_to_YOUR_clustalw/clustalw2.exe" new.fas'))

9. Read the alignment file (*.aln); it should be in your working directory

10. Create a phylogenetic tree using the neighbor-joining method

11. Plot the tree
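Steps 6-11 could be strung together roughly as below. This is a hedged sketch rather than the required submission: `build_nj_tree` is a hypothetical helper name, the sequences are assumed to be nucleotide records (so `read.GenBank` and `dist.dna` apply), ClustalW is assumed to write its alignment next to the input with an `.aln` extension, and both the accession numbers and the ClustalW path must come from your own setup.

```r
# Hypothetical helper wrapping steps 6-11. It needs the ape package installed
# and a working ClustalW executable; nothing runs until the function is called.
build_nj_tree <- function(accessions, clustalw_path, fasta_file = "new.fas") {
  # Step 6: fetch the sequences for the given accession numbers from GenBank
  seqs <- ape::read.GenBank(accessions)

  # Step 7: save them in FASTA format in the working directory
  ape::write.dna(seqs, file = fasta_file, format = "fasta")

  # Step 8: run ClustalW on the FASTA file
  system(paste(shQuote(clustalw_path), fasta_file))

  # Step 9: read the alignment (assumed to be <input name>.aln)
  aln_file <- sub("\\.[^.]*$", ".aln", fasta_file)
  aln <- ape::read.dna(aln_file, format = "clustal")

  # Step 10: neighbor-joining tree from pairwise DNA distances
  tree <- ape::nj(ape::dist.dna(aln))

  # Step 11: plot the tree and return it invisibly
  plot(tree)
  invisible(tree)
}
```

Call it with a character vector of your own accession numbers and the full path to your ClustalW executable, after setting the working directory as in step 3.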

Submit working R code in a separate file.






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd