Calculate the most likely hidden state sequence

Assignment Help Applied Statistics
Reference no: EM132252822

Machine Learning Homework - HMM for DNA Sequence

The goal of this assignment is for you to gain familiarity with the hidden Markov model (HMM). Specifically, you will use HMM to decode a simple DNA sequence. It is well known that a DNA sequence is a series of components from A, C, G, T. Now let's assume there is one hidden variable S that controls the generation of DNA sequence. S takes 2 possible states S1, S2. Assume the following transition probabilities for HMM

P(S1|S1) = 0.8, P(S2|S1) = 0.2, P(S1|S2) = 0.2, P(S2|S2) = 0.8

emission probabilities as following

P(A|S1) = 0.3, P(C|S1) = 0.2, P(G|S1) = 0.3, P(T|S1) = 0.2

P(A|S2) = 0.1, P(C|S2) = 0.4, P(G|S2) = 0.1, P(T|S2) = 0.4

and initial probabilities as following

P(S1) = 0.5, P(S2) = 0.5

All transition, emission, initial probabilities are together referred to as θ. Assuming the observation sequence is O = CGTCA, in the first part of this assignment, you will manually calculate the most likely hidden state sequence using the Viterbi algorithm.

In the second part of this assignment, you are provided with a new observation sequence of O = ATCG. Please compute the probability of observing O together with intermediate calculations. If you would like to report log-probability, that also works. Please use the natural logarithm.

Questions -

1. Manually calculate the most likely hidden state sequence using the Viterbi algorithm.

2. Report the decoded state sequence.

3. Together with intermediate calculations, including the V-matrix and backtracking matrix.

4. Provided with a new observation sequence of O = ATCG.

5. Compute the probability of observing O together with intermediate calculations.

6. Report log-probability. Please use the natural logarithm.

Attachment:- Assignment File.rar

Reference no: EM132252822

Questions Cloud

Is the following constant declaration valid : 1) Is the following constant declaration valid? 2) Which of the following C++ statements declares and initializes degrees to 3.25%?
Determine whether the manager is making good decisions : Given the importance of proper assumptions, your boss asked you to assess the accuracy of certain business assumptions.
Context free gramamr in chomsky normal form : Show that if G is a Context Free Gramamr in Chomsky normal form, then for any string ?? L(G), |?|=n=1, then exactly 2n-1 steps are required for anyderivation
How will you evaluate effectiveness : List appropriate nursing interventions for your chosen patient or community. How will you evaluate effectiveness? Include an evaluation tool or rubric.
Calculate the most likely hidden state sequence : CAP5610 Machine Learning Homework - HMM for DNA Sequence. Calculate the most likely hidden state sequence using the Viterbi algorithm
Display salesorderid-orderdate : Display salesorderid, orderdate, totaldue, and territory name from salesorderheader and salesterritory for all totaldue that are greater
Describe potential risks associated with this project : Share other important components that a project manager should consider as this project continues into the execution phase.
Explain what the processor will do in this fragment : Explain what the processor will do in this fragment? What will be stored in "m"?
Who are the project stakeholders : How should they communicate to different stakeholders during the project? What information should be shared with the project stakeholders?

Reviews

len2252822

3/10/2019 11:32:56 PM

Note: Homework modified from Eric Xing at Carnegie Mellon. (100 points) Please submit: A report named report first name lastname.pdf. Please report the de-coded state sequence (20 points), together with intermediate calculations, including the V-matrix (40 points) and backtracking matrix (40 points). (25 Bonus points) In the second part of this assignment, you are provided with a new observation sequence of O = ATCG. Please submit: A report named report first name lastname.pdf. Please compute the probability of observing O together with intermediate calculations. If you would like to report log-probability, that also works. Please use the natural logarithm.

Write a Review

Applied Statistics Questions & Answers

  Hypothesis testing

What assumptions about the number of pedestrians passing the location in an hour are necessary for your hypothesis test to be valid?

  Calculate the maximum reduction in the standard deviation

Calculate the maximum reduction in the standard deviation

  Calculate the expected value, variance, and standard deviati

Calculate the expected value, variance, and standard deviation of the total income

  Determine the impact of social media use on student learning

Research paper examines determine the impact of social media use on student learning.

  Unemployment survey

Find a statistics study on Unemployment and explain the five-step process of the study.

  Statistical studies

Locate the original poll, summarize the poling procedure (background on how information was gathered), the sample surveyed.

  Evaluate the expected value of the total number of sales

Evaluate the expected value of the total number of sales

  Statistic project

Identify sample, population, sampling frame (if applicable), and response rate (if applicable). Describe sampling technique (if applicable) or experimental design

  Simple data analysis and comparison

Write a report on simple data analysis and comparison.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  What is the probability

Find the probability of given case.

  Frequency distribution

Accepting Manipulation or Manipulating

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd