Find an expression for the posterior distribution

Assignment Help Advanced Statistics
Reference no: EM131071073

This assignment involves the so-called change-point problem and a Bayesian analysis of it.

1. The simple change-point problem can be described as follows. Here it is assumed that both p1(y) and p2(y) are known completely.

  • y1, . . . , y­τ|τ are independently identically distributed (iid) with distribution p1(y)

           yτ+1, . . . , yn|τ, are iid as p2(y)

           and τ takes values 1, . . . , n - 1

  • If τ = 0 it is assumed that

           y1, . . . , yn|τ = 0, are iid p2(y)

  • If τ = n it is assumed that

           y1, . . . , yn|τ = n, are iid p1(y)

The case τ = n corresponds to "no-change" and τ < n to "change".

(a) Find an expression for the posterior distribution of change-point for this simple model assuming the values τ = 0, 1, 2, . . . , n are allowed.

(b) Find the posterior distribution of the change-point for the British annual coal mining disasters data set for 1851 until 1962, available in the file coalminedata. R.

Assume that the distribution before the change is Poisson with mean 3.1 and after it is Poisson with mean 1.95.

Find the posterior distribution of the change-point and the mode of this distribution. What is an approximate 95% credible interval for τ? What is the posterior probability of "no-change"?

See the papers by Carlin et al, 1992, Hierarchical Bayesian Analysis of Change point Problems.

Applied Statistics.

Jarrett, 1979, A Note on the Intervals Between Coal-Mining Disasters. Biometrika.

2. (a) This question asks you to develop full conditional distributions for the Bayesian change-point problem which involves a change in mean but not variance of normally distributed data. Following on from Question 1, now take p1(y) to be the normal density with mean µ1 and precision (reciprocal variance) γ and p2(y) to be the normal density with mean µ2 and precision (reciprocal variance) γ. Note the two distributions have the same precisions. The values of the parameters µ1, µ2, γ are all assumed unknown.

For τ taking the values 1, . . . , n - 1 (i.e. at least one observation from each of p1 and p2) find the likelihood p(y|µ1, µ2, γ, τ ) simplified to provide a computationally efficient formula  as a function of the parameters µ1, µ2, γ, τ . Assuming uniform uninformative priors for the parameters µ1, µ2, log(γ), τ , that is

p(µ1, µ2, γ, τ ) ∝ 1/γ, -∞ < µ1 < ∞, -∞ < µ2 < ∞, 0 < γ, τ ∈ {1, . . . , n - 1}

find the four full conditional posterior distributions:

p(µ1|rest), p(µ2|rest), p(γ|rest), p(τ|rest),

where "rest" means all the other parameters and the data y.

Describe a Gibbs sampling algorithm for generating the posterior distributions of the four unknown parameters.

(b) The data to be analysed involve a sequence of so-called temperature anomalies for North Russia at 20 year intervals, 1001, 1021, ..., until recently. Data is also available for various other sites in the world from about 800 AD until recently. The source is an IPCC report. Jansen E, J Overpeck, KR Briffa, J-C Duplessy, F Joos, V Masson-Delmotte, D Olago, B OttoBliesner, WR Peltier, S Rahmstorf, R Ramesh, D Raynaud, D Rind, O Solomina, R Villalba and D Zhang (2007) Palaeoclimate. In Climate change 2007: the physical science basis. Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change, Solomon S, D Qin, M Manning, Z Chen, M Marquis, KB Averyt, M Tignor and HL Miller (eds.). Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA.

The data are found in the file nrussia. R

Develop a Gibbs Sampling algorithm to find the posterior distribution of the change point using the model developed in Question 2(a). Report a 95% credible interval for the change-point and for the two parameters µ1, µ2.

Comment on whether the change-point model seems a reasonable model for these data.

3. (a) Suppose y1, . . . yn given θ are independent Poisson(θ) data so that the likelihood is

p(y|θ) = e-nθθs/j=1nyj!                                 with s = j=1Σn yj.

The marginal likelihood (or evidence) is given by

p(y) = ∫θp(y|θ)p(θ)dθ.                                   (1)

Assuming that the prior for θ is given by a Gamma(α, β) distribution, show that the marginal likelihood, equation (1), is given by

p(y) = (1/j=1nyj!) (βα/Γ(α))(Γ(α + s)/(n + β)α+s)                   where s = j=1Σnyj.

Show that the same result for p(y) is found by using the identity

p(y) = p(y|θ)p(θ)/p(θ|y).

(b) For two models Mj, j = 1, 2, we can compute the posterior odds of model M1 to M2 as

p(M1|y)/p(M2|y) = (p(y|M1)/p(y|M2))(p(M1)/p(M2)).

For Poisson data with mean θ, we want to compare M1: θ = θ0, with the value of θ0 known, with M2: 0 < θ < ∞ with θ having prior Gamma(α, β).

Here- p(y|Mj ) = ∫p(y|θj, Mj)p(θj|Mj) dθj                              j = 1, 2.

That is, (1) computed for Mj, j = 1, 2.

Assuming p(M1) = p(M2), find the posterior odds p(M1|y)/p(M2|y).

Assuming θ0 = 1, compute this for s = n and s = 2n for n = 10(10)1000. Comment.

(c) For data y1, . . . yn, assume that the model with likelihood

p(y1, . . . , yn1, θ2) = j=1tPoisson(yj; θ1) × j=t+1nPoisson(yj; θ2)

and prior

p(θ1, θ2) = Gamma(θ1; α1, β1) × Gamma(θ2; α2, β2)

holds.

Describe in words what situations this probability model might represent.

Show that the marginal likelihood for this model is given by

(1/j=1nyj!) x (β1α1/Γ(α1))(Γ(α1 + st)/(t + β1)α1+st) × (β2α2/Γ(α2))(Γ(α2 + s′t)/(n - t + β2)α2+s′t)                (2)

using the results of Question 3(a) where st = j=1Σt yj and s′t = j=t+1Σnyj.

How can this expression, (2), be used to make inferences for the value of t if it is unknown (t = 1, . . . , n - 1).

Attachment:- russia and coalmine data.rar

Reference no: EM131071073

Questions Cloud

What did the police officer say to judy : Write an essay in which you set out what you believe Officer Paulson would say to Judy. Choose 5 definitions, examples and/or explanations from Dr. King's letter to support your conclusion. When using exact wording from the "Letter," you must qu..
Differences between a swot analysis : Discuss the similarities and differences between a SWOT analysis and a competitive market analysis. How can the results be used to define strategic alternatives in healthcare organizations during strategic planning?
Aligning hr strategy with organizational strategy : How well has your organization aligned its HR strategy with its overall organizational strategy? What are some of its best practices and what are some areas for improvement?
Discuss critical issues of life and crimes of aileen wuornos : Aileen Wuornos was executed by lethal injection on October 9th 2002, if you were the governor of the state of Florida in 2002 would you find any moral or ethical reasons to commute her death sentence to life in prison, and why or why not?
Find an expression for the posterior distribution : Find an expression for the posterior distribution of change-point for this simple model assuming the values τ = 0, 1, 2, . . . , n are allowed
Bumper supplies bumpers to several large auto manufacturers : QualityBumper supplies bumpers to several large auto manufacturers. Its production department works eight hours a day, five days a week and has 10 machines. Under ideal, short term operating conditions each machine is capable of producing 50 bumpe..
How would you use data from foreign exchange market : How would you use data from foreign exchange market to decide between these two hypotheses?
Hr as a competitive advantage : 1. What is your organization's chief competitive advantage in its market? 2. How has it positioned itself to lead its market segment? 3. What roles does the human resource (the people, not the practices) play in establishing this advantage?
When might it be a bad idea to use ppp theory in this way : When might it be a bad idea to use the PPP theory in this way?

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Estimates for the third and fourth quarters of 2013

Construct a less-than-cumulative frequency distribution of the data represented by the graph and represents the grouped frequency distribution of the time spent on the Internet by a randomly-selected sample of Internet subscribers during their most..

  Correlation and regression

What are the degrees of freedom for regression

  Computing fair value of coupon paying bond

A firm has an issue of $1,000 par value bonds with a 9% stated interest rate outstanding. The issue pays interest annually and has 20 years remaining to its maturity date.

  Find the steady-state probability of each given state

Find the steady-state probability of each state. Find the time-average rate at which customers leave node 1. Find the time-average rate at which a given customer cycles through the system.

  Presentation on statistical test of snack foods

List at least three qualitative attributes of the snack food about which they might want to ask consumers. Make sure at least one of them is nominal.

  What is the distribution of the time

What is the distribution of the time till the next epoch of N2(t) and what is the probability that the next epoch of N (t) is an epoch in N1(t)?

  Find the median or interval of median

Find the median of X for each of the cases below. We know that fX (x) is positive for all x > 1, but it is otherwise unknown. Find the median or interval of medians.

  Financial health of a business enterprise

What is captial budgeting? Why are capital budgeting decisions crucial to the long run financial health of a business enterprise?

  Schedulingscheduling is explicitly part of our lives we

schedulingscheduling is explicitly part of our lives. we schedule everything and need to in order to plan our

  Find an expression for the steady-state probability

Verify that the above hypothesis is correct. Find an expression for Π0. Find an expression for the steady-state probability that an arriving customer is discarded.

  Statistical tools-pros and cons

In detail, what statistical tools would apply and how they would be used when analyzing financial information for each of the selected organizations to assess for value. Discuss the pros and cons of each statistic tool.

  Find the mean and variance of x

Find the mean and variance of X. Assuming that {Xm; m ≥ 1} is an IID sequence with the distribution of X and that Sn = X1 + ··· + Xn for each n, find the mean and variance of Sn. (No explanations needed.)

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd