The mock data file consists of 3 columns each containing

Assignment Help Basic Statistics
Reference no: EM13371252

The mock data file consists of 3 columns, each containing 1000 numbers:

1. a flag indicating which data row

2. the sampled x value (1 - 1000)

3. the corresponding sampled y value (1 - 1000)

The challenge is essentially to determine the linear relationship between x and y using these 1000 data pairs. It divides up into three steps, of increasing complexity.

Step 1: Use ordinary least squares to fit the linear model  y = a + bx  to the mock data

          (a) compute LS estimators of a and b,

          (b) estimate the variance of the (assumed Gaussian) noise which has been added to the mock y values

          (c) estimate errors on a_LS and b_LS, and their covariance

Step 2: By casting the data analysis challenge not as a least squares problem, but as a maximum likelihood problem, form an appropriate likelihood function for the mock data, which depends on the parameters (a,b).

          Then, by computing the log likelihood on a rectangular grid of values of a and b (you need to think carefully about the range of a and b values you should consider, and the spacing between them), and in turn computing the value of chi-squared for each (a,b) pair on your grid, you should find the minimum value of chi-squared.  You then should turn your grid of values into a rectangular array of Delta chi-squared values.  Finally, using the information in the table in Section 6, you should compute and plot Bayesian credible regions  for the parameters at e.g. 68.3%, 95.4%, 99.73%. (Carrying out the calculations and making a contour plot from the results is straightforward in e.g. MATLAB, although you are welcome to use any programming language you wish).

Step 3: Finally, using the Metropolis algorithm, and assuming a Gaussian likelihood function for the model parameters a and b, write an MCMC code to generate a sample from the likelihood function - thinking carefully about your choices of proposal density and prior range for a and b. Use this sample to estimate the mean values, errors and covariance of the parameters a and b from their sampled marginal distributions. Devise a method for estimating and plotting Bayesian credible regions for the paramters, using your MCMC sample.

while Steps 2 and 3 both involve more sophisticated methods and will require you to write some simple computer code (e.g. in MATLAB)

Reference no: EM13371252

Questions Cloud

Part-1weekly discussion thread assignments require that you : part-1weekly discussion thread assignments require that you select and respond to one of the learning outcomes listed
Q1in this area we will discuss the significance of the : q1in this area we will discuss the significance of the accounting equation the rules of debit and credit and the steps
1 describe the role and importance of communication in the : 1. describe the role and importance of communication in the managers job. communication is the process of transmitting
1 consider the following distribution of grades in a : 1. consider the following distribution of grades in a class.nbsp think of this as your population.3540 2 3 4 4 550 0 0
The mock data file consists of 3 columns each containing : the mock data file consists of 3 columns each containing 1000 numbers1. a flag indicating which data row2. the sampled
Question 1 variable xx lt- c3 4 8 4 2 1 0 6 variable yy lt- : question 1 variable xx lt- c3 4 8 4 2 1 0 6 variable yy lt- c1 2 4 2 2 0 1 4 calculate z-scores for all values in the
A company specializing in earth-moving equipment is : a company specializing in earth-moving equipment is contemplating expanding its operations. it must decide whether to
Explain the process for handling conflicts taking place in : explain the process for handling conflicts taking place in the organization.analyze the elements of organizational
New keynesian model with technology shocks consider a new : new keynesian model with technology shocks consider a new keynesian economy with equilibrium conditions given bywhere

Reviews

Write a Review

Basic Statistics Questions & Answers

  Find probability all the hotel-s rooms will be rented

Assume that demand for the upcoming weekend is normally distributed with a standard deviation of 30. What is the probability all the hotel's rooms will be rented?

  Area of the shaded region

Find the area of the shaded region. Answer to the nearest tenth. Picture a square. The arcs have their centers at opposite vertices of the square and that is the region that is shaded.

  Find probability that three consumers recognize brand name

Find the probability that exactly 3 of the 5 consumers recognize the brand name. Also find the probability that the number who recognize it is not 3.

  Generic brand of soft drink

Market researchers would like to know if customers prefer a well-known brand over a generic brand of soft drink. They give a large sample of people the two drinks to taste in a random order and ask them which one tastes better.

  Hypothesis test to find difference between population means

Perform a hypothesis test by using.04 level of significance to find out if there is a difference between the population means.

  The p-value for the f test for interaction

An experiment was carried out to assess the effects of four tomato varieties and four planting densities on yield. There were 64 plots, and each of the 16 factor combinations was randomly assigned to 4 plots

  What is the probability-paid cash or check

Ms. Tina Stevens just purchased a new dress that cost $120. What is the probability she paid cash or check? (Round your answer to 3 decimal places.)

  Sample size for estimating the population proportion

Suppose that past data are not available for developing a planning value for p; therefore, you will use p = 0.050 for the planning average.

  Closest to the correlation for the data

A scatterplot shows a set of data points that are clustered close to a line that slopes down to the right. Which of the following values would be closest to the correlation for these data?

  Volunteers at a teenage distress center have been assigned

Volunteers at a teenage distress center have been assigned based on their skills and expertise. It's assumed that 40% are drug related, 25% sex related, 25% stress, and 10% education. For the study sample of 120 calls per month were categoruzed as fo..

  What is the probability that 3 heads are obtained

A coin with the probability of heads being 0.8 is tossed 4 tim

  Find the probability that team one wins

If the match ends when one of the teams has scored k more points than the other, find the probability that team 1 wins.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd