Calculate the coefficient of determination

Assignment Help Applied Statistics
Reference no: EM131220860

Homework:

Under the linear model Yi = β0 + β1Xi + ∈i, where ∈i, i = 1, ... , n are independent identical distributed. Assume ∈i ~ N(0, 1). Note that now σ2 = 1 is given.

1, Suppose we would like to know whether α00 + α1β1 = 0

1. write down the hypothesis for testing α00 + α1β1 = 0.

2. construct the test statistics T

3. create a criteria based on T to reject null hypothesis so that the type I error is controlled at α.

4. construct the 1 - α confidence interval

2, We know the s2 = Σi^i/n-2, and (n - 2)s22 ~ xn-22. Suppose we would like to know whether σ2 = 1

1. write down the hypothesis for testing σ2 = 1

2. construct the test statistics T

3. create a criteria based on T to reject null hypothesis so that the type I error is controlled at α.

4. construct the 1 - α confidence interval.

Reading assignment: Read the note on linear algebra.

3.

Let Y1, Y2, Y3 be independent response observations satisfying

         μ + β + ∈i i= 1,

Yi =

        μ + β + ∈i, i= 2, 3    

where μ, β are unknown parameters and ∈1, ∈2, ∈3 are independent N(0, σ2) variables for some unknown σ2 > 0.

(a) Represent the above setting in the form of a simple linear regression model and specify the values of the explanatory variable X for the three observations.

(b) Express the least squares estimates of μ and β in terms of Y1, Y2, Y3.

(c) Express the fitted values of Y1, Y2, Y3 in terms of Y1, Y2, Y3

(d) Express the residual sum of squares SSE in terms of Y1, Y2, Y3. What is the distribution of SSE?

(e) Suppose that (Y1, Y2, Y3) is observed to be (1, -2, 2). (1) Calculate the coefficient of determination R2.

(ii) Conduct an F test to determine if you have evidence in support of the hypothesis that Y1, Y2, Y3 are identically distributed. Give your answer on the basis of a p-value calculated for the F test.

4. Carry out a simple linear regression analysis on the following data.

Regressor,  -3 -2 -1 -1 0 1 1 2 2 3
Response,  114 112 110 107 107 105 104 104 101 96

(a) Find a 90% confidence interval for the true slope of the regression line.

(b) Find a 90% confidence interval for the true y-intercept of the regression line.

(c) Find a 90% confidence interval for σ-, the true standard deviation of Y. [Hint: The residual sum of squares is distributed as cr2x2f for some suitably chosen 1.]

(d) Find a 90% prediction interval for a future observation of Y at x = 1.5.

(e) Find a 90% prediction interval for the average of eight independent future observations of Y at X =1.5

(f) Find a 90% prediction interval for the difference between two future observations of Y, one observed at x = 2.5 and the other at x = 1.5.

(g) Find a 90% prediction interval for a future observation of Y at x = -2000, Comment on the validity of this interval.

5. A random sample of 18 U.S. males was selected, and the following information was recorded for each individual:

x = weight (in g) of fat consumed per day,

y = total cholesterol (in mg) in blood per deciliter.

The data are tabulated as follows:

Daily fat intake x, (in g) 29 43 52 56 64 77 81 84 93
Total cholesterol y, (in nigidl)  163 169 136 187 188 176 113 196 240
Daily fat intake x, (in g)  101 105 110 113 120 127 134 148 157
Total cholesterol y, (in mg/dl)  239 258 283 244 291 298 265 297 320

(a) Plot y against x.

(b) Fit a simple linear regression model to the dataset and plot the fitted regression line on the graph obtained in (a).

(c) Compile an ANOVA table for the model fitted in (b). Test at the 5% level whether "daily fat intake" is effective in explaining the variation in cholesterol level among the U.S. males.

(d) Construct a 95% confidence interval for the expected cholesterol level for people whose daily fat intake is 100g.

(e) Construct a 95% prediction interval for the cholesterol level of an individual whose daily fat intake is 100g.

(f) Calculate the coefficient of determination R2 for the simple linear regression model.

(g) A margarine manufacturer claims that the difference between the expected blood choles¬terol level of individuals consuming 100g of fat per day and that of those consuming 40g of fat per day does not exceed 35 mg/dl. If his claim is true, then perhaps some people would be willing to include extra fat in their diets, thinking that the resulting increase in cholesterol is small enough so that there is no need for concern.

Carry out a size 0.05 test for the manufacturer's claim.

Reference no: EM131220860

Questions Cloud

Prepare ten pages paper that addresses the given situations : Using the situations above, prepare a 5-10 page Microsoft Word document that addresses the above situations and meets APA standards.
Development in several states enacting voter id laws : Analyze and describe the pros and cons on both sides of the debate about these laws - Is voter fraud a major problem for our democracy or are some groups trying to make it harder for some segments of society to vote?
Describe the words in this language : Consider the language S*, where S = {a ab bal. Is the string (abbba) a word in this language? Write out all the words in this language with seven or fewer letters. What is another way in which to describe the words in this language? Be careful, th..
Explain the movements in the real exchange rate : Do a bit of Internet research on Russia and try to explain the movements in the real exchange rate.- Do movements in Russia's real exchange rate explain most of the movements in its nominal exchange rate?
Calculate the coefficient of determination : Calculate the coefficient of determination R2 for the simple linear regression model - create a criteria based on T to reject null hypothesis so that the type I error is controlled at α.
Different sequences of results are possible : A fair 6-sided die is rolled 5 times and the result is recorded for each roll. How many different sequences of results are possible? Explain how you got your answer.
Increasing at a rate proportional : Scientists began studying the elk population in Yellowstone Park in 1990 when there were 500 elk. They determined that t years after the study began the population size,N(t), was increasing at a rate proportional to 700 - N(t). If the population w..
Analyse the pros and cons for recruiting high quality talent : BUS201 Foundations of Workplace Success Group Assessment - Organisation Analysis Report. Research the industry in which this company belongs and critically analyse the pros and cons for recruiting high quality talent for this industry
Why do you think the microbead act became law so quickly : Why do you think the Microbead Act became law so quickly (especially in our legislative system) while the Main Street Fairness Act has yet to be passed?

Reviews

Write a Review

 

Applied Statistics Questions & Answers

  What is the probability that all will arrive on time

What is the probability that at least one of the flights will be late and why the binomial model is not appropriate for finding the probability that at least one flight will be late.

  Satisfy the stochastic difference equation

Satisfy the stochastic difference equation

  A company manufactures microwave for popcorn

A company manufactures microwave for popcorn and claims that only 2% of the popcorn failed to pop. Another company who is competitor believes that percentage could be much higher. So this second company does tests with 3000 kernels and finds ..

  The economic policy institute reports

The Economic Policy Institute reports that the average entry-level wage for male college graduates is $22.82 per hour and for female college graduates is $18.01 per hour.  The standard deviation for male graduates is $3.72 and for female graduates is..

  Calculate the meanand and the standard deviation

Calculate the mean, mx, the variance, sx , and the standard deviation, sx, of the exponential distribution of x. Find the probability that x will be in the interval [mx ± 2sx].

  A variable contains five categories

A variable contains five categories. It is expected that data are uniformly distributed across these five categories. To test this, a sample of observed data is gathered on this variable resulting in frequencies of 27, 30, 29, 21, and 24. Alpha is 0...

  Determine the percentages for each given interval

Use the information in the table to determine the percentages for each interval. Do the data below show a linear relation, non-linear relation, or no relation at all

  Odds ratio odds ratio or is sometimes reported instead of

odds ratio odds ratio or is sometimes reported instead of relative risk. this normally gives the odds or likelihood of

  The average monthly rent for one bedroom apartments

The average Monthly rent for one bedroom apartments in Chattanooga has been $700. Because of the downturn in the real estate market, it is believed that there has been a decrease in the average rental. State the null and alternative hypothesis

  What is the importance of the decision theory

Decision Trees are graphic displays of the decision process. When do you feel it is appropriate to use decision trees?

  What is the probability that the person

In a game there are 70 people in which 40 are boys and 30 are girls, out of which 10 people are selected at random. One from the total group, thus selected is selected as a leader at random. What is the probability that the person, chosen as the lead..

  The average number of days absent per term

A random sample of [n=64 children] of working mothers showed that they were absent from school a sample average of [x=5.3] days per term, with a standard deviation [s=1.8 days].  Provide a 96% confidence interval for the average number of days absent..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd