Calculate the coefficient of determination

Reference no: EM131220860

Homework:

Consider the linear model Yi = β0 + β1Xi + εi, where the errors εi, i = 1, ..., n, are independent and identically distributed. Assume εi ~ N(0, 1), so that σ² = 1 is known.

1. Suppose we would like to know whether α0β0 + α1β1 = 0.

(a) Write down the hypotheses for testing α0β0 + α1β1 = 0.

(b) Construct the test statistic T.

(c) Give a rejection criterion based on T so that the type I error is controlled at level α.

(d) Construct the 1 - α confidence interval for α0β0 + α1β1.
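The steps of Problem 1 can be sketched in code. This is a minimal illustration and not the assignment's official solution: it assumes a two-sided alternative, writes the known constants of the linear combination as `a0` and `a1` (to avoid a clash with the significance level `alpha`), and uses the fact that with σ² = 1 known the standardized estimate is N(0, 1) under the null hypothesis. The function name and the toy data are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def linear_combo_z_test(x, y, a0, a1, alpha=0.05):
    """Two-sided z-test of H0: a0*beta0 + a1*beta1 = 0, assuming sigma^2 = 1."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

    # Least squares estimates of the slope and intercept.
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    estimate = a0 * b0 + a1 * b1

    # Var(a0*b0 + a1*b1) = sigma^2 * a'(X'X)^{-1}a, with sigma^2 = 1 known.
    sum_x = sum(x)
    sum_x2 = sum(xi ** 2 for xi in x)
    variance = (a0 ** 2 * sum_x2 - 2 * a0 * a1 * sum_x + a1 ** 2 * n) / (n * sxx)
    se = sqrt(variance)

    t_stat = estimate / se
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return {
        "T": t_stat,
        "reject": abs(t_stat) > z_crit,
        "ci": (estimate - z_crit * se, estimate + z_crit * se),
    }


# Hypothetical toy data lying exactly on y = 1 + 2x (not from the assignment).
toy_x = [0, 1, 2, 3]
toy_y = [1, 3, 5, 7]
result_slope = linear_combo_z_test(toy_x, toy_y, a0=0, a1=1)   # tests beta1 = 0
result_combo = linear_combo_z_test(toy_x, toy_y, a0=2, a1=-1)  # tests 2*beta0 - beta1 = 0
```

Because σ² is treated as known, the critical value comes from the standard normal distribution rather than a t distribution.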

2. We know that s² = Σi ei² / (n - 2), where the ei are the residuals, and that (n - 2)s²/σ² ~ χ²(n - 2), the chi-square distribution with n - 2 degrees of freedom. Suppose we would like to know whether σ² = 1.

(a) Write down the hypotheses for testing σ² = 1.

(b) Construct the test statistic T.

(c) Give a rejection criterion based on T so that the type I error is controlled at level α.

(d) Construct the 1 - α confidence interval for σ².
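One way the pieces of Problem 2 fit together is written out below. It is a sketch under stated assumptions, not the assignment's official solution: the alternative is taken to be two-sided, the rejection region is equal-tailed, and the symbol χ²_{n-2, p} for the p-quantile of the chi-square distribution with n - 2 degrees of freedom is notation introduced here.

```latex
\begin{aligned}
&H_0:\ \sigma^2 = 1 \quad \text{versus} \quad H_1:\ \sigma^2 \neq 1, \\
&T = \frac{(n-2)s^2}{1} = (n-2)s^2 \ \sim\ \chi^2_{n-2} \quad \text{under } H_0, \\
&\text{reject } H_0 \text{ if } T < \chi^2_{n-2,\,\alpha/2} \ \text{ or } \ T > \chi^2_{n-2,\,1-\alpha/2}, \\
&\left[ \frac{(n-2)s^2}{\chi^2_{n-2,\,1-\alpha/2}},\ \frac{(n-2)s^2}{\chi^2_{n-2,\,\alpha/2}} \right]
 \ \text{is a } 1-\alpha \text{ confidence interval for } \sigma^2 .
\end{aligned}
```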

Reading assignment: Read the note on linear algebra.

3. Let Y1, Y2, Y3 be independent response observations satisfying

Yi = μ + β + εi,  i = 1,
Yi = μ + β + εi,  i = 2, 3,

where μ, β are unknown parameters and ε1, ε2, ε3 are independent N(0, σ²) variables for some unknown σ² > 0.

(a) Represent the above setting in the form of a simple linear regression model and specify the values of the explanatory variable X for the three observations.

(b) Express the least squares estimates of μ and β in terms of Y1, Y2, Y3.

(c) Express the fitted values of Y1, Y2, Y3 in terms of Y1, Y2, Y3.

(d) Express the residual sum of squares SSE in terms of Y1, Y2, Y3. What is the distribution of SSE?

(e) Suppose that (Y1, Y2, Y3) is observed to be (1, -2, 2).

(i) Calculate the coefficient of determination R².

(ii) Conduct an F test to determine if you have evidence in support of the hypothesis that Y1, Y2, Y3 are identically distributed. Give your answer on the basis of a p-value calculated for the F test.

4. Carry out a simple linear regression analysis on the following data.

Regressor:   -3   -2   -1   -1    0    1    1    2    2    3
Response:   114  112  110  107  107  105  104  104  101   96

(a) Find a 90% confidence interval for the true slope of the regression line.

(b) Find a 90% confidence interval for the true y-intercept of the regression line.

(c) Find a 90% confidence interval for σ, the true standard deviation of Y. [Hint: The residual sum of squares is distributed as σ²χ²(f) for some suitably chosen f.]

(d) Find a 90% prediction interval for a future observation of Y at x = 1.5.

(e) Find a 90% prediction interval for the average of eight independent future observations of Y at x = 1.5.

(f) Find a 90% prediction interval for the difference between two future observations of Y, one observed at x = 2.5 and the other at x = 1.5.

(g) Find a 90% prediction interval for a future observation of Y at x = -2000. Comment on the validity of this interval.
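The quantities behind parts (a), (b), (d) and (e) of Problem 4 can be sketched as follows. This is an illustrative outline rather than a definitive solution: the helper names are hypothetical, and the critical value `T_CRIT = 1.860` is an assumption read from a standard t table (upper 5% point, 8 degrees of freedom) rather than computed in the code.

```python
from math import sqrt

x = [-3, -2, -1, -1, 0, 1, 1, 2, 2, 3]
y = [114, 112, 110, 107, 107, 105, 104, 104, 101, 96]


def fit_line(x, y):
    """Least squares fit of y = b0 + b1*x plus the quantities the intervals need."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    sse = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, y))
    return {"n": n, "x_bar": x_bar, "sxx": sxx, "b0": b0, "b1": b1,
            "sse": sse, "s2": sse / (n - 2)}


def interval(center, se, t_crit):
    """Symmetric interval center +/- t_crit * se."""
    return (center - t_crit * se, center + t_crit * se)


fit = fit_line(x, y)

# Assumption: upper 5% point of the t distribution with n - 2 = 8 degrees of
# freedom, taken from a standard t table (not computed here).
T_CRIT = 1.860

se_slope = sqrt(fit["s2"] / fit["sxx"])
se_intercept = sqrt(fit["s2"] * (1 / fit["n"] + fit["x_bar"] ** 2 / fit["sxx"]))


def se_prediction(x0, m=1):
    """Standard error for predicting the mean of m future observations at x0."""
    return sqrt(fit["s2"] * (1 / m + 1 / fit["n"]
                             + (x0 - fit["x_bar"]) ** 2 / fit["sxx"]))


slope_ci = interval(fit["b1"], se_slope, T_CRIT)                  # part (a)
intercept_ci = interval(fit["b0"], se_intercept, T_CRIT)          # part (b)
y_hat = fit["b0"] + fit["b1"] * 1.5
single_pi = interval(y_hat, se_prediction(1.5), T_CRIT)           # part (d)
mean_of_8_pi = interval(y_hat, se_prediction(1.5, m=8), T_CRIT)   # part (e)
```

The same `se_prediction` formula applied at x = -2000 would still return a number, which is why part (g) asks for a comment: the interval relies on the linear model holding far outside the observed range of x.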

5. A random sample of 18 U.S. males was selected, and the following information was recorded for each individual:

x = weight (in g) of fat consumed per day,

y = total cholesterol (in mg) in blood per deciliter.

The data are tabulated as follows:

Daily fat intake x (in g):        29   43   52   56   64   77   81   84   93
Total cholesterol y (in mg/dl):  163  169  136  187  188  176  113  196  240

Daily fat intake x (in g):       101  105  110  113  120  127  134  148  157
Total cholesterol y (in mg/dl):  239  258  283  244  291  298  265  297  320

(a) Plot y against x.

(b) Fit a simple linear regression model to the dataset and plot the fitted regression line on the graph obtained in (a).

(c) Compile an ANOVA table for the model fitted in (b). Test at the 5% level whether "daily fat intake" is effective in explaining the variation in cholesterol level among the U.S. males.

(d) Construct a 95% confidence interval for the expected cholesterol level for people whose daily fat intake is 100g.

(e) Construct a 95% prediction interval for the cholesterol level of an individual whose daily fat intake is 100g.

(f) Calculate the coefficient of determination R² for the simple linear regression model.

(g) A margarine manufacturer claims that the difference between the expected blood cholesterol level of individuals consuming 100g of fat per day and that of those consuming 40g of fat per day does not exceed 35 mg/dl. If his claim is true, then perhaps some people would be willing to include extra fat in their diets, thinking that the resulting increase in cholesterol is small enough so that there is no need for concern.

Carry out a size 0.05 test for the manufacturer's claim.
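The sums of squares needed for the ANOVA table in part (c) and for R² in part (f) of Problem 5 can be sketched as below. This is a rough outline under the usual simple linear regression assumptions, not a definitive analysis; the function name `anova_table` is hypothetical, and the data are copied from the table in the problem as printed.

```python
fat = [29, 43, 52, 56, 64, 77, 81, 84, 93,
       101, 105, 110, 113, 120, 127, 134, 148, 157]          # x, in g
chol = [163, 169, 136, 187, 188, 176, 113, 196, 240,
        239, 258, 283, 244, 291, 298, 265, 297, 320]         # y, in mg/dl


def anova_table(x, y):
    """Fit y = b0 + b1*x by least squares and split the total sum of squares."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    fitted = [b0 + b1 * xi for xi in x]

    ssr = sum((fi - y_bar) ** 2 for fi in fitted)             # regression
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))    # residual
    sst = sum((yi - y_bar) ** 2 for yi in y)                  # total
    mse = sse / (n - 2)
    return {
        "b0": b0, "b1": b1,
        "ssr": ssr, "sse": sse, "sst": sst,
        "df": (1, n - 2, n - 1),      # regression, residual, total
        "F": ssr / mse,               # compare with an F(1, n - 2) table value
        "R2": ssr / sst,
    }


table = anova_table(fat, chol)
for key in ("b0", "b1", "ssr", "sse", "sst", "F", "R2"):
    print(f"{key:>4}: {table[key]:.4f}")
```

The F statistic is compared with the upper 5% point of the F distribution with 1 and 16 degrees of freedom, which has to come from a table or a statistics library since it is not computed here.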


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd