Check the assumption of constant variance

Assignment Help Applied Statistics
Reference no: EM132121795

Question 1. For the prostate data set, fit a model with lpsa as the response, and the other variables as predictors.

(a) Suppose a new patient with the following values arrives:

lcavol = 1.45000, lweight = 3.59801, age = 63.00000, lbph = 0.30010,
svi = 0.00000, lcp = -0.79851, gleason = 7.00000, pgg45 = 15.00000.
Predict the lpsa for this patient along with an appropriate 95% prediction interval.

(b) Repeat the questions in (a) for a patient with the same values except that he is age 20. Explain why the prediction interval is wider.

(c) For the model of the previous question, remove all the predictors that are not significant at the 5% level. Using the reduced model recompute the predictions for the x values given in the previous questions (a) and (b). Are the new prediction intervals wider or narrower than in parts (a) and (b)? Which predictions would you prefer? Explain.

Question 2. Using the swiss data set, fit a model with Fertility as the response and all of the other variables as predictors. Answer the following.

(a) Produce a plot of the internally Studentized residuals ri versus the ordinary (least squares) residuals εˆi. (Show R code.)

(b) The points in this plot do not exactly fall on a straight line. Briefly explain why. [ Hint: What is the formula for the internally Studentized residuals? ]

(c) List the externally Studentized residuals ti (which are used as test statistics in the Mean Shift Test).

(d) Perform the Mean Shift Test without Bonferroni adjustment, using α = 0.05. Which provinces are identified as outliers?

(e) Perform the Mean Shift Test with Bonferroni adjustment, using α = 0.05. Which provinces are identified as outliers?

Question 3. Using the eco data set, fit a model with home as the response and all of the other variables as predictors. Answer the following parts.

In parts (b) through (g), you should draw a specific conclusion and clearly refer to the diagnostic tool(s) (plots or statistics) you used to draw your conclusion.

(a) Produce the four default diagnostic plots given by R.

(b) Using an appropriate diagnostic plot, check the functional form of the relationship between the mean response and the predictors.

(c) Check the assumption of constant variance.

(d) What is the largest (most positive) least-squares residual value? What is the smallest (most negative) least-squares residual value?

(e) Which observation has the greatest leverage value?

(f) Check for outliers. (You may use diagnostic plots - a formal test is not necessary.)

(g) Check for influential points.

Question 4. Using R, produce a grid of 9 normal probability plots (qqnorm) for samples of size n = 50 simulated independently from a geometric distribution with parameter prob equal to 0.4.

(Refer to the last section of the Diagnostics in Linear Regression slides. Use the R function rgeom to simulate from the geometric distribution.)

(a) Display your plots and the R code you used to produce them.

(b) Describe two distinct ways in which these plots tend to differ in appearance from what you would expect for normally-distributed data.

Verified Expert

We plot residuals vs. fitted values to check the regression assumption of independence. We'd like to see a band of randomly scattered residuals in a constant band around 0 as the fitted values get larger. A cone shape or inverse cone shape would be a violation of constant variance

Reference no: EM132121795

Questions Cloud

Does bmw have a guided missile corporate culture : Does BMW have a guided missile corporate culture, and incubator corporate culture, a family corporate culture, or an Eiffel tower corporate culture?
Calculate the effective tax rate for both companies : HI5020 Corporate Accounting Assignment - Calculate the effective tax rate for both companies that you have selected
Constructing the facility and simply keeping the money : At what interest rate would an investor be indifferent between constructing the facility and simply keeping the money?
Who has the absolute advantage in the production of wheat : Who has the absolute advantage in the production of wheat? Who has the absolute advantage in the production of cotton?
Check the assumption of constant variance : STAT 425 - Display your plots and the R code you used to produce them - Describe two distinct ways in which these plots tend to differ in appearance
What is the present value of cash flows : If the discount rate is 7 percent, what is the present value of these cash flows?
How much will the yearly scholarship be for : As a wealthy graduate of the University, you have decided to give back to the University in the form of a scholarship.
Growth rate falling off to a constant : XL Co.'s dividends are expected to grow at a 20% rate for the next 3 years, with the growth rate falling off to a constant 6% thereafter.
Task - Analyze certain situations of conflict : MAN501 Written Assignment - Task: Analyze certain situations of conflict and come up with a solution. Identification of personal stage of moral development

Reviews

urv2121795

11/1/2018 5:01:22 AM

great job, now I do not have any problem. I hope it stay like that. Sometimes I have problem but they try their best to sort it out. thank you, that's what a client requires. This is perfectly done! Thank you so much for your assistance!

len2121795

9/25/2018 12:52:02 AM

swiss data can be loaded by typing help(swiss) in r code and other dataset are from faraway packages. most of answer should be R code. A reminder: Unless otherwise stated, all data sets are from the faraway package in R.

Write a Review

Applied Statistics Questions & Answers

  Hypothesis testing

What assumptions about the number of pedestrians passing the location in an hour are necessary for your hypothesis test to be valid?

  Calculate the maximum reduction in the standard deviation

Calculate the maximum reduction in the standard deviation

  Calculate the expected value, variance, and standard deviati

Calculate the expected value, variance, and standard deviation of the total income

  Determine the impact of social media use on student learning

Research paper examines determine the impact of social media use on student learning.

  Unemployment survey

Find a statistics study on Unemployment and explain the five-step process of the study.

  Statistical studies

Locate the original poll, summarize the poling procedure (background on how information was gathered), the sample surveyed.

  Evaluate the expected value of the total number of sales

Evaluate the expected value of the total number of sales

  Statistic project

Identify sample, population, sampling frame (if applicable), and response rate (if applicable). Describe sampling technique (if applicable) or experimental design

  Simple data analysis and comparison

Write a report on simple data analysis and comparison.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  What is the probability

Find the probability of given case.

  Frequency distribution

Accepting Manipulation or Manipulating

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd