Do the residuals appear to have a normal distribution

Assignment Help Simulation in MATLAB
Reference no: EM131498458

Assignment

Data on air pollution were collected from 41 U.S. cities. The type of air pollution under study was the annual mean concentration of sulfur dioxide. The values of six explanatory variables were also recorded. The variables in the data are as follows:

y : the annual mean concentration of sulfur dioxide (micrograms per cubic meter)
?x1 : average annual temperature in oF
?x2 : number of manufacturing enterprises emplying 20 or more workers
?x3 : population size (thousands)
?x4 : average annual wind speed (mph)
?x5 : average annual precipitation (inches)
?x6 : average number of days with precipitation per year

A model relating y to the six explanatory variables is of interest in order to determine which of the six explanatory variables are related to sulfur dioxide pollution and to be able to predict air pollution for given values of the explanatory variables.

1. Relationship between y and x, and collinearity.

(a) Plot y versus each of the explanatory variables. From your plots determine if higher order terms are needed in any of the explanatory variables.

(b) Using correlation coecients, determine whether there is any evidence of collinearity in the data.

(c) Obtain VIF for each of the explanatory variables from tting a regresson model with y as the response and all six explanatory variables, x1 through x6, as predictors. Does there appear to be any collinearity problems based on the VIF values?

2. Model selection.

(a) Use the best subset regression to obtain the two best models of all possible sizes of p. Obtain values for R2, R2adj , Cp, and s(i.e.,s) for each of the models.

(b) Based on the information from part (a) and using R2adj as your model selection criterion, select the model that you think is best.

(c) Using the information from part (b), which variables were most highly related to sulfur dioxide air pollution?

3. Checking model assumptions. Using your model selected from 2(b), do the following:

(a) Do the residuals appear to have a normal distribution? Justify your answer.

(b) Does the condition of constant variance appear to be satised? Justify your answer.

(c) Find an appropriate transformation of Y so that the assumptions for regression will be satised. Find the "best" model using the transformed Y and the backward variable selection method.

4. Outlying and inuential observations: based on the model you selected in problem 3(c) using transformed Y , do the following:

(a) Do any of the data points appear to have high inuence? Leverage? Justify your answer.

(b) If you identied any high leverage or high inuence points in part (a), compare the estimated models with and without these points

5. Prediction for new observations: based on the model you selected in problem 3(c) using transformed Y , do the following:

(a) Estimate the average level of sulfur dioxide content of the air in a city having the following values for the six explanatory variables: x1 = 60, x2 = 150, x3 = 600, x4 = 10, x5 = 40, and x6 = 100.

(b) Place a 95% condence interval on your estimated sulfur dioxide level and interpret this interval.

(c) Place a 95% prediction interval on your estimated sulfur dioxide level and interpret this interval.

Reference no: EM131498458

Questions Cloud

Applications of the scientific method : Demonstrate how to use the scientific method to make decisions and solve problems in your field of study or everyday life.
Why is organizational development planned change : Why is organizational development planned change? Explain how planned change is important for organizations in today's dynamic environment.
Earnings and profits at the close of current taxable year : S Corporation elected S corporation status beginning in 2001 and will have Subchapter C earnings and profits at the close of the current taxable year.
Difference between management and leadership : Describe the difference between management and leadership. How are both used in sport management? Provide specific practical examples used in the sports.
Do the residuals appear to have a normal distribution : Do the residuals appear to have a normal distribution? Does the condition of constant variance appear to be satised? Justify your answer.
What are fluoride levels in the water you are drinking : What are fluoride levels in the water you are drinking? Would you keep fluoride in the water? Why or why not
Discuss action research and appreciative inquiry : How would an organization decide between action research and appreciative inquiry? Explain briefly how each brings about organizational change.
Expected return on its common stock after refinancing : what is the expected return on its common stock after refinancing?
Shareholder and corporate level tax consequences : Consider the shareholder and corporate level tax consequences of the following alternative transactions:

Reviews

Write a Review

Simulation in MATLAB Questions & Answers

  Calculate the stress intensity factor

Use the three-parameter zone finite element method or the boundary collocation method to calculate the stress intensity factor K, at the crack tip for the plate

  Build a simulation using newtons laws of motion

Build a new and different simulation of your own using Newtons laws of motion and Show the code and describe how it works

  Write the specification of load mover

Write the specification of LOAD MOVER detailed of the whole design and precise for automatic control section and divide the design into various modules and Is the kernel required if yes which one?

  Design the automatic control section using statecharts

Aim of this project is to design an embedded system which can move loads from one place to another. The system can be operated manually, automatically and wirelessly.

  Need an expert who can model a drill in simulink

Need an expert who can model a drill in Simulink. Working model of a drill needing for an improvment to behave more realistically as a drill to drill through plastic block.

  Project is on load frequency control using fpid

Project is on load frequency control using FPID tuned using GA and PSO algorithm and the system is a two area system.

  Number of packets received with time

Let x be the number of packets received with time -

  Build a matlab based graphical user interface

Build a Matlab based graphical user interface (GUI) that operates in conjunction with a base Matlab/ Simulink simulation program. Any base simulation is considered acceptable.

  Build a matlab based graphical user interface

Build a Matlab based graphical user interface (GUI) that operates in conjunction with a base Matlab/ Simulink simulation program. Any base simulation is considered acceptable.

  Simulate the standardised sum of independent

Simulate the standardised sum of independent and identically distributed variates - Fit a linear regression model as in Q5, and plot your estimates for β0 and β1 as N increases, together with a line indicating their true values. Supply your code.

  Plot the original periodic square wave

Plot the original periodic square wave on the same graph. Comment on the difference between the original periodic square wave and its truncated Fourier series presentation.

  Use matlab to plot the function

Plot the original periodic square wave on the same graph. Comment on the difference between the original periodic square wave and its truncated Fourier series presentation.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd