Linear regression assignment help, Advanced Statistics

Assignment Help:

Using World Bank (2004) World Development Indicators; Washington: International Bank for Reconstruction & Development/ The World Bank, located in the reference section of the Learning Centre (Stats 330.9 WOR), collect a sample of data comprising cereal yield in kg/ha and fertilizer consumption in hundreds of grams /ha of arable land for the period 2000-2002 from 25 countries around the world. These data can be found in Table 3.3 pp123-126 and Table 3.2 pp 119-122 respectively.

For the regression analysis, use cereal yield as the dependent variable (Y) and fertilizer consumption as the independent variable (X). Enter the two variables into an SPSS file and carry out the following exercise: 

1.         Regress cereal yield (Y) on fertilizer consumption (X).

2.         Produce a plot of the 30 observations, the calculated regression line and the 95% confidence limits.                                                               

3.         What is the correlation between cereal yield and fertilizer consumption?             

4.          State whether the modeled regression relationship is significant.

 5.         Examine the plotted residuals and attempt to explain two of the extreme positive and negative values (max 300 words).                       

6.         Calculate the runs test and the Durbin-Watson statistic on the residuals and indicate whether auto-correlation is present at the 0.05 significance level.

 

1)

We run the regression of cereal yield on fertilizer consumption. The fitted regression line is given by:

 

cereal yield= 2254.069+0.253 * fertilizer consumption

 

 

Regression

 

Variables Entered/Removed

Model

Variables Entered

Variables Removed

Method

1

fertilizer consumptiona

.

Enter

a. All requested variables entered.

 

b. Dependent Variable: cereal yields

 

 

Residuals Statisticsa

 

Minimum

Maximum

Mean

Std. Deviation

N

Predicted Value

2254.07

3054.49

2400.90

195.404

25

Residual

-2.251E3

4481.879

.000

1822.796

25

Std. Predicted Value

-.751

3.345

.000

1.000

25

Std. Residual

-1.209

2.407

.000

.979

25

a. Dependent Variable: cereal yields

 

 

 

 

3) The Correlation coefficient is 0.01

 

4) From the ANOVA table we see that the regression is not significant at 5% level of significance.

 

ANOVAb

Model

Sum of Squares

df

Mean Square

F

Sig.

1

Regression

916388.422

1

916388.422

.264

.612a

Residual

7.974E7

23

3467045.255

 

 

Total

8.066E7

24

 

 

 

a. Predictors: (Constant), fertilizer consumption

 

 

b. Dependent Variable: cereal yields

 

 

 

 

6)

The Durbin-Watson statistic is 1.588 which is close to 2 indicating there may be no or little positive autocorrelation

Model Summary

Model

R

R Square

Adjusted R Square

Std. Error of the Estimate

Durbin-Watson

1

.107a

.011

-.032

1862.000

1.588

a. Predictors: (Constant), fertilizer consumption

 

b. Dependent Variable: cereal yields

 

 

However we next perform the run test which clearly implies that the residuals are independent at 5% level.

 

NPar Tests 

 

Runs Test

 

Standardized Residual

Test Valuea

-.15308

Cases < Test Value

12

Cases >= Test Value

13

Total Cases

25

Number of Runs

15

Z

.417

Asymp. Sig. (2-tailed)

.676

a. Median

 

 

 

7)

For less developed countries the intercept term will be very low as compared to high developed countries.

Moreover the slope of the fertilizer consumption will also be low in less developed countries indicating slow growth rate of cereal yield.

 

7.         What differences would you expect to find between less developed and more developed countries in terms of the relationship between cereal yield and fertilizer consumption?


Related Discussions:- Linear regression assignment help

Explain perturbation theory, Perturbation theory : The theory useful in ass...

Perturbation theory : The theory useful in assessing how well a specific algorithm or the statistical model performs when the observations suffer less random changes. In very commo

Bioinformatics, Bioinformatics : Essentially the application of the informa...

Bioinformatics : Essentially the application of the information theory to biology to deal with the deluge of the information resulting from the advances in molecular biology. The m

Confidence interval estimation, An auditor for a government agency needs to...

An auditor for a government agency needs to evaluate payments for doctors' office visits paid by Medicare in a small regional town during the month of June. A total of 25,056 visit

Effect sparsity, The term which is used in the industrial experimentation, ...

The term which is used in the industrial experimentation, where there is commonly a large set of candidate factors believed to have the possible significant influence on the respon

Naor''s distribution, Naor's distribution is the discrete probability dist...

Naor's distribution is the discrete probability distribution which arises from the following model; Assume an urn contains n balls of which one is red and the remainder is whit

Computer-intensive methods, Computer-intensive methods : The statistical me...

Computer-intensive methods : The statistical methods which require almost identical computations on the data repeated number of times. The term computer intensive is, certainly, a

Hypothesis testing and chi-square tests.., The results of a survey determin...

The results of a survey determined whether the age of a driver 21 years and older has any effect on the number of motor vehicle accidents in which he/she is involved. Question 1:

Persson rootze ´n estimator, Persson Rootze ´n estimator  is an estimator f...

Persson Rootze ´n estimator  is an estimator for the parameters in the normal distribution when the sample is truncated so that all the observations under some fixed value C are re

Explain intervention analysis in time series, Intervention analysis in time...

Intervention analysis in time series : The extension of the autoregressive integrated moving average models applied to time series permitting for the study of the magnitude and str

Curvature measures, The diagnostic tools or devices used to approach the cl...

The diagnostic tools or devices used to approach the closeness to the linearity of the non-linear model. They calculate the deviation of so-called expectation surface from the plan

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd