Explain ridge regression, Applied Statistics

Assignment Help:

Using log(x1), log(x2) and log(x3) as the predictors, do pair wise scatterplots of all pairs of variables (including the response) and comment (use the pairs function). Do you think that multi collinearity might be a problem with these data?

Plot the ridge trace for a grid of 50 values for the shrinkage parameter  over the range [0; 1]. Based on this plot suggest a reasonable value for . Find the estimates of the coecients for a ridge re gression with your chosen value of  (using centred and scaled predictors).

(The following question is based on Exercise 8.5 of Myers (1990), Classical and Modern Regression with Applications (Second Edition)," Duxbury).

With centred and scaled predictor variables, the ridge regression estimator for the coecients of the predictors is where y is the vector of responses, X is the design matrix for the centred and scaled predictors, is

1709_basic linear models.png

the shirnkage parameter and I denotes the identity matrix. We write n for the number of observations and k for the number of predictors. Writing biR for the ith component of bR, we will prove in this question that where 2 is the variance of the responses, and vi, i = 1,.......k are the eigenvalues of XTX. The di erent parts of the question below lead you through the proof.

735_basic linear models1.png

(a) Write XTX = QDQT for the eigenvalue decomposition of XTX, where D = diag(v1,........vk) is the diagonal matrix of eigenvalues and Q is an orthogonal matrix (QTQ = I) where the columns are the eigenvectors of XTX. Show that XTX +I = Q(D+I)QT .

2344_basic linear models2.png

where V ar(bR) denotes the covariance matrix of bR. (Hint: recall the result from basic linear models that if Y is a k  1 random vector with V ar(Y ) = V and if A is a k  k matrix and Z = AY then V ar(Z) = AV AT ).


Related Discussions:- Explain ridge regression

Determine the effects of stopping smoking on weight gain, Determine the Eff...

Determine the Effects of Stopping Smoking On Weight Gain As part of a study to determine the effects of stopping smoking on weight gain, nine females were weighed on the day t

Assumptions in anova, Assumptions in ANOVA The various populations f...

Assumptions in ANOVA The various populations from which the samples are drawn should be normal and have the same variance. The requirement of normality can be discarded if t

Factor analysis, Factor analysis (FA) explains variability among observed r...

Factor analysis (FA) explains variability among observed random variables in terms of fewer unobserved random variables called factors. The observed variables are expressed in

Modified distribution mathod, a b c d e supply p 3 4 6 8 8 20 q 2 6 0 5 8...

a b c d e supply p 3 4 6 8 8 20 q 2 6 0 5 8 30 r 7 11 20 40 3 15 s 1 0 9 14 6 13 d 15 3 12 10 20

Correlation, prove that coefficient of correlation lies between -1 and+1

prove that coefficient of correlation lies between -1 and+1

Active control equivalence studies (aces), Active Control Equivalence Studi...

Active Control Equivalence Studies (ACES) Clinical trials the field in which the object is easy to show that the new treatment is  as good as the existing treatment. Such type

Express the null hypothesis, Examine the given statement, then express the ...

Examine the given statement, then express the null hypothesis H0 and the alternative hypothesis H1 in symbolic form. The mean weight of women who won a beauty pageant is equal t

Skewness, Skewness Meaning and Definition  Literal meaning of skew...

Skewness Meaning and Definition  Literal meaning of skewness is lack of symmetry; it is a numerical measure which reveals asymmetry of a statistical series. According t

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd