Explain ridge regression, Applied Statistics

Assignment Help:

Using log(x1), log(x2) and log(x3) as the predictors, do pair wise scatterplots of all pairs of variables (including the response) and comment (use the pairs function). Do you think that multi collinearity might be a problem with these data?

Plot the ridge trace for a grid of 50 values for the shrinkage parameter  over the range [0; 1]. Based on this plot suggest a reasonable value for . Find the estimates of the coecients for a ridge re gression with your chosen value of  (using centred and scaled predictors).

(The following question is based on Exercise 8.5 of Myers (1990), Classical and Modern Regression with Applications (Second Edition)," Duxbury).

With centred and scaled predictor variables, the ridge regression estimator for the coecients of the predictors is where y is the vector of responses, X is the design matrix for the centred and scaled predictors, is

1709_basic linear models.png

the shirnkage parameter and I denotes the identity matrix. We write n for the number of observations and k for the number of predictors. Writing biR for the ith component of bR, we will prove in this question that where 2 is the variance of the responses, and vi, i = 1,.......k are the eigenvalues of XTX. The di erent parts of the question below lead you through the proof.

735_basic linear models1.png

(a) Write XTX = QDQT for the eigenvalue decomposition of XTX, where D = diag(v1,........vk) is the diagonal matrix of eigenvalues and Q is an orthogonal matrix (QTQ = I) where the columns are the eigenvectors of XTX. Show that XTX +I = Q(D+I)QT .

2344_basic linear models2.png

where V ar(bR) denotes the covariance matrix of bR. (Hint: recall the result from basic linear models that if Y is a k  1 random vector with V ar(Y ) = V and if A is a k  k matrix and Z = AY then V ar(Z) = AV AT ).


Related Discussions:- Explain ridge regression

Help!, in a normal distribution with a mean of 85 and a STD of 5, what is t...

in a normal distribution with a mean of 85 and a STD of 5, what is the percentage of scores between 75 and 90?

Histogram, Histogram: It is generally used for charting continuous fre...

Histogram: It is generally used for charting continuous frequency   distribution. In histogram, data are plotted as a series  of rectangle one over the other. Class intervals

Scatter diagram - correlation analysis, Scatter Diagram The first step ...

Scatter Diagram The first step in correlation analysis is to visualize the relationship. For each unit of observation in correlation analysis there is a pair of numerical value

Standard deviation and variance, Explanation of standard deviation and vari...

Explanation of standard deviation and variance Describe the importance of standard deviation and variance, what they calculate and why they are required. Importance of char

Risk of portfolios, Risk of Portfolios So far, we have seen the applic...

Risk of Portfolios So far, we have seen the application of standard deviation in the context of risk in single investment. But usually most investors hold portfolios of securi

Mode, Mode The mode is the value which occurs most frequ...

Mode The mode is the value which occurs most frequently in a set of observations on the point of maximum frequency and around which other items of the set cluste

Break-even analysis, a. How can break-even analysis be used in selecting a ...

a. How can break-even analysis be used in selecting a new plant site? b. What are potential advantages and disadvantage of locating a production facility in foreign country i

Hypothesis testing, the president of a certain firm concerned about the saf...

the president of a certain firm concerned about the safety record of the firms employee sets aside $50 million a year for safety education. the firms accountant believes that more

Regression, Regression line drawn as Y=C+1075x, when x was 2, and y was 239...

Regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual

Determine market interest rate, The interest rate on the three year loan is...

The interest rate on the three year loan is 0.087. Whereas the interest rate on the two year loan is 0.085 as given in A. Suppose that the liquidity premium at t=1 is 0.002 and tha

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd