Best subsets regression, Advanced Statistics

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.

Posted Date: 3/4/2013 6:44:10 AM | Location : United States







Related Discussions:- Best subsets regression, Assignment Help, Ask Question on Best subsets regression, Get Answer, Expert's Help, Best subsets regression Discussions

Write discussion on Best subsets regression
Your posts are moderated
Related Questions
Build-Rite construction has received favorable publicity from guest appearances on a public TV home improvement program. Public TV programming decisions seem to be unpredictable, s

Blinder Oaxaca method: A method or technique used for assessing the effect of the role of income on racial wealth gap. The method or technique is based on the decomposition of the

The time series for RESI1, HI1 and COOK1 have appeared again with different outlier values even though the 17 outliers found early were removed.

This is an alternative to the Newton-Raphson technique for optimization (finding out the minimum or the maximum) of some function, which includes replacing the matrix of second der

Clustered data : The term applied to both the data in which the sampling units are grouped into the clusters sharing some common feature, for instance families or geographical reg

Lattice distribution : A class of probability distributions to which most of the distributions for discrete random variables used in statistics belongs. In such type of distributio

Interval-censored observations are the  observations which often occur in the context of studies of time elapsed to the particular event when subjects are not monitored regularl

Omitted covariates is a term generally found in the connection with regression modelling, where the model has been incompletely specified by not including significant covariates.

Mixture experiment is an experiment in which the two or more ingredients are blended together to form an end product. The measurements are taken on the several blends of the ingre

The model which is applicable to the longitudinal data in which the dropout process might give rise to the informative lost values. Specifically if the study protocol specifies the