Best subsets regression, Advanced Statistics

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.

Posted Date: 3/4/2013 6:44:10 AM | Location : United States







Related Discussions:- Best subsets regression, Assignment Help, Ask Question on Best subsets regression, Get Answer, Expert's Help, Best subsets regression Discussions

Write discussion on Best subsets regression
Your posts are moderated
Related Questions
Procrustes analysis is a technique of comparing the alternative geometrical representations of a group of multivariate data or of the proximity matrix, for instance, two competing

Probability distribution : For the discrete random variable, a mathematical formula which provides the probability of each value of variable. See, for instance, binomial distributi

Probability weighting is the procedure of attaching weights equal to inverse of the probability of being selected, to each respondent's record in the sample survey. These weights

Chi-squared distribution : It is the probability distribution, f (x), of the random variable de?ned as the sum of squares of the number (v) of independent standard normal variables

MEANING ,IMPORTANCE AND RELEAVANCE OF SCATTER DIAGRAM

Ask quesoil company is considering whether or not to bid for an offshore drilling contract. If they bid, the value would be $600m with a 65% chance of gaining the contract. The com

Play-the-winner rule is a process sometimes considered in the clinical trials in which the response to treatment is positive (a success) or negative (a failure). One of two treatm

Pie chart is an extensively used graphical technique for presenting relative frequencies related with the observed values of the categorical variable. The chart comprises of a cir

Cohort component method : A broadly used method or technique of forecasting the age- and sex-speci?c population to the upcoming years, in which the initial population is strati?ed

Non-randomized clinical trial is the clinical trial in which the series of consecutive patients receive a new treatment and those which respond (according to some of the pre-defin