Best subsets regression, Advanced Statistics

Assignment Help:

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.


Related Discussions:- Best subsets regression

Cohort study, Cohort study : An investigation in which the group of individ...

Cohort study : An investigation in which the group of individuals (or the cohort) is identi?ed and followed prospectively, possibly for many years, and their subsequent medical his

Doubly multivariate data, This term is sometimes used for the data collecte...

This term is sometimes used for the data collected in those longitudinal studies in which more than the single response variable is recorded for each subject on each occasion. For

Cube law, A law supposedly applicable to voting behaviour which has a histo...

A law supposedly applicable to voting behaviour which has a history of several decades. It may be stated thus: Consider a two-party system and suppose that the representatives of t

Factorial moment generating function, The function of a variable t which, w...

The function of a variable t which, when extended formally as a power series in t, yields factorial moments as the coefficients of the respective powers. If the P(t) is probability

Projection pursuit, Projection pursuit is a procedure for attaning a low-d...

Projection pursuit is a procedure for attaning a low-dimensional (usually two-dimensional) representation of the multivariate data, which will be particularly useful in revealing

Convex hull trimming, Convex hull trimming : A procedure which can be appli...

Convex hull trimming : A procedure which can be applied to the set of bivariate data to permit robust estimation of the Pearson's product moment correlation coef?cient. The points

Evidence-based medicine (ebm), Described by the leading proponent as 'the c...

Described by the leading proponent as 'the conscientious, explicit, and judicious uses of present best evidence in making the decisions about the care of individual patients, and

O''brien''s two-sample tests, O'Brien's two-sample tests are the extension...

O'Brien's two-sample tests are the extensions of the conventional tests for assessing the differences between treatment groups which take account of the possible heterogeneous nat

Intercropping experiments, Intercropping experiments are the experiments i...

Intercropping experiments are the experiments including growing two or more crops at same time on the same patch of land. The crops are not required to be planted nor harvested at

Relative poverty statistics, Relative poverty statistics is the statistics...

Relative poverty statistics is the statistics on the properties of populations falling below given fractions of average income which play a central role in debate of poverty. The

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd