Best subsets regression, Advanced Statistics

Assignment Help:

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.


Related Discussions:- Best subsets regression

Find distribution - expected value and variance, We are installing a router...

We are installing a router for our network. We believe that the time between the arrival of packets will be exponentially distributed with parameter R = 2 packets/second, and th

Describe nuisance parameter, Nuisance parameter : The parameter of the mode...

Nuisance parameter : The parameter of the model in which there is no scienti?c interest but whose values are generally required (but in usual are unknown) to make inferences about

Ecme algorithm, The Expectation/Conditional Maximization Either algorithm w...

The Expectation/Conditional Maximization Either algorithm which is the generalization of ECM algorithm attained by replacing some of the CM-steps of ECM which maximize the constrai

Individual differences, Individual differences scaling is a form of multid...

Individual differences scaling is a form of multidimensional scaling applicable to the data comprising of a number of proximity matrices from the different sources that is differe

Independent component analysis (ica), Independent component analysis (ICA) ...

Independent component analysis (ICA) is the technique for analyzing the complex measured quantities thought to be mixtures of other more fundamental quantities, into their fundamen

Explain initial data analysis (ida), Initial data analysis (IDA): The firs...

Initial data analysis (IDA): The first phase in the examination of the data set which comprises  number of informal steps including the following steps * checking the quality o

Quality control procedures, Quality control procedures is the statistical ...

Quality control procedures is the statistical process designed to ensure that the precision and accuracy of, for instance, a laboratory test, are maintained within the acceptable

Cluster analysis, Cluster analysis : A set of methods or techniques for con...

Cluster analysis : A set of methods or techniques for constructing a sensible and informative classi?cation of an initially unclassi?ed set of data, using variable values observed

Reinterviewing, Reinterviewing  is the second interview for a sample of sur...

Reinterviewing  is the second interview for a sample of survey respondents in which questions of the original interview (or the subset of them) are repeated again. The same methods

Exponential order statistics model, The model which arises in the context o...

The model which arises in the context of estimating the size of the closed population where individuals within the population could be identified only during some of the observatio

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd