Best subsets regression, Advanced Statistics

Assignment Help:

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.


Related Discussions:- Best subsets regression

Ecme algorithm, The Expectation/Conditional Maximization Either algorithm w...

The Expectation/Conditional Maximization Either algorithm which is the generalization of ECM algorithm attained by replacing some of the CM-steps of ECM which maximize the constrai

Coincidences, Coincidences : Astonishing concurrence of the events, perceiv...

Coincidences : Astonishing concurrence of the events, perceived as meaningfully related, with no apparent causal connection. Such type of events abounds in everyday life and is oft

Mantel haenszel estimator, Mantel Haenszel  estimator is  an estimator o...

Mantel Haenszel  estimator is  an estimator of assumed common odds ratio in the series of two-by-two contingency tables arising from the different populations, for instance, occ

Conditional probability, Conditional probability : The probability that an ...

Conditional probability : The probability that an event occurs given the outcome of other event. Generally written, Pr(A|B). For instance, the probability of a person being color b

Business forcastin.., elements , importance, limitation, and theories

elements , importance, limitation, and theories

Factorial moment generating function, The function of a variable t which, w...

The function of a variable t which, when extended formally as a power series in t, yields factorial moments as the coefficients of the respective powers. If the P(t) is probability

Describe meta-analysis, Meta-analysis is the collection of techniques wher...

Meta-analysis is the collection of techniques whereby the results of two or more independent studies are statistically combined to yield the overall answer to a question of intere

Buffon''s needle problem, Buffon's needle problem : A problem proposed and ...

Buffon's needle problem : A problem proposed and solved by the scientist Comte de Buffon in 1777 which includes determining the probability, p, which a needle of length l will inte

Explain randomized response technique, Randomized response technique : The ...

Randomized response technique : The procedure for collecting the information on sensitive issues by means of the survey, in which an element of chance is introduced as to what quer

Probability., 5. Packages from a machine a normally distributed with a mean...

5. Packages from a machine a normally distributed with a mean 200g and its standard deviation 2grams. Find the probability that a package from the machine weighs a) Less than

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd