Best subsets regression, Advanced Statistics

Assignment Help:

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.


Related Discussions:- Best subsets regression

Accelerated life testing, Normal 0 false false false EN...

Normal 0 false false false EN-US X-NONE X-NONE

Range, Range is the difference between the largest and smallest observatio...

Range is the difference between the largest and smallest observations in the data set. Commonly used as an easy-to-calculate measure of the dispersion in the set of observations b

Comparative exposure rate, Comparative exposure rate : A measure of allianc...

Comparative exposure rate : A measure of alliance for use in a matched case-control study, de?ned as the ratio of the number of case-control pairs, where the case has greater expos

Factorial moment generating function, The function of a variable t which, w...

The function of a variable t which, when extended formally as a power series in t, yields factorial moments as the coefficients of the respective powers. If the P(t) is probability

Finite mixture distribution, The probability distribution which is a linear...

The probability distribution which is a linear function of the number of component probability distributions. This type of distributions is used to model the populations thought to

Lipstick Dilemma, For a career woman, wearing lipstick has become an integr...

For a career woman, wearing lipstick has become an integral part of her daily life. It is not unusual for a woman to look for a lipstick that will stay on her lips and not smudge

Naor''s distribution, Naor's distribution is the discrete probability dist...

Naor's distribution is the discrete probability distribution which arises from the following model; Assume an urn contains n balls of which one is red and the remainder is whit

Generaliz ability theory, The theory of measurement which recognizes that i...

The theory of measurement which recognizes that in any measurement situation there are multiple (actually infinite) sources of variation (known as facets in the theory), and that a

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd