Best subsets regression, Advanced Statistics

In the time series plot and scatter graphs there were many outliers that were clearly visible. These have been removed to identify if they were influential or had high leverage and in order to see if the multiple regression model assumptions have been met.

Below are the rows of the outliers that I removed out of the 1519 observations:

77, 674, 448, 757, 317, 549, 1187, 1198, 26, 456, 405, 307, 1205, 1348, 611, 368, 309

Best Subsets Regression: wfood versus totexp, income, age, nk

Response is wfood

                                                                   t i

                                                                   o n

                                                                    t c

                                                                    e o a

                               Mallows                         x m g n

Vars  R-Sq  R-Sq(adj)       Cp         S             p e e k

   1  22.9       22.9     67.4            0.092326  X

   1   5.5        5.4      424.9           0.10222    X

   2  24.8       24.7     31.3            0.091236  X     X

   2  24.2       24.1     42.7           0.091572  X   X

   3  26.1       26.0      6.1            0.090461  X   X X

   3  24.8       24.7     32.3           0.091239  X X   X

   4  26.3       26.1      5.0            0.090397  X X X X

The best subset is a way of identifying which independent variable such as the totexp, income, age and nk are best suited to the regression model.  According to the results above income is the variable that has the highest Cp and the lowest R-squared value therefore it will be the variable that will be dropped to see if the data fits the model.

Posted Date: 3/4/2013 6:44:10 AM | Location : United States







Related Discussions:- Best subsets regression, Assignment Help, Ask Question on Best subsets regression, Get Answer, Expert's Help, Best subsets regression Discussions

Write discussion on Best subsets regression
Your posts are moderated
Related Questions
a psychic claims to be able to "feel colors" there are three pieces of colored paper(red, blue,green) he will place his hand on radomly selected pieces while blindfolded. you perfo

Bayesian network : It is essentially an expert system in which the uncertainty is dealt with using the conditional probabilities and Bayes' Theorem. Formally such type of network c

MEANING ,IMPORTANCE AND RELEAVANCE OF SCATTER DIAGRAM

Hello-goodbye effect : The phenomenon initially described in psychotherapy research, but one which might arise whenever a subject is assessed on two occasions, with some interventi

Paul Jordan has just been hired as a management analyst at Digital Cell Phone Inc. Digital Cell manufactures a broad line of phones for the consumer market. Paul's boss, John Smith

Human capital model : The model for evaluating the economic implication of the disease in terms of the economic loss of a person succumbing to morbidity or the mortality at some pa

Normal approximation : Normal distributions which approximate other distributions; such as, a normal distribution with the mean np and variance np(1 - p) which acts as an approxima

Introduction to Generalized Linear Models (GLM) We introduce the notion of GLM as an extension of the traditional normal-theory-based linear regression models. This will be very

Perturbation theory : The theory useful in assessing how well a specific algorithm or the statistical model performs when the observations suffer less random changes. In very commo

Given: There are 4 jobs and 4 persons. The cost incurred for each person and each job is as follows: Persons Job 1 Job 2 Job 3 Job 4 A 10 9 21 11 B 15 12 25 17 C 12 10 20 12 D 17