Classification and regression tree technique (cart), Advanced Statistics

Classification and regression tree technique (CART): The alternative to the multiple regression and associated techniques or methods for determining subsets of the explanatory variables most significant for prediction of the response variable. Rather than ?tting the model to the sample data, a tree structure is obtained by dividing the sample recursively into the various of sets, each division being chosen so as to maximize some measure of difference in the response variable in the resulting two sets. The resulting structure often gives us the easier interpretation than a regression equation, as those variables most significant for the prediction can be quickly identi?ed. In addition this approach does not need distributional assumptions and is also more resistant to the effects of the outliers. At each stage the sample is divided on the basis of a variable, xi, according to answers to such questions as 'Is xi c' (univariate split), is ' Paixi c' (which is linear function split) and 'does xi A' (if xi is the categorical variable).
A design of the application of this method or technique is shown in the figure 35.

Posted Date: 7/26/2012 6:24:44 AM | Location : United States

Related Discussions:- Classification and regression tree technique (cart), Assignment Help, Ask Question on Classification and regression tree technique (cart), Get Answer, Expert's Help, Classification and regression tree technique (cart) Discussions

Write discussion on Classification and regression tree technique (cart)
Your posts are moderated
Related Questions
It is used generally for the matrix which specifies a statistical model for a set of observations. For instance, in a one-way design with the three observations in one group, tw

Cauchy integral : The integral of the function, f (x), from a to b are de?ned in terms of the sum   In the statistics this leads to the below shown inequality for the expecte

There is high level of fluctuation in a zigzag pattern in the time series for RESI1 which indicates that there is possibly negative autocorrelation present. Column C11 show

Mortality odds ratio  is the ratio equivalent to the odds ratio used in case-control studies where the equivalent of the cases are deaths from the cause of interest and the equival

we are testing : Ho: µ=40 versus Ha: µ>40 (a= 0.01) Suppose that the test statistic is z0=2.75 based on a sample size of n=25. Assume that data are normal with mean mu and standa

Mixture experiment is an experiment in which the two or more ingredients are blended together to form an end product. The measurements are taken on the several blends of the ingre

Collector's problem : A problem which derives from the schemes in which packets of a particular brand of coffe, cereal etc., are sold with coupons, cards, or other tokens. There ar

The procedures used for determining how the quality of life is affected by the environment, in particular by factors such as air and solid wastes, water pollution, hazardous substa

Multiple comparison tests : Procedures for detailed examination of the differences between a set of means, generally after a general hypothesis that they are all equal has been rej

Cascadedparameters: A group of parameters which is interlinked and where selecting the value for the ?rst parameter affects the choice and option available in the subsequent param