Classification and regression tree technique (cart), Advanced Statistics

Classification and regression tree technique (CART): The alternative to the multiple regression and associated techniques or methods for determining subsets of the explanatory variables most significant for prediction of the response variable. Rather than ?tting the model to the sample data, a tree structure is obtained by dividing the sample recursively into the various of sets, each division being chosen so as to maximize some measure of difference in the response variable in the resulting two sets. The resulting structure often gives us the easier interpretation than a regression equation, as those variables most significant for the prediction can be quickly identi?ed. In addition this approach does not need distributional assumptions and is also more resistant to the effects of the outliers. At each stage the sample is divided on the basis of a variable, xi, according to answers to such questions as 'Is xi c' (univariate split), is ' Paixi c' (which is linear function split) and 'does xi A' (if xi is the categorical variable).
1423_regression.png
A design of the application of this method or technique is shown in the figure 35.

Posted Date: 7/26/2012 6:24:44 AM | Location : United States







Related Discussions:- Classification and regression tree technique (cart), Assignment Help, Ask Question on Classification and regression tree technique (cart), Get Answer, Expert's Help, Classification and regression tree technique (cart) Discussions

Write discussion on Classification and regression tree technique (cart)
Your posts are moderated
Related Questions
Continuous variable : The measurement which is not restricted to the particular values except in so far as this is constrained by the accuracy of measuring instrument. General exam

This is the powerful visualization tool for studying how the response relies on an explanatory variable given the values of other explanatory variables. The plot comprises of a num

Cluster analysis : A set of methods or techniques for constructing a sensible and informative classi?cation of an initially unclassi?ed set of data, using variable values observed

Geometric distribution: The probability distribution of the number of trials (N) before the first success in the sequence of Bernoulli trials. Specifically the distribution is can

Atomistic fallacy : A fallacy which arises because of the association between two variables at the individual level might vary from the association between the same two variables m

Prior distributions : The probability distributions which summarize the information about a random variable or parameter known or supposed at a given time instant, prior to attaini

Behrens Fisher problem : The difficulty of testing for the equality of the means of the two normal distributions which do not have the equal variance. Various test statistics have

The model which arises in the context of estimating the size of the closed population where individuals within the population could be identified only during some of the observatio

Incidental parameter problem is a problem which sometimes occurs when the number of parameters increases in the tandem with the number of observations. For instance, models for pa

Occam's razor  is an early statement of the parsimony principle, which was given by William of Occam (1280-1349) namely 'entia non sunt multiplicanda praeter necessitatem'; which m