Classification and regression tree technique (cart), Advanced Statistics

Classification and regression tree technique (CART): The alternative to the multiple regression and associated techniques or methods for determining subsets of the explanatory variables most significant for prediction of the response variable. Rather than ?tting the model to the sample data, a tree structure is obtained by dividing the sample recursively into the various of sets, each division being chosen so as to maximize some measure of difference in the response variable in the resulting two sets. The resulting structure often gives us the easier interpretation than a regression equation, as those variables most significant for the prediction can be quickly identi?ed. In addition this approach does not need distributional assumptions and is also more resistant to the effects of the outliers. At each stage the sample is divided on the basis of a variable, xi, according to answers to such questions as 'Is xi c' (univariate split), is ' Paixi c' (which is linear function split) and 'does xi A' (if xi is the categorical variable).
1423_regression.png
A design of the application of this method or technique is shown in the figure 35.

Posted Date: 7/26/2012 6:24:44 AM | Location : United States







Related Discussions:- Classification and regression tree technique (cart), Assignment Help, Ask Question on Classification and regression tree technique (cart), Get Answer, Expert's Help, Classification and regression tree technique (cart) Discussions

Write discussion on Classification and regression tree technique (cart)
Your posts are moderated
Related Questions
For a career woman, wearing lipstick has become an integral part of her daily life. It is not unusual for a woman to look for a lipstick that will stay on her lips and not smudge

Interim analyses : An analysis made before the planned end of a clinical trial, typically with the aim of detecting the treatment differences at the early stage and thus preventing

Modern hotels and certain establishments make use of an electronic door lock system. To open a door an electronic card is inserted into a slot. A green light indicates that the doo

K-means cluster analysis is the method of cluster analysis in which from an initial partition of observations into K clusters, each observation in turn is analysed and reassigned,

Non-response is the term generally used for the failure to give the relevant information being collected in the survey. Poor response can be because of the variety of causes, for

The graphical process most frequently used in the analysis of data from a two-by-two crossover design. For each of the subject the difference between the response variable values o

The method of summarizing the large amounts of data by forming the frequency distributions, scatter diagrams, histograms, etc., and calculating statistics like means variances and

Grade of membership model: This is the general distribution free method for the clustering of the multivariate data in which only categorical variables are included. The model ass

Kleiner Hartigan trees is a technique for displaying the multivariate data graphically as the 'trees' in which the values of the variables are coded into length of the terminal br

The Null Hypothesis - H0: β 1 = 0 i.e. there is homoscedasticity errors and no heteroscedasticity exists The Alternative Hypothesis - H1: β 1 ≠ 0 i.e. there is no homoscedasti