Classification and regression tree technique (CART), Advanced Statistics

Classification and regression tree technique (CART): An alternative to multiple regression and associated techniques for determining the subsets of explanatory variables most important for predicting the response variable. Rather than fitting a model to the sample data, a tree structure is generated by dividing the sample recursively into a number of groups, each division being chosen so as to maximize some measure of the difference in the response variable between the two resulting groups. The resulting structure is often easier to interpret than a regression equation, as the variables most important for prediction can be quickly identified. In addition, this approach does not require distributional assumptions and is more resistant to the effects of outliers. At each stage the sample is split on the basis of a variable, x_i, according to the answers to such questions as 'Is x_i ≤ c?' (univariate split), 'Is Σ a_i x_i ≤ c?' (linear function split) and 'Does x_i ∈ A?' (if x_i is a categorical variable).
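The three kinds of splitting question can be written as simple yes/no tests on a single observation. The following is a minimal Python sketch. The function names and the representation of an observation as a plain sequence are assumptions made for illustration, not part of any particular CART implementation.

```python
from typing import Sequence, Set


def univariate_split(x: Sequence[float], i: int, c: float) -> bool:
    """Answer 'Is x_i <= c?' for one observation x."""
    return x[i] <= c


def linear_split(x: Sequence[float], a: Sequence[float], c: float) -> bool:
    """Answer 'Is sum_i a_i * x_i <= c?' for one observation x."""
    return sum(ai * xi for ai, xi in zip(a, x)) <= c


def categorical_split(x: Sequence, i: int, A: Set) -> bool:
    """Answer 'Does x_i belong to the set A?' for a categorical variable x_i."""
    return x[i] in A
```

Observations that answer "yes" go to one branch of the tree and those that answer "no" go to the other, so each question divides the sample into two groups.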
1423_regression.png
An example of the application of this method is shown in Figure 35.
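The recursive partitioning described above can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not a reference implementation: it handles only univariate splits on numeric variables with a continuous response, and it takes the reduction in the sum of squared deviations as the "measure of difference", which is one common choice among several. The names `sse`, `best_split`, `grow` and the `min_size` stopping rule are hypothetical.

```python
from statistics import mean


def sse(y):
    """Sum of squared deviations of y about its mean (0 for an empty list)."""
    if not y:
        return 0.0
    m = mean(y)
    return sum((v - m) ** 2 for v in y)


def best_split(X, y):
    """Return (gain, i, c) for the question 'Is x_i <= c?' that most
    reduces the sum of squared deviations, or None if no split exists."""
    best = None
    total = sse(y)
    for i in range(len(X[0])):
        values = sorted(set(row[i] for row in X))
        # Candidate cut points lie midway between adjacent observed values.
        for lo, hi in zip(values, values[1:]):
            c = (lo + hi) / 2
            left = [v for row, v in zip(X, y) if row[i] <= c]
            right = [v for row, v in zip(X, y) if row[i] > c]
            gain = total - sse(left) - sse(right)
            if best is None or gain > best[0]:
                best = (gain, i, c)
    return best


def grow(X, y, min_size=2):
    """Recursively divide the sample; each leaf stores the mean response."""
    split = best_split(X, y) if len(y) >= 2 * min_size else None
    if split is None or split[0] <= 0:
        return {"predict": mean(y)}
    _, i, c = split
    left = [(r, v) for r, v in zip(X, y) if r[i] <= c]
    right = [(r, v) for r, v in zip(X, y) if r[i] > c]
    if len(left) < min_size or len(right) < min_size:
        return {"predict": mean(y)}
    return {
        "var": i,
        "cut": c,
        "yes": grow([r for r, _ in left], [v for _, v in left], min_size),
        "no": grow([r for r, _ in right], [v for _, v in right], min_size),
    }


# Toy data: the response jumps when the single explanatory variable exceeds 3.
X = [[1], [2], [3], [10], [11], [12]]
y = [1.0, 1.0, 1.0, 5.0, 5.0, 5.0]
tree = grow(X, y)
```

On this toy data the tree splits once, on variable 0 at the cut point 6.5, and then stops because each group is too small to divide again. Reading the variables that appear near the root shows which ones matter most for prediction, which is the interpretability advantage noted above.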

Posted Date: 7/26/2012 6:24:44 AM | Location : United States






