Classification and regression tree technique (cart), Advanced Statistics

Assignment Help:

Classification and regression tree technique (CART): The alternative to the multiple regression and associated techniques or methods for determining subsets of the explanatory variables most significant for prediction of the response variable. Rather than ?tting the model to the sample data, a tree structure is obtained by dividing the sample recursively into the various of sets, each division being chosen so as to maximize some measure of difference in the response variable in the resulting two sets. The resulting structure often gives us the easier interpretation than a regression equation, as those variables most significant for the prediction can be quickly identi?ed. In addition this approach does not need distributional assumptions and is also more resistant to the effects of the outliers. At each stage the sample is divided on the basis of a variable, xi, according to answers to such questions as 'Is xi c' (univariate split), is ' Paixi c' (which is linear function split) and 'does xi A' (if xi is the categorical variable).
1423_regression.png
A design of the application of this method or technique is shown in the figure 35.


Related Discussions:- Classification and regression tree technique (cart)

RESEARCH METHODS AND STATISTICS.., a researcher is interested in whether st...

a researcher is interested in whether students who attend privte high schools have higher average SAT Scores than students in the general population. a random sample of 90 student

Explain Geometric distribution, Geometric distribution: The probability di...

Geometric distribution: The probability distribution of the number of trials (N) before the first success in the sequence of Bernoulli trials. Specifically the distribution is can

Degenerate distributions, The special cases of the probability distribution...

The special cases of the probability distributions in which the random variable's distribution is concentrated at one point only. For instance, a discrete uniform distribution when

Describe Generalized principal components analysis, Generalized principal c...

Generalized principal components analysis: The non-linear version of the principal components analysis in which the goal is to determine the non-linear coordinate system which is

Inferetial statistics, wat iz z difference b/n logistic regression and mul...

wat iz z difference b/n logistic regression and multiple regression analysis /

Extreme values, The biggest and smallest variate values among the sample of...

The biggest and smallest variate values among the sample of observations. Significant in various regions, for instance flood levels of the river, speed of wind and snowfall.

Explain perturbation theory, Perturbation theory : The theory useful in ass...

Perturbation theory : The theory useful in assessing how well a specific algorithm or the statistical model performs when the observations suffer less random changes. In very commo

Historigram, difference between histogram and historigram

difference between histogram and historigram

Cluster analysis, Cluster analysis : A set of methods or techniques for con...

Cluster analysis : A set of methods or techniques for constructing a sensible and informative classi?cation of an initially unclassi?ed set of data, using variable values observed

Current status data, The Current status data arise in the survival analysis...

The Current status data arise in the survival analysis if the observations are limited to the indicators of whether or not the event of interest has happened at the time the sample

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd