Error rate estimation, Advanced Statistics

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.

Posted Date: 7/27/2012 6:53:05 AM | Location : United States







Related Discussions:- Error rate estimation, Assignment Help, Ask Question on Error rate estimation, Get Answer, Expert's Help, Error rate estimation Discussions

Write discussion on Error rate estimation
Your posts are moderated
Related Questions
regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual


a psychic claims to be able to "feel colors" there are three pieces of colored paper(red, blue,green) he will place his hand on radomly selected pieces while blindfolded. you perfo

You have learned that there are 3 major central measures of any data set. Namely: mean, median, and mode. Which of the three, do the outliers affect the most?

Mauchly test is a test which a variance-covariance matrix of pair wise differences of responses in the set of longitudinal data is the scalar multiple of identity matrix, a proper

Maximum likelihood estimation is an estimation procedure involving maximization of the likelihood or the log-likelihood with respect to the parameters. Such type of estimators is

Tracking is the term sometimes used in the discussions of data from the longitudinal study, to describe the ability to predict the subsequent observations from previous values. In

Non parametric maximum likelihood (NPML) is a likelihood approach which does not need the specification of the full parametric family for the data. Usually, the non parametric max

Basic reproduction number : A term used in the theory of infectious diseases for the number of secondary cases which one case would generate in a completely susceptible population.

Common cause failures (CCF): Simultaneous failures of the number of components due to a same reason. A reason can be external to the components, or it can be the single failure wh