Error rate estimation, Advanced Statistics

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.

Posted Date: 7/27/2012 6:53:05 AM | Location : United States







Related Discussions:- Error rate estimation, Assignment Help, Ask Question on Error rate estimation, Get Answer, Expert's Help, Error rate estimation Discussions

Write discussion on Error rate estimation
Your posts are moderated
Related Questions
It is used generally for the matrix which specifies a statistical model for a set of observations. For instance, in a one-way design with the three observations in one group, tw

Conditional logistic regression : The form of logistic regression designed to work with the clustered data, such as data including matched pairs of the subjects, in which subject-s

MEANING ,IMPORTANCE AND RELEAVANCE OF SCATTER DIAGRAM

Graphical deception : Statistical graphics which are not as honest as they should be. It is relatively simple. To mislead the unwary with the graphical material. For instance, c

The regression analysis is used to fit a model describing the relationship of a dependent variable with independent variable(s). Here we have fitted three regression models:

Hurdle Model:  The model for count data which postulates two processes, one generating the zeros in the data and one generating positive values. The binomial model decides the bina

Looking for the correct answer.Y=50+.079(149)-.261(214)=

Normality - Reasons for Screening Data Prior to analyzing multivariate normality, one should consider univariate normality Histogram, Normal Q-Qplot (values on x axis

1. The production manager of Koulder Refrigerators must decide how many refrigerators to produce in each of the next four months to meet demand at the lowest overall cost. There i

i need help for my assignment and the deadline is Friday