Error rate estimation, Advanced Statistics

Assignment Help:

The term used for the estimation of the misclassification rate in the discriminant analysis. Number of techniques has been proposed for two-group situation, but the multiple-group situation has rarely been addressed. The easiest procedure is the resubstitution technique, in which the training data are classified using the estimated classification rule and proportion incorrectly placed used as the estimate of misclassification rate. This technique is known to have a large optimistic bias, but it has the benefit that it can be applied to the multigroup problems with no modification required. An alternative technique is the leave one out estimator, in which each of the observation in turn is removed from the data and the classification rule recomputed using remaining data. The proportion improperly classified by the procedure will have reduced bias compared to resubstitution technique. This method can also be implied to the multi group problem with no modification but it has the large amount of variance.


Related Discussions:- Error rate estimation

Odds ratio, Odds ratio is the ratio of the odds for the binary variable in...

Odds ratio is the ratio of the odds for the binary variable in two groups of the subjects, such as, males and females. If the two possible states of variable are labeled as 'succe

Empirical likelihood, An approach of using the likelihood as the basis of e...

An approach of using the likelihood as the basis of estimation without the requirement to specify a parametric family for data. Empirical likelihood can be viewed as the example of

Weathervane plot, Weathervane plot is the graphical display of the multiva...

Weathervane plot is the graphical display of the multivariate data based on bubble plot. The latter is enhanced by the addiction of the lines whose lengths and directions code the

Multidimensional scaling (mds), Multidimensional scaling (MDS)  is a generi...

Multidimensional scaling (MDS)  is a generic term for a class of techniques or methods which attempt to construct a low-dimensional geometrical representation of the proximity matr

Define matching coefficient, Matching coefficient is a similarity coeffici...

Matching coefficient is a similarity coefficient for data consisting of the number of binary variables which is often used in cluster analysis. It can be given as follows    he

Factor scores, The values assigned to factors for the individual sample uni...

The values assigned to factors for the individual sample units in a factor analysis. The most common approach is "regression method". When the factors are seen as the random variab

Distribution free methods, The statistical methods for estimation and infer...

The statistical methods for estimation and inference which are based on a function of sample observations, probability distribution of which does not rely upon a complete speci?cat

Omitted covariates, Omitted covariates is a term generally found in the co...

Omitted covariates is a term generally found in the connection with regression modelling, where the model has been incompletely specified by not including significant covariates.

Explain historical controls, Historical controls : The group of patients tr...

Historical controls : The group of patients treated in the past with the standard therapy, taken in use as the control group for evaluating the new treatment on the present patient

Describe hurdle model, Hurdle Model:  The model for count data which postul...

Hurdle Model:  The model for count data which postulates two processes, one generating the zeros in the data and one generating positive values. The binomial model decides the bina

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd