Discriminant analysis, Advanced Statistics

Discriminant analysis is a term covering a large number of techniques for the analysis of multivariate data that share the aim of assessing whether a set of variables distinguishes, or discriminates, between two or more groups of individuals. In medicine, for instance, such methods are commonly applied to the problem of making optimal use of the results of several tests, or of observations of several symptoms, to reach a diagnosis that can perhaps be confirmed only by post-mortem examination. In the two-group case the most commonly used method is Fisher's linear discriminant function, in which the linear function of the variables giving maximal separation between the groups is determined. This yields a classification rule that can be used to assign a new patient to one of the two groups. The derivation of the linear function assumes that the variance-covariance matrices of the two groups are the same. If they are not, a quadratic discriminant function may be needed to distinguish between the groups. Such a function contains squared terms and cross-products of the variables. The sample of observations from which the discriminant function is derived is known as the training set. When more than two groups are involved, several linear functions of the variables can be determined for separating them. In general, the number of such functions that can be derived is the smaller of q and g-1, where q is the number of variables and g is the number of groups. The collection of linear functions for discrimination is known as the canonical discriminant functions, or simply as the canonical variates.
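The two-group case described above can be illustrated with a minimal sketch in Python using NumPy. It is an illustration only, under stated assumptions: the function names fisher_lda_fit and fisher_lda_classify, the simulated training set, and the midpoint cutoff (which treats the two groups as equally likely with equal misclassification costs) are hypothetical choices made here, not part of the original entry. The coefficients are computed as the inverse of the pooled variance-covariance matrix times the difference in group means, which relies on the equal-covariance assumption mentioned in the text.

```python
import numpy as np

def fisher_lda_fit(X1, X2):
    """Estimate Fisher's linear discriminant from a two-group training set.

    X1, X2: arrays of shape (n_i, q), one row per individual.
    Returns the coefficient vector w and a midpoint cutoff (an assumption:
    equal prior probabilities and equal misclassification costs).
    """
    X1 = np.asarray(X1, dtype=float)
    X2 = np.asarray(X2, dtype=float)
    n1, n2 = len(X1), len(X2)
    m1, m2 = X1.mean(axis=0), X2.mean(axis=0)

    # Pooled within-group variance-covariance matrix (assumes both groups
    # share the same covariance structure).
    S1 = np.atleast_2d(np.cov(X1, rowvar=False))
    S2 = np.atleast_2d(np.cov(X2, rowvar=False))
    S_pooled = ((n1 - 1) * S1 + (n2 - 1) * S2) / (n1 + n2 - 2)

    # Coefficients of the linear function giving maximal separation.
    w = np.linalg.solve(S_pooled, m1 - m2)

    # Cutoff halfway between the projected group means.
    cutoff = w @ (m1 + m2) / 2.0
    return w, cutoff

def fisher_lda_classify(x, w, cutoff):
    """Assign observation(s) x to group 1 if the score exceeds the cutoff, else group 2."""
    scores = np.asarray(x, dtype=float) @ w
    return np.where(scores > cutoff, 1, 2)

# Simulated training set (hypothetical data, q = 2 variables per individual).
rng = np.random.default_rng(0)
group1 = rng.normal(loc=[2.0, 2.0], scale=1.0, size=(30, 2))
group2 = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(30, 2))

w, cutoff = fisher_lda_fit(group1, group2)

# Classify a new individual (hypothetical measurements).
new_patient = np.array([1.8, 2.1])
label = fisher_lda_classify(new_patient, w, cutoff)
print("coefficients:", w, "cutoff:", cutoff, "assigned group:", int(label))
```

With more than two groups, or when the covariance matrices clearly differ, this sketch no longer applies directly; canonical variates or a quadratic discriminant function would be the relevant tools.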

Posted Date: 7/27/2012 3:17:02 AM | Location : United States






