Discriminant analysis, Advanced Statistics

A term which covers the large number of techniques for the analysis of the multivariate data which have in common the aim to assess whether or not the set of variables distinguish or discriminate between the two or more groups of the individuals. In medicine, for instance, this type of methods are generally applied to the problem of using optimally the results from the various tests or the observations of various symptoms to make the diagnosis which can only be confirmed perhaps by the post-mortem examination. In the two group case the mainly used method is Fisher's linear discriminant function, in which a linear function of variables giving the maximal separation between the groups is then determined. This results in the classification rule which may be used to assign the new patient to one of the two groups. The derivation of the linear function supposes that the variance-covariance matrices of the two groups are the same. If they are not then a quadratic discriminant function might be essential to distinguish between the groups. Such a function comprises of powers and cross-products of variables. The sample of the observations from which the discriminant function is derived is commonly known as the training set. When more than two groups are involved then it is possible to determine the several linear functions of the variables for separating them. In common the number of such functions which can be derived is the smaller of q and g-1 where q is the number of variables and g is the number of groups. The collection of the linear functions for discrimination is called as canonical discriminant functions or simply as canonical variates.

Posted Date: 7/27/2012 3:17:02 AM | Location : United States







Related Discussions:- Discriminant analysis, Assignment Help, Ask Question on Discriminant analysis, Get Answer, Expert's Help, Discriminant analysis Discussions

Write discussion on Discriminant analysis
Your posts are moderated
Related Questions
The procedures used for determining how the quality of life is affected by the environment, in particular by factors such as air and solid wastes, water pollution, hazardous substa

Modern hotels and certain establishments make use of an electronic door lock system. To open a door an electronic card is inserted into a slot. A green light indicates that the doo

A manufacturing company has two factories F 1 and F 2 producing a certain commodity that is required at three retail outlets M 1 , M 2 and M 3 . Once produced, the commodity is

Regression to the mean is the procedure first noted by Sir Francis Galton that 'each peculiarity in man is shared by his kinsmen, but on average to the less degree.' Hence the ten

Case-control study : The traditional case-control study is the common research design in the epidemiology where the exposures to risk factors for cases (individuals getting the dis

Histogram is the graphical representation of the set of observations in which class frequencies are represented by the regions of rectangles centred on the class interval. If the f

regression line drawn as Y=C+1075x, when x was 2, and y was 239, given that y intercept was 11. calculate the residual

Bartlett's test for variances : A test for equality of the variances of the number (k)of the populations. The test statistic can be given as follows   where s square is an


Bayesian confidence interval : An interval of the posterior distribution which is so that the density of it at any point inside the interval is greater than that of the density at