Data reduction, Applied Statistics

Assignment Help:

The PCA is amongst the oldest of the multivariate statistical methods of data reduction. It is a technique for simplifying a dataset, by reducing multidimensional datasets to lower dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original data set.'By reducing the number of variables'in this way, we can understand the underlying structure of the data. 'The derived variables are combinations of the original variables. For example, it might be that students take I0 examinations and some students do well in one examination while other students do better in another. It is difficult to compare one student with another when we have 10 marks to consider. One obvious way of comparing students is to calculate the mean score.

This is a constructed combination of the existing variables. However, one might get a more useful comparison of overall performances by considering other constructed cwbinations of the 10 exam marks. The PCA is one way of constructing such combinations, doing so in such a way as to account fer the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.

PCA states and then solves a well-defined statistical problem, and except for special cases always gives a unique solution wi.th some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. Actually PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough.


Related Discussions:- Data reduction

Statistics, Theories of Business forecasting

Theories of Business forecasting

Lorenz curve , Lorenz Curve   It is a graphic method of measur...

Lorenz Curve   It is a graphic method of measuring dispersion. This curve was devised by Dr. Max o Lorenz a famous statistician.  He used this technique for wealth it i

Statisttics., Explain any two applications of statistics

Explain any two applications of statistics

Hypothesis testing, the president of a certain firm concerned about the saf...

the president of a certain firm concerned about the safety record of the firms employee sets aside $50 million a year for safety education. the firms accountant believes that more

Question with R - Bioinformatics, Hi There, I have a question regarding R,...

Hi There, I have a question regarding R, and I am wondering if anyone can help me. Here is a code that I would like to understand: squareFunc g f(x)^2 } return(g) } sin

Using the asymptotic distribution test the hypothesis, You are interested i...

You are interested in testing the distance of two golf balls, Brand A and Brand B. You take a random sample of 100 golfers, each of whom hits Brand A once and Brand B once. Define

Frailty in multi state models, how can i use continuous frailty in multi st...

how can i use continuous frailty in multi state models?

Probability theory, Origin and Development of probability Theory: The c...

Origin and Development of probability Theory: The credit for origin and development of probability goes to the European gamblers of 17 th century. They  used to gamble  on gam

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd