Data reduction, Applied Statistics


Principal component analysis (PCA) is among the oldest of the multivariate statistical methods of data reduction. It is a technique for simplifying a dataset by reducing multidimensional data to a lower number of dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original dataset. By reducing the number of variables in this way, we can understand the underlying structure of the data. The derived variables are combinations of the original variables. For example, it might be that students take 10 examinations and some students do well in one examination while other students do better in another. It is difficult to compare one student with another when we have 10 marks to consider. One obvious way of comparing students is to calculate the mean score.

The mean is itself a constructed combination of the existing variables. However, one might get a more useful comparison of overall performance by considering other constructed combinations of the 10 exam marks. PCA is one way of constructing such combinations, doing so in such a way as to account for the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.
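The exam example above can be sketched in a few lines of Python. This is only an illustrative sketch, not a definitive implementation: the marks are synthetic (generated from a made-up "ability" score plus noise), and the function name `pca` and its parameters are hypothetical choices for this example. It centres the marks, takes the eigenvectors of the sample covariance matrix as the weights of the derived variables, and reports the share of total variance each one accounts for.

```python
import numpy as np

def pca(marks, n_components=2):
    """PCA of a students-by-exams array via the covariance eigendecomposition.

    Returns (scores, components, explained): the derived variables per student,
    the weights that define them, and the fraction of variance each explains.
    """
    centered = marks - marks.mean(axis=0)        # remove each exam's mean
    cov = np.cov(centered, rowvar=False)         # exams-by-exams covariance
    eigvals, eigvecs = np.linalg.eigh(cov)       # eigh: symmetric matrix, ascending order
    order = np.argsort(eigvals)[::-1]            # re-sort by largest variance first
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    components = eigvecs[:, :n_components]       # weights of the derived variables
    scores = centered @ components               # each student's derived variables
    explained = eigvals[:n_components] / eigvals.sum()
    return scores, components, explained

# Synthetic data (an assumption for illustration): 50 students, 10 exams.
rng = np.random.default_rng(0)
ability = rng.normal(60, 12, size=(50, 1))            # hypothetical overall ability
marks = ability + rng.normal(0, 5, size=(50, 10))     # exam marks = ability + noise

scores, components, explained = pca(marks, n_components=2)
mean_score = marks.mean(axis=1)                       # the "obvious" combination

print("share of variance explained:", explained)
print("|corr(PC1, mean score)|:", abs(np.corrcoef(scores[:, 0], mean_score)[0, 1]))
```

With data built this way, the first derived variable behaves much like the mean score, because a single common factor drives all 10 marks. On real marks the weights can differ noticeably from an equal-weight mean, which is where PCA may give a more informative comparison.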

PCA states and then solves a well-defined statistical problem and, except for special cases, always gives a unique solution with some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. In practice, PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough to carry out.
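For reference, the well-defined problem mentioned above can be written out as follows. The notation is an assumption introduced here, not taken from the text: x is the vector of original variables, Σ its covariance matrix, and a_k the weight vector of the k-th derived variable.

```latex
% First principal component: the unit-length combination with maximum variance
\mathbf{a}_1 = \arg\max_{\mathbf{a}} \; \mathbf{a}^{\top} \Sigma \, \mathbf{a}
\quad \text{subject to} \quad \mathbf{a}^{\top}\mathbf{a} = 1

% The solution satisfies the eigenvalue equation, with \lambda_1 the largest eigenvalue
\Sigma \, \mathbf{a}_1 = \lambda_1 \mathbf{a}_1,
\qquad \operatorname{Var}(\mathbf{a}_1^{\top}\mathbf{x}) = \lambda_1

% k-th component: same criterion, orthogonal to the earlier weight vectors
\mathbf{a}_k = \arg\max_{\mathbf{a}} \; \mathbf{a}^{\top} \Sigma \, \mathbf{a}
\quad \text{subject to} \quad \mathbf{a}^{\top}\mathbf{a} = 1, \;\;
\mathbf{a}^{\top}\mathbf{a}_j = 0 \;\; (j < k)
```

Under this formulation the derived variables are uncorrelated, and each a_k is determined up to sign whenever the eigenvalues of Σ are distinct. Repeated eigenvalues are one example of a special case in which the solution is not unique.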



All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd