Data reduction, Applied Statistics

Principal component analysis (PCA) is among the oldest multivariate statistical methods of data reduction. It simplifies a dataset by reducing multidimensional data to fewer dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original dataset. By reducing the number of variables in this way, we can understand the underlying structure of the data. The derived variables are linear combinations of the original variables. For example, suppose students take 10 examinations, and some students do well in one examination while others do better in another. It is difficult to compare one student with another when there are 10 marks to consider. One obvious way of comparing students is to calculate the mean score.

The mean score is itself a constructed combination of the existing variables. However, one might get a more useful comparison of overall performance by considering other constructed combinations of the 10 exam marks. PCA is one way of constructing such combinations, and it does so in a way that accounts for the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.
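The exam-marks example can be sketched in code. The following is a minimal illustration, not a definitive implementation: the data are synthetic (200 hypothetical students whose 10 marks share one underlying "ability" plus exam-specific noise), and the function name `pca` and all variable names are assumptions introduced here. It computes the components from an eigendecomposition of the sample covariance matrix using NumPy.

```python
import numpy as np

# Synthetic, purely illustrative data: 200 hypothetical students, 10 exams.
# Each mark is a shared "ability" term plus independent exam-specific noise.
rng = np.random.default_rng(0)
ability = rng.normal(60, 10, size=(200, 1))
marks = ability + rng.normal(0, 5, size=(200, 10))


def pca(data):
    """Return (variances, components, scores) for the rows of `data`.

    Components are the eigenvectors of the sample covariance matrix,
    sorted so that the first one accounts for the most variance.
    """
    centered = data - data.mean(axis=0)
    cov = np.cov(centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]       # reorder to descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    scores = centered @ eigvecs             # derived variables, one column per component
    return eigvals, eigvecs, scores


variances, components, scores = pca(marks)
explained = variances / variances.sum()     # share of total variance per component

print("share of variance, first three components:", explained[:3].round(3))
print("loadings of the first component:", components[:, 0].round(3))
```

In this synthetic setup the first component weights all 10 exams roughly equally (up to an arbitrary overall sign), so it behaves much like the mean score; with real marks the loadings would generally differ from exam to exam.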

PCA states and then solves a well-defined statistical problem and, except in special cases, always gives a unique solution with some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough to carry out.
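For reference, the well-defined problem can be written out as follows. The notation is an assumption introduced here and does not come from the text above: $S$ is the sample covariance matrix of the original variables, $w_k$ is the weight vector of the $k$-th component, and $\lambda_k$ is its variance.

```latex
% First principal component: the unit-length weight vector
% that maximises the variance of the derived variable.
w_1 = \arg\max_{\|w\| = 1} \; w^{\top} S \, w

% Later components solve the same problem subject to
% orthogonality with the earlier ones.
w_k = \arg\max_{\substack{\|w\| = 1 \\ w^{\top} w_j = 0,\; j < k}} \; w^{\top} S \, w

% The solutions are eigenvectors of S, ordered by eigenvalue.
S \, w_k = \lambda_k w_k, \qquad \lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_p \ge 0
```

Each $w_k$ is determined only up to sign, and when two eigenvalues are equal the corresponding components are not uniquely determined; this is presumably the kind of special case the paragraph above has in mind.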


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd