Data reduction, Applied Statistics

Assignment Help:

The PCA is amongst the oldest of the multivariate statistical methods of data reduction. It is a technique for simplifying a dataset, by reducing multidimensional datasets to lower dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original data set.'By reducing the number of variables'in this way, we can understand the underlying structure of the data. 'The derived variables are combinations of the original variables. For example, it might be that students take I0 examinations and some students do well in one examination while other students do better in another. It is difficult to compare one student with another when we have 10 marks to consider. One obvious way of comparing students is to calculate the mean score.

This is a constructed combination of the existing variables. However, one might get a more useful comparison of overall performances by considering other constructed cwbinations of the 10 exam marks. The PCA is one way of constructing such combinations, doing so in such a way as to account fer the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.

PCA states and then solves a well-defined statistical problem, and except for special cases always gives a unique solution wi.th some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. Actually PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough.


Related Discussions:- Data reduction

Measurement errors models, How can we analyse data with four bilateral resp...

How can we analyse data with four bilateral response variables measured with errors and three covariated measured without errors?

Statisttics., Explain any two applications of statistics

Explain any two applications of statistics

Create the venn diagram, Create the Venn diagram: A   - you work for a...

Create the Venn diagram: A   - you work for an insurance company.  80% of your company's staff is sales force and 70% of your company's sales is force is male. in your company

Confirmatory factor analysis, Confirmatory factor analysis (CFA) seeks to d...

Confirmatory factor analysis (CFA) seeks to determine whether the number of factors and the loadings of measured (indicator) variables on them conform to what is expected on the ba

Financial payments technology, Suppose the money supply process is now repr...

Suppose the money supply process is now represented by the following function: where m measures the sensitivity of money supply with respect to the interest rate. (i) Us

Disadvantages of median, Disadvantages For calculating median it is ...

Disadvantages For calculating median it is necessary to arrange the data; other averages do not need any arrangement. Since it is a positional average, its value is not d

Difference in goals between pca and fa, In PCA the eigknvalues must ultimat...

In PCA the eigknvalues must ultimately account for all of the variance. There is no probability,'no hypothesis, no test because strictly speaking PCA is not a statistical procedure

Describe the opportunities for statistical learning, 1. Recognize and expla...

1. Recognize and explain the opportunities for statistical learning. 2. Describe how the use of statistics supports student learning. 3. Recognize appropriate data displays a

Expected utility maximizer, The investor has constant wealth 1 and is o?ere...

The investor has constant wealth 1 and is o?ered to invest in shares of a project that either gains 3=2 or loses 1 with equal probabilities. Therefore, if the investor obtains sha

Coefficient of determination, Coefficient of Determination The c...

Coefficient of Determination The coefficient of determination is given by r 2 i.e., the square of the correlation coefficient. It explains to what extent the variation

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd