Data reduction, Applied Statistics

Assignment Help:

The PCA is amongst the oldest of the multivariate statistical methods of data reduction. It is a technique for simplifying a dataset, by reducing multidimensional datasets to lower dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original data set.'By reducing the number of variables'in this way, we can understand the underlying structure of the data. 'The derived variables are combinations of the original variables. For example, it might be that students take I0 examinations and some students do well in one examination while other students do better in another. It is difficult to compare one student with another when we have 10 marks to consider. One obvious way of comparing students is to calculate the mean score.

This is a constructed combination of the existing variables. However, one might get a more useful comparison of overall performances by considering other constructed cwbinations of the 10 exam marks. The PCA is one way of constructing such combinations, doing so in such a way as to account fer the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.

PCA states and then solves a well-defined statistical problem, and except for special cases always gives a unique solution wi.th some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. Actually PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough.


Related Discussions:- Data reduction

Histogram, Histogram: It is generally used for charting continuous fre...

Histogram: It is generally used for charting continuous frequency   distribution. In histogram, data are plotted as a series  of rectangle one over the other. Class intervals

Cluster sampling, Cluster Sampling Here the population is divide...

Cluster Sampling Here the population is divided into clusters or groups and then Random Sampling is done for each cluster. Cluster Sampling differs from Stratified Sampl

Introduction to multiple regression, In simple regression the dependent var...

In simple regression the dependent variable Y was assumed to be linearly related to a single variable X. In real life, however, we often find that a dependent variable may depend o

B) Distinguish between:, X 110 120 130 120 140 135 155 160 165 155 ...

X 110 120 130 120 140 135 155 160 165 155 Y 12 18 20 15 25 30 35 20 25 10

Statistical difference, Using the raw measurement data presented below, cal...

Using the raw measurement data presented below, calculate the t value for independent groups to determine whether or not there exists a statistically significant difference between

Evaluation tracking system, BCBSRI was able to reduce MSD related Workers C...

BCBSRI was able to reduce MSD related Workers Compensation cases with lost workdays by implementing a New Ergonomic Program in March 2000 and increasing workstation evaluations. Ex

Calculate the mean, Question: The weights of 60 children born to mother...

Question: The weights of 60 children born to mothers in a small rural hospital were recorded. 3.63 3.54 3.15 3.90 4.29 4.06 2.91 3.36 3.3

Harmonic mean, The Harmonic Mean is based on the reciprocals of numbers ave...

The Harmonic Mean is based on the reciprocals of numbers averaged. It is defined as the reciprocal of the arithmetic mean of the reciprocal of the given individual observations. Th

Show the hypothesis test, The file Midterm Data.xls has a tab labeled "Inc...

The file Midterm Data.xls has a tab labeled "Income Data 2009". This data is collected income data from a sample of 400 people in 2009. Use a hypothesis test to see whether the av

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd