Data reduction, Applied Statistics


Principal component analysis (PCA) is among the oldest multivariate statistical methods of data reduction. It simplifies a dataset by reducing multidimensional data to fewer dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original data set. By reducing the number of variables in this way, we can better understand the underlying structure of the data. The derived variables are linear combinations of the original variables.

For example, suppose students take 10 examinations, and some students do well in one examination while others do better in another. It is difficult to compare one student with another when there are 10 marks to consider. One obvious way of comparing students is to calculate the mean score, which is itself a constructed combination of the existing variables. However, one might get a more useful comparison of overall performance by considering other constructed combinations of the 10 exam marks. PCA is one way of constructing such combinations, and it does so in a way that accounts for the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.
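The exam-marks example can be sketched in a few lines of NumPy. This is only an illustrative sketch, not a reference implementation: the function name `pca`, the simulated `marks` matrix (50 students whose 10 marks share one general "ability" plus noise), and the choice of two components are all assumptions made for the illustration.

```python
import numpy as np


def pca(data, n_components):
    """Return (scores, components, explained_ratio) for the rows of `data`."""
    # Center each variable (column) on its mean.
    centered = data - data.mean(axis=0)
    # SVD of the centered data: the rows of vt are the principal directions.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    # Variance carried by each derived variable, largest first.
    variances = singular_values ** 2 / (data.shape[0] - 1)
    explained_ratio = variances / variances.sum()
    components = vt[:n_components]
    # Scores: each student's value on each derived variable.
    scores = centered @ components.T
    return scores, components, explained_ratio[:n_components]


# Hypothetical data: 50 students, 10 exam marks each.
rng = np.random.default_rng(0)
ability = rng.normal(60, 12, size=(50, 1))
marks = ability + rng.normal(0, 5, size=(50, 10))

scores, components, explained_ratio = pca(marks, n_components=2)
print("share of variance per component:", explained_ratio)
print("correlation between the derived variables:")
print(np.corrcoef(scores, rowvar=False))
```

In data simulated this way, the first derived variable should carry most of the variation and behave much like a weighted mean of the marks, while the printed correlation matrix shows that the derived variables are uncorrelated with one another.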

PCA states and then solves a well-defined statistical problem and, except in special cases, always gives a unique solution with some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough to carry out.
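The well-defined problem mentioned above can be written out as follows. The notation is an assumption introduced here, not taken from the original text: x is the vector of original variables, S is its sample covariance matrix, and w_k is the weight vector of the k-th derived variable.

```latex
% First principal component: the unit-length weight vector with maximum variance.
\[
  w_1 = \arg\max_{\|w\| = 1} \operatorname{Var}(w^{\top} x)
      = \arg\max_{\|w\| = 1} w^{\top} S w .
\]
% Later components solve the same problem, subject to being orthogonal
% to all earlier weight vectors.
\[
  w_k = \arg\max_{\substack{\|w\| = 1 \\ w^{\top} w_j = 0,\; j < k}} w^{\top} S w .
\]
% The solutions are the eigenvectors of S, ordered by eigenvalue.
\[
  S w_k = \lambda_k w_k , \qquad
  \lambda_1 \ge \lambda_2 \ge \cdots \ge 0 , \qquad
  \operatorname{Var}(w_k^{\top} x) = \lambda_k .
\]
```

Under this formulation the solution is unique up to the sign of each w_k whenever the eigenvalues are distinct; tied eigenvalues are one kind of special case in which it is not.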




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd