Data reduction, Applied Statistics


Principal component analysis (PCA) is among the oldest multivariate statistical methods of data reduction. It simplifies a dataset by reducing multidimensional data to fewer dimensions for analysis. It produces a small number of derived variables that are uncorrelated and that account for most of the variation in the original dataset. By reducing the number of variables in this way, we can understand the underlying structure of the data. The derived variables are linear combinations of the original variables. For example, suppose students take 10 examinations, and some students do well in one examination while others do better in another. It is difficult to compare one student with another when we have 10 marks to consider. One obvious way of comparing students is to calculate the mean score.

The mean is itself a constructed combination of the existing variables. However, one might get a more useful comparison of overall performance by considering other constructed combinations of the 10 exam marks. PCA is one way of constructing such combinations, and it does so in a way that accounts for the maximum possible variation in the original data. We can then compare students' performance by considering this much smaller number of variables.
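The exam-marks example can be made concrete with a small sketch. The data below are simulated, not real: the "general ability plus noise" model, the sample size, and the helper name `pca` are assumptions chosen for illustration only. The sketch centers the marks, takes a singular value decomposition with NumPy, and reports how much of the total variance each derived variable explains.

```python
import numpy as np

def pca(data):
    """Principal components of an (n_samples, n_variables) array via SVD.

    Returns the component directions (one per row), the fraction of total
    variance explained by each, and the derived variables (scores).
    """
    centered = data - data.mean(axis=0)
    # Rows of vt are orthonormal directions, ordered by decreasing variance.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    variances = singular_values**2 / (data.shape[0] - 1)
    ratio = variances / variances.sum()
    scores = centered @ vt.T
    return vt, ratio, scores

# Simulated marks (assumption): one shared "ability" per student plus
# independent noise on each of the 10 exams.
rng = np.random.default_rng(0)
n_students, n_exams = 200, 10
ability = rng.normal(60, 12, size=(n_students, 1))
noise = rng.normal(0, 5, size=(n_students, n_exams))
marks = ability + noise

components, ratio, scores = pca(marks)

# Fraction of total variance explained by the first three derived variables.
print(ratio[:3].round(3))

# The derived variables are uncorrelated: their covariance matrix is diagonal.
score_cov = np.cov(scores, rowvar=False)
```

With data of this shape, the first derived variable carries most of the variation, so students could be compared on one or two scores instead of ten marks. Real exam data need not behave this neatly.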

PCA states and then solves a well-defined statistical problem and, except in special cases, always gives a unique solution with some very nice mathematical properties. We can even describe some very artificial practical problems for which PCA provides the exact solution. The difficulty comes in trying to relate PCA to real-life scientific problems; the match is simply not very good. Actually, PCA often provides a good approximation to common factor analysis, but that feature is now unimportant since both methods are now easy enough to carry out.
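For reference, the well-defined problem can be written out as follows. The notation is an assumption introduced here, not taken from the text above: x is the vector of original variables, Σ its covariance matrix, and w a vector of weights.

```latex
% First principal component: the unit-length weight vector that
% maximizes the variance of the combination w^T x.
\[
  w_1 = \arg\max_{w^\top w = 1} \operatorname{Var}(w^\top x)
      = \arg\max_{w^\top w = 1} w^\top \Sigma\, w .
\]
% Setting the gradient of the Lagrangian
% w^T \Sigma w - \lambda (w^T w - 1) to zero gives
\[
  \Sigma\, w = \lambda\, w ,
  \qquad
  \operatorname{Var}(w^\top x) = w^\top \Sigma\, w = \lambda ,
\]
% so w_1 is an eigenvector of \Sigma belonging to its largest
% eigenvalue \lambda_1. Later components solve the same problem
% subject to being orthogonal to the earlier ones.
```

Under this formulation the solution is unique up to the sign of each w when the eigenvalues of Σ are all distinct; repeated eigenvalues are presumably the special cases mentioned above.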


