A multivariate analysis

Reference no: EM131113006

1. A multivariate analysis has been undertaken on 22 non-metric (binary observation) variables recorded from 2026 skulls belonging to skeletons recovered from two burial sites of different historic periods: 18th century Spitalfields, London, and 4th century Poundbury, Dorset. The main aim of the analysis was to investigate differences between the skulls of the two sites on the basis of the recorded data. Using information collected on each skeleton, investigators were able to classify each skull as that of a female, juvenile or male, so that six subgroups were easily identified within the data. It was not possible in all cases to determine the sex of the juvenile skeletons, so these skulls are included as a single group at each site.

Inevitably, due to the condition of the skulls, it was not possible to observe the presence or absence of all 22 features on each skull, so the data set contains many missing values, and one of the important aspects of the analysis undertaken was to take account of different methods for adjusting for this.

The first analysis presented below considers the matrix of (squared) Pythagorean distances calculated using all 22 variables following an angular transformation, with adjustments for bias based on the number of observations. Three bias adjustments were made, resulting in three distance matrices, D1, D2 and D3, the adjustments accounting for the total group size, the average number of observations for each variable (within each group) or the minimum number of observations, respectively. Table 1 shows the matrix of squared Pythagorean distances D2 between the six groups (labelled PF, PJ and PM for the female, juvenile and male groups from Poundbury, and SF, SJ and SM for the corresponding Spitalfields groups), which has been bias-adjusted using the average number of observations recorded for each variable.

Table 1. Matrix of Squared Pythagorean Distances

 

840_1.png

(a) Discuss briefly the aims of classical and ordinal scaling generally, making clear the difference between the two approaches.

(b) Describe why adding the minimum spanning tree is useful in determining whether there are any distortions as a result of a low-dimensional representation of multidimensional distances.

(c) What do Figures 1 and 2 tell us about the relationships between the sex and age groups of each of the Spitalfields and Poundbury sites? Are any differences apparent between the two plots? In each case, does the superimposed minimum spanning tree indicate any problems with the representation?

378_1.png

 

584_1.png

 

2136_2.png

 

(e) Explain why a classical scaling of the RLa values is an appropriate means of comparing the representations of the 'skull' groups. Comment on what Figure 3 tells us about the differences between configurations based on the differing bias adjustments adopted in each distance measure.

The analysis was repeated using canonical variates, using just five of the original (presence/absence) variables, which were chosen to maximize the number of complete measurements across all six population groups. The numbers of measurements used in the analysis are recorded in Table 3 below.

Table 3. Complete measurements for each group on five variables

              Female   Juve.   Male
Spitalfields     112      27    115
Poundbury        274      34    272

The canonical variates analysis produced the following eigenvalues of W^{-1}B:

λ1 = 0.6693, λ2 = 0.2240, λ3 = 0.0089, λ4 = 0.0038, λ5 = 0.0005.
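As a rough check on how much of the between-group separation each canonical variate carries, the five eigenvalues quoted above can be converted to proportions of their total. The sketch below is illustrative only: the variable names are ours, and reading each eigenvalue's share of the sum as its share of between-group variation is the usual convention rather than something stated in the question.

```python
# Eigenvalues of the canonical variates analysis, copied from the text above.
eigenvalues = [0.6693, 0.2240, 0.0089, 0.0038, 0.0005]

total = sum(eigenvalues)
proportions = [ev / total for ev in eigenvalues]

# Running total of the proportions, one entry per canonical variate.
cumulative = []
running = 0.0
for share in proportions:
    running += share
    cumulative.append(running)

for i, (share, cum) in enumerate(zip(proportions, cumulative), start=1):
    print(f"CV{i}: proportion = {share:.4f}, cumulative = {cum:.4f}")
```

On these figures the first two canonical variates account for roughly 98.5% of the total, which is consistent with plotting the group means in two dimensions.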

The group mean vectors have been plotted with respect to the first two canonical variates in Figure 4, with 95% confidence circles for the true means, and Figure 5 adds the minimum spanning tree. Additionally, Table 4 below shows the (squared) Mahalanobis distances calculated between each pair of groups.
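The question does not say how the 95% confidence circles were constructed. A common convention in canonical variates analysis, assumed here, is a circle of radius sqrt(c / n) around each group mean, where n is the group size, c is the upper 5% point of a chi-squared distribution on 2 degrees of freedom, and the canonical variates are scaled to unit within-group variance. The sketch below applies that assumed rule to the group sizes in Table 3; the dictionary and variable names are ours.

```python
import math

# Upper 5% point of chi-squared on 2 d.f.; for 2 d.f. the CDF is 1 - exp(-x/2),
# so the quantile has the closed form -2 * ln(alpha).
alpha = 0.05
chi2_crit = -2.0 * math.log(alpha)

# Complete-case counts from Table 3, keyed by the group labels used in the text.
group_sizes = {"SF": 112, "SJ": 27, "SM": 115, "PF": 274, "PJ": 34, "PM": 272}

# Assumed rule: radius = sqrt(chi2_crit / n) in canonical variate units.
radii = {g: math.sqrt(chi2_crit / n) for g, n in group_sizes.items()}

for g, r in radii.items():
    print(f"{g}: n = {group_sizes[g]:3d}, radius = {r:.3f}")
```

Under this assumption the juvenile groups, with only 27 and 34 complete cases, get circles roughly two to three times the radius of the adult groups from the same site, so apparent overlaps involving SJ and PJ should be read with caution.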

Table 4. Matrix of Squared Mahalanobis Distances

        PF      PJ      PM      SF      SJ      SM
PF   0.000
PJ   0.264   0.000
PM   2.083   1.769   0.000
SF   1.012   0.837   3.560   0.000
SJ   1.549   1.624   5.110   0.243   0.000
SM   4.368   3.285   1.290   3.911   5.449   0.000
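The minimum spanning tree can be recovered directly from the Table 4 distances. The sketch below uses Kruskal's algorithm with a small union-find; this is one standard way to build the tree, not necessarily how it was produced for the figure, and the names are ours. Working with squared distances gives the same tree as unsquared ones, because squaring preserves the ordering of the edges.

```python
# Squared Mahalanobis distances from Table 4 (lower triangle, diagonal omitted).
labels = ["PF", "PJ", "PM", "SF", "SJ", "SM"]
lower = {
    "PJ": [0.264],
    "PM": [2.083, 1.769],
    "SF": [1.012, 0.837, 3.560],
    "SJ": [1.549, 1.624, 5.110, 0.243],
    "SM": [4.368, 3.285, 1.290, 3.911, 5.449],
}

# Flatten to (distance, group_a, group_b) edges.
edges = []
for row, values in lower.items():
    for col, d in zip(labels, values):
        edges.append((d, col, row))

# Kruskal's algorithm: take edges in increasing order, skipping any that
# would join two groups already connected.
parent = {g: g for g in labels}

def find(g):
    """Follow parent links to the representative of g's component."""
    while parent[g] != g:
        g = parent[g]
    return g

mst = []
for d, a, b in sorted(edges):
    root_a, root_b = find(a), find(b)
    if root_a != root_b:
        parent[root_a] = root_b
        mst.append((a, b, d))

for a, b, d in mst:
    print(f"{a} - {b}: {d:.3f}")
```

On the Table 4 values this links SF to SJ, PF to PJ, PJ to SF, PM to SM and PJ to PM, so the two male groups join the rest of the tree only through the PJ to PM edge. The result can be compared with the tree drawn in Figure 5.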

(f) Comment on the preceding analysis in as much detail as possible. What do the canonical variates represent, and how does this analysis compare with that presented earlier based on a classical scaling of the distance matrix D2?

1715_2.png

 

271_2.png

2. (a) (i) Let x be a p × 1 random vector with mean vector μ and covariance matrix Σ. In the context of Factor Analysis, define carefully what it means to say that the k factor model holds for x and show what structure this imposes on the covariance matrix Σ.

(ii) Suppose a random sample is taken from a multivariate distribution in which the k factor model is assumed to hold. The maximum likelihood estimates of the factor loadings matrix Λ and the diagonal matrix of communalities Ψ minimize the function F given by

F(Λ, Ψ) = trace(Σ^{-1} S) - log|Σ^{-1} S| - p

where S denotes the sample covariance matrix and Σ the population covariance matrix as a function of Λ and Ψ. Explain carefully how this function can be interpreted. Describe in detail how it can be used as the basis of an asymptotic goodness-of-fit test for the assumed k factor model.
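For the goodness-of-fit test, a standard large-sample result (assumed here rather than taken from the question) is that a multiple of the minimized value of F is approximately chi-squared with ((p - k)^2 - (p + k)) / 2 degrees of freedom, and that Bartlett's correction replaces the multiplier n - 1 by n - 1 - (2p + 5)/6 - 2k/3. The function name below is hypothetical, and the example values are those of part (b) below: 48 states, 15 variables and six factors.

```python
def factor_gof_test_setup(n, p, k):
    """Return (degrees_of_freedom, multiplier) for the k-factor likelihood ratio test."""
    # Degrees of freedom: free parameters in an unrestricted covariance matrix
    # minus those in the k-factor structure, after allowing for rotation.
    df = ((p - k) ** 2 - (p + k)) / 2
    # Bartlett's correction to the multiplier n - 1 (an assumed refinement).
    multiplier = n - 1 - (2 * p + 5) / 6 - 2 * k / 3
    return df, multiplier

# Values taken from part (b): 48 states, 15 variables, six factors.
df, multiplier = factor_gof_test_setup(n=48, p=15, k=6)
print(f"degrees of freedom = {df:.0f}")
print(f"test statistic = {multiplier:.3f} * F_min, referred to chi-squared({df:.0f})")
```

The degrees of freedom must be positive for the test to make sense, which limits how many factors can be fitted to a given number of variables.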

 

(b) The data for this question concern variables thought to be related to electricity consumption in 48 states in the US. Descriptions of the 15 variables are given in Table 1 below, and the sample correlation matrix R of the variables is given in Table 2.

Table 1

X1 log(price of gas)

X2 log(degree days heating)

X3 log(degree days cooling)

X4 log(percent homes with separate food freezers)

X5 log(percent homes with air conditioning)

X6 log(number of rooms per house)

X7 log(percent single-family dwellings)

X8 log(percent with electric water heaters)

X9 log(percent with electric heat)

X10 log(percent with electric cooking)

X11 log(price of electricity)

X12 log(population density)

X13 log(disposable per capita income)

X14 log(percent rural households)

X15 log(kilowatt hours per customer)

Maximum likelihood factor analysis was carried out on the correlation matrix, and six factors were extracted. The estimated raw factor loadings are given in Table 3, and the loadings following a varimax rotation in Table 4.


 

1069_2.png

885_2.png

1362_2.png

How can you interpret the calculated row and column sum of squares totals?
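As a reminder of what the two kinds of total measure, the sketch below computes them for a small loading matrix. The loadings are hypothetical values chosen for illustration and are not taken from Tables 3 or 4. It assumes orthogonal factors and an analysis of the correlation matrix, so that each variable has unit variance.

```python
# Hypothetical loadings for 3 variables on 2 factors (illustrative values only,
# not taken from Tables 3 or 4).
loadings = [
    [0.8, 0.1],
    [0.7, 0.3],
    [0.2, 0.9],
]
n_vars = len(loadings)
n_factors = len(loadings[0])

# Row sums of squares: the communality of each variable, i.e. the share of its
# unit variance explained by the common factors.
communalities = [sum(a ** 2 for a in row) for row in loadings]

# Column sums of squares: the variance accounted for by each factor.
factor_variances = [sum(row[j] ** 2 for row in loadings) for j in range(n_factors)]

# Dividing by the number of variables gives each factor's proportion of the
# total variance, since each standardized variable contributes variance 1.
proportions = [v / n_vars for v in factor_variances]

print("communalities:", communalities)
print("variance per factor:", factor_variances)
print("proportion of total variance:", proportions)
```

Both sets of totals add up to the same grand total, and an orthogonal rotation such as varimax leaves the row totals unchanged while redistributing the column totals among the factors.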

(ii) The factor axes defining the 6-dimensional factor space have been rotated (by varimax) so as to aid factor interpretation. Propose some desirable properties of factor loadings that might make interpretation easier, and that the varimax criterion attempts to produce.
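One way to see what varimax rewards is to evaluate its criterion on a simple example. The sketch below uses the raw form of the criterion, the sum over factors of the variance of the squared loadings in each column, without the Kaiser normalization by communalities that many implementations apply. The loadings and function names are hypothetical.

```python
import math

def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings in each column."""
    p = len(loadings)
    k = len(loadings[0])
    total = 0.0
    for j in range(k):
        squared = [row[j] ** 2 for row in loadings]
        mean_sq = sum(squared) / p
        total += sum((s - mean_sq) ** 2 for s in squared) / p
    return total

def rotate(loadings, angle):
    """Rotate a two-factor loading matrix orthogonally through `angle` radians."""
    c, s = math.cos(angle), math.sin(angle)
    return [[a * c - b * s, a * s + b * c] for a, b in loadings]

# Hypothetical two-factor loadings with a clear simple structure:
# each variable loads on one factor only.
simple = [[0.9, 0.0], [0.8, 0.0], [0.0, 0.7], [0.0, 0.6]]
blurred = rotate(simple, math.pi / 4)   # same fit, axes turned by 45 degrees

print(f"simple structure: {varimax_criterion(simple):.4f}")
print(f"after 45 degree rotation: {varimax_criterion(blurred):.4f}")
```

The two matrices reproduce the same correlations and the same communalities, but the criterion is much larger for the first, in which every loading is either large or zero. Varimax searches over orthogonal rotations for the one that maximizes this kind of contrast.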

 

(iii) Comment briefly on the original correlation matrix and as fully as possible on the analysis. How would you interpret the 6 factors? What further comments on the variables, the factors or the analysis might you make?

