Build up a two-way anova analysis

Assignment Help Advanced Statistics
Reference no: EM1383296

Problem 1

1. There is a formula for sample size n with given margin of error m and con dence level C for population proportion.

n = 1/4 (z*/m)2

What assumption is made for 1/4 in the formula above? Please give a 1-2 sentences brief explanation to your choice.

(a). Random guess

(b). Assume sample proportion in the future is 1/2

(c). m is half the length of con dence interval

2. Which one is NOT a linear regression models? Please give a 1-2 sentences brief explanation to your choice.

(a). yi = 0 + exp ( β1xi) +  ∈i, i = 1, 2,........, n
(b). yi = β0 + β1xi+ β2xi2 + ∈i, i = 1, 2,........, n
(c). yi = β0 exp(xi) + β2xi7 + ∈i, i = 1, 2,........, n

3. Suppose X and Y has linear correlation coecient r = 0:5, and there are 77 observations, what is the test statistic for the hypothesis test

H01= 0  vs Ha1≠ 0

where 1 comes from the simple linear regression model below? Please give a 1-2 sentences brief explanation to your choice.

Y = β0 + β1+ ∈

(a). Not enough information
(b). 5
(c). 0.25

97_Build up a two-way ANOVA analysis.png

4. What can you see from the graph above? Please give a 1-2 sentences brief explanation to your choice.

(a). None-constant variance
(b). Dependent error terms
(c). None-linearity

5. Which model is more possible to have smaller R2? Please give a 1-2 sentences brief explanation to your choice.

2149_Build up a two-way ANOVA analysis1.png

where Y and X1 in model A and B are the same.
(a). Not enough information
(b). Model A
(c). Model B

6. Suppose we have designed an experiment to compare the 5 di erent treatment on the weight gain of rat's liver, and i is the population mean of the weight gain of rats under treatment i. Which one below is not a contrast? Please give a 1-2 sentences brief explanation to your choice.

200_Build up a two-way ANOVA analysis2.png

 

Problem 2

Suppose we have a partial R output below. Please answer the questions and show your steps.

1894_Build up a two-way ANOVA analysis3.png

(a). What is the SSE of the model?
(b). What is the linear correlation coecient between x and y?
(c). Suppose we know  y is 1.44, what is  x?
(d). Find the P-value for H0 : 1 = 0:5 vs: Ha : 1 < 0:5.
(e). What is the 95% CI for the ^ y at x = 3?
(f). Basing on the output, can you give a reasonable guess of the probability that the y at x = 2 is larger than 0:2.

Problem 3

For example, if Y is annual income ($1000/year), X1 is educational level (number of years of schooling), X2 is number of years of work experience, and X3 is gender (X3 = 0 is male, X3 = 1 is female), then after a linear regression of the data collected and assume the estimated parameters are the true values, then we have

Y = 15 + 0:8X1 + 0:5X2 - 3X3 + ε
(a). What is the average di erence of annual income between women and men if their other conditions are the same?

(b). What is the average annual income of a female with 10 years of eduction and 10 years of working experience? What about a male of the same condition?

(c). What is the probability that a female with 16 years education and no work experience will earn more than $30,000/year?

(d). Suppose a female has 15 years of education, and she has 4 years of working experience, how many more years of working experience will make her expected annual income is no less than $28,000?

Problem 4

A sheries biologist is interested in determining a set of optimal conditions for growing hatchery trout. The two factors that are most easily controlled at the hatchery are water temperature (A) and fungicide (B). The biologist designs an experiment consisting of 2 di erent water temperatures, 3 di erent levels of fungicide, and 5 observations on each of the temperature-fungicide combinations. The following summary data resulted on the response variable y= weight of a hatchery trout.

2050_Build up a two-way ANOVA analysis4.png

(a). Filling in the missing entries above.
(b). Test for the null hypothesis of no interaction between water temperature and level of fungicide. If appropriate, perform tests of the main e ects for the two factors: Water Temperature and Fungicide as well. Use signi cance level = 5%.
(c). What conclusion you will make basing on the analysis on part (b).

Problem 5

Rats were given one of four di erent diets at random, and the response measure was liver weight as a percentage of body weight. There are 5 observations for Treatment 1, 6 for Treatment 2, 6 for Treatment 3 and 8 for Treatment 4. Below are the some R output of the analysis:

1874_Build up a two-way ANOVA analysis5.png

Problem 6

Using the data in `P6 data.txt' with a single response Y and three predictors X1, X2 and X3.

(a). Perform the backward and forward variable selection procedure using AIC as the criterion for this data set. Please report your nal models rst, then show your R commands and related outputs.

(b). In this speci c case, which variable selection procedure (backward or forward) is better? Why?

(c). Find the AIC, BIC and Cp for the model lm(Y  X1 + X3).

Problem 7

Pine oleoresin is obtained by tapping the trunks of pine trees. Tapping is done by cutting a hole in the bark and collecting the resin that oozes out. This experiment compares four shapes for the holes and the ecacy of acid treating the holes. Twenty- four pine trees are selected at random from a plantation, and the 24 trees are assigned at random to the eight combinations of whole shape (circular, diagonal slash, check, rectangular) and acid treatment (yes or no). The response (y) is total grams of resin collected from the hole. Data is available in `resin.csv'.

(a). Summarize the data graphically (interaction plots and pro le plots) and tell what you can see from the graphs.

(b). Build up a two-way ANOVA analysis containing all the main e ects and the interaction e ects. Are all the e ects are signi cant?

(c). If some e ects are not signi cant, what does that mean? Please show how to understand the p-value in the rst row of your ANOVA table. I

know you can nd the general answer for it somewhere, and it is not good enough since I need you to specify your understanding on this speci c case (the speci c test, speci c sample size, speci c population and so on).

(d). Please check with all the assumptions for this two-way ANOVA analysis(model in part (b)) graphically and give speci c comments to each of the assumptions.

(e). Do you think a transformation of the data is necessary? If yes, how? If no, why?

(f). Basing on the model suggested from part (e), please simplify the model by dropping the insigni cant terms (you need to check with the assumptions whenever you have a new/di erent model).

(g). Please draw conclusions of your two-way ANOVA analysis.

(h). Now, you have all the analysis of this problem. Please compile a brief statistic report to show the whole idea. You can follow the format of the example report. The report should be no more than 3 pages (including all the essential graphs).

Reference no: EM1383296

Questions Cloud

Find the approximate probability : Do the data demonstrate that the true mean photometer reading for the color density is significantly different from 12 at 3 percent level of significance?
Required to dewater a construction site : It is required to dewater a construction site 80 m by 80 m. The bottom of the construction will be 1.5 m below the initial water surface elevation of 90 m.
Create and implement a lexical analyzer for c : Create and implement a lexical analyzer for C-- as follows: Write the set of token types to be returned by lexical analyzer. Explain regular expressions for this set of token types.
Accumulation of acetyaldehyde : It is the accumulation of acetyaldehyde that results in flushed face, dizziness, and light headedness responses that are associated with ethanol intoxication.
Build up a two-way anova analysis : Formula for sample size n with given margin of error m and con dence level C for population proportion and please simplify the model by dropping the insigni cant terms
Find the genotypes of parents in the cross : Assume you are studying the inheritance of flower color and seed color in navy beans and the yellow flower color is dominant over white and white seed color is dominant over beige.
Four-lane highway for adequate ssd : A horizontal curve was designed for a four-lane highway for adequate SSD. Lane widths are 12 feet, and the superelevation is 0.06 and was set assuming maximum fx.
Distinguish web pages or web servers use for task : Suppose the role of the IT consultant to new nonprofit organization, Free Flu, to provides flu shots to the elderly. The organization requires the domain name. Distinguish between any Web pages or Web servers you would use for task.
Problem on coefficient of rolling resistance : What is the minimum distance from the stalled car at which the driver could apply the brakes and still stop before hitting it? Assume coefficient of rolling resistance of 0.013.

Reviews

Write a Review

Advanced Statistics Questions & Answers

  Create a scatter plot - correlation and regression

What would you predict the final exam points to be for a student with a GPA of 3.35 and an IQ of 120 and what is the relationship between a student's short-term IQ score, previous GPA, and final exam points?

  Plan the analysis

Plan the analysis

  Standard deviation of complaints received per week

What is the probability that a randomly chosen package contains between 47 and 52 clips (inclusive) per package and What is the standard deviation of complaints received per week?

  Quality control analysis

Determining the root causes

  Do a one-way anova to test the hypothesis

Do a one-way ANOVA to test the hypothesis that the population means are equal and what are the hypotheses to be tested here? What is the value of F ?

  Logistic regression model

Compute the log-odds ratio for each group in Logistic regression model.

  Modelise as a markov chain

modelise as a markov chain

  Problems on advanced computer networks

Identify and explain the events that can change the state of the system also determine the percent of time that this storage space will be adequate to accommodate newly arrived jobs-CS524 Advanced Computer Networks

  Probability and statistics

The tubes produced by a machine are defective. If six tubes are inspected at random , determine the probability that.

  What is the estimated multiple linear regression equation

What is the estimated multiple linear regression equation - calculate the F statistic and should the Null hypothesis be rejected at the 5% level of significance?

  Relationship between speed, flow and geometry

Write a project proposal on relationship between speed, flow and geometry on single carriageway roads.

  Quantitative analysis

State the hypotheses that you are going to test.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd