Describe omitted variable bias problem in linear regression

Assignment Help Basic Statistics
Reference no: EM131208842

Q1. Explain carefully the difference between an unbiased estimator and a consistent estimator. Does one imply the other? If so, show mathematically, and if not provide examples.

Q2. In this question, we will use a data set on housing prices from Albuquerque, New Mexico, collected in 1993. The data set itself is called final_housingdata.dta and a description of the variables is in final_housingdata_description.txt. For the following questions, fit a linear regression with PRICE as the outcome variable, and SQFT, AGE, FEATS, NE, CUST, COR and TAX as covariates.

(a) Test whether the coefficient of FEATS is statistically significant at the 5% significance level. State the test statistic used and its null distribution.

(b) Interpret the economic (or practical) significance of SQFT on PRICE as elasticity and in terms of standard deviations.

(c) Describe the assumptions under which the test used in part (a) is valid.

(d) What expected value of housing price would you predict for a house with 3000 ft of living space, is 28 years old, has 4 features, is not located in the northeast, is a custom home, is not on the corner, and has annual taxes of $2000? Provide a point estimate and a standard error.

(e) Explain the difference between homoskedasticity and heteroskedasticity. What is the difference in terms of the assumptions of the regression model between the two cases? Give an example of why heteroskedasticity is a more reasonable assumption in this particular setting.

(f) Undr what assumptions are the OLS estimates consistent? Unbased?

Q3. Describe the omitted variable bias problem in linear regression. How does an instrumental variable addresses this problem? Explain the assumptions under which an instrumental variable is valid. What is the interpretation of the coefficient on the covariate of interest (the endogenous one) in an instrumental variable regression?

Q4. The current framework of the U.S. unemployment insurance (UI) system provides short-term monetary assistance to the involuntarily unemployed. A frequent criticism of the system has been that the unemployment insurance benefit acts as a disincentive for job-seekers and prolongs the duration of unemployment spells. During the 1980's, several controlled experiments tested alternative compensation schemes for UI. In these experiments, UI claimants were offered a cash bonus if they found a job within some specified period of time and if the job was retained for some specified duration. The question of interest was: would the promise of a monetary lump-sum benefit provide a significant inducement for more intensive job-seeking?

We consider data from the Pennsylvania reemployment bonus experiment, which was conducted in Pennsylvania from July 1988 to October 1989. The data is in f inal_pabonusdata.csv on Canvas. The variables are:

-row. names: id for person;

-duration: duration in weeks of the person's unemployment's spell;

-treatment: dummy variable for whether the person was randomly assigned to be in treatment group that is eligible to receive bonus (if treatment,=0, the person was assigned to be in the control group that just receives usual UI without any chance to receive a bonus);

-female: dummy variable for female;

-black: dummy variable for black;

-hispanic: dummy variable for Hispanic;

-ndependents: number of dependents, with 2 indicator two or more dependents;

-recall: dummy variable for whether person "expected to be recalled;"

-young: dummy variable for whether person was less than 35;

-old: dummy variable for whether person was greater than 54;

-durable: dummy variable for whether person was empoloyed in the durable goods industry;

-lusd: dummy variable for whether person lives in a low employment district (Coatesville Reading or Lancaster).               ,

We use the Weibull model. Let Xi denote the covariates for subject i and Yi the duration of the unemployment spell for subject i.

463_Figure.png

where γ > 0.

(a) Write down the log likelihood function.

(b) Use the fact that the Weibull model has the property that E(logYi|Xi) = α + β'Xi to estimate α and β by least squares.

(c) The method of maximum likelihood is more efficient than least squares (in the normal linear regression, the maximum likelihood estimates are the same as the least squares estimates but this is not the case for the Weibull model). Estimate the parameters of the model by maximum likelihood. For one set of starting values, use the estimates in part (b) along with γ = 1. Try several other starting values. (For full credit, you will have to solve a constrained optimization problem since γ > 0. For example, to constrain γ > 0 in the optim function in R, use the options method="L-BFGS-B", lower=0, upper=Inf. See the help file for the optim function. In Matlab, you can use the fmincon function. Alternatively, you can set γ = 1.08 and get partial credit.)

(d) Find the standard errors for the maximum likelihood estimates. For which covariates would we reject the null hypothesis that the covariate has no effect on duration holding the other covariates fixed for significance level 0.05.

(e) Test whether the parameters for the old and young covariates are both equal to zero using the likelihood ratio test. (Like in part (c), you can either do a constrained optimization to get full credit, or set 7 = 1.08 and you will get partial credit.)

(f) The hazard rate at a duration y is the instantaneous rate of a duration ending given that it has lasted up until time y. For the Weibull model, the hazard rate is

h(y|x) = (γ/exp(α+β'x))(y/exp(α + β'x))γ-1

Show that for the Weibull model, if covariate j is increased by one unit, the hazard rate for each y is multiplied by exp (βjγ). (The fact that the hazard rate for each y is multiplied by the same factor is called the proportional hazards property.)

(g) Use the bootstrap to find an approximate 95% confidence interval for how much treatment multiplies the hazard rate compared to control when the other covariates are held fixed. (Again, you can set γ = 1.08 to get partial credit.)

(h) What do you conclude about the effect of the UI bonus treatment from the analyses you have done in (a)-(g)?

Q5. Suppose that X has the following density

1940_Figure1.png

The table below shows data on a random sample X1, ..., X20 from this distribution and some summary statistics.

i

X

ln(X)

1

0.68

-0.39

2

0.16

-1.83

3

0.11

-2.25

4

0.26

-1.34

5

0.63

-0.47

6

0.59

-2.53

7

0.95

-0.05

8

0.88

-0.13

9

0.90

-0.10

10

0.86

-.015

11

0.64

-0.44

12

0.95

-0.05

13

0.27

-1.30

14

0.68

-0.39

15

0.77

-0.26

16

0.31

-1.17

17

0.79

-0.24

18

0.57

-0.57

19

0.99

-0.01

20

0.74

-0.29

Sum

12.73

-11.98

Average

0.64

-0.60

(a) Show that E(X) = (θ/1+θ). Write down the moment condition based on this equation and find an estimator for 8 using the (generalized) method of moments. Compute the estimate with the given sample.

(b) For a given θ, the random variable U1/θ where U ~ Uniform (0, 1), has the same distribution as the random variable X (this is true and you do not need to prove it). Using this procedure to generate random draws from X, compute a 95% confidence interval using bootstrapping for the method of moments estimator derived in part (a).

(c) Find the maximum likelihood estimator (MLE) of θ (verify that your estimator is indeed the maximizer of the likelihood). Calculate the estimate with the given sample.

(d) Calculate the asymptotic standard error for the MLE estimator derived in part (c).

(e) Which of the two estimators above would you prefer and explain why.

Q6. For this question you have to use the data in the ascii file yogurt_2008.txt. The data consists of observations on 437 households making 2582 yogurt purchases. They purchase one of three brands. The five variables in the data set are, (i) the household id, running from 1 to 437, (ii) the choice, running from 1 to 3, (iii) the price, in Cents, of the yogurt brand 1, (iv) the price, in cents, of the yogurt brand 2, (v) the price, in cents, of the yogurt brand 3. Let j index the choice, running from 1 to 3, t index the purchase, running from 1 to Ti, and i the household, running from 1 to 437. The number of purchases made by each household differs. For example, the first two purchases come from household 1, the next two from household 2, and the next eight from the third household.

We focus on a discrete choice model where the utility for individual i associated with choice j, in purchase t is

Uijt = αj + β· Pijt + εijt,

where Pijt is the price of brand j for household i at purchase time t. We assume the εijt are independent across time, choice and household. Normalize α1 = 0, so that there are three free parameters, α2, α3, and β.

(a) Calculate the mean price for each brand, by the choice made. That is, calculate the average price of brand 1, 2 and 3 for households choosing brand 1, calculate the average price of brand 1, 2 and 3 for households choosing brand 2, and calculate the average price of brand 1, 2 and 3 for households choosing brand 3. Do the patterns make sense? That is, are the prices for brand j lowest among the households choosing brand j?

(b) Suppose we want to estimate the conditional logit model. Because of the independence assumption on the Eijt, we can ignore the fact that some purchases come from the same household, and the likelihood function is

1023_Figure2.png

Find the maximum likelihood estimator (α^2, α^3, β).

(c) Next we explore a random coefficient version of the conditional logit model. Rather than continuous mixtures, we use a simple version with a binary mixture. We will let the coefficient on price vary by household. Thus, we model the latent utility as

Uijt = αj + βi · Pijt + εijt,

where βi ∈ {βL, βH} with Pr = (βi = βL) = π. Show that the likelihood function for this mixture model is

L(α2, α3, βL, βH, π) = i=1N                          

1362_Figure3.png

(d) Plot the log likelihood function, for π = 0.5, at

65_Figure4.png

as a function of c, from c = 0 to c = 0.2. Compare the value of the log likelihood function at the value of c that maximizes it with the value at c = 0. Does it appear that allowing for the heterogeneity in price sensitivity is important?

(e) Explain why the random coefficient model might be preferred to the conditional logit model. Which (undesirable) property of the conditional logit model can we avoid by using the random coefficient model? Why is this property undesirable?

Reference no: EM131208842

Questions Cloud

Estimate the maximum permissible conversion time for the adc : Suppose it is desired to digitize a 250-kHz analog signal to 10 bits using a successive-approximation ADC. Estimate the maximum permissible conversion time for the ADC.
Write the mesh equations for the network of given figure : Write the mesh equations for the network of given figure, and -determine the current through the 10-kΩ resistor.
What is the minimum sampling period that can be used : What is the minimum sampling period that can be used to ensure that the Nyquist criterion is satisfied?
Was the decision effective : Think of the last time you had to make a decision in the workplace. Was the decision effective? In your opinion, what makes a decision effective?
Describe omitted variable bias problem in linear regression : Describe the omitted variable bias problem in linear regression. How does an instrumental variable addresses this problem? Explain the assumptions under which an instrumental variable is valid. What is the interpretation of the coefficient on the ..
Potential real property issues : What are potential real property issues that could arise from a family farm with a business on it. To give an example if Bill and Mary had a farm and on that farm was a store. They also have a son, John, living on the farm with his wife Petie and ..
Find the minimum number of bits required for the adc : The output voltage of an aircraft altimeter is to be sampled using an ADC. The sensor outputs 0 V at 0 m altitude and outputs 10 V at 10,000 m altitude. If the allowable error in sensing (±1 2 LSB) is 10 m, find the minimum number of bits required..
Which is most likely to bring about the facts of a situation : Which of the following examples illustrates the interviewer's growth AND development in the application of attending skills during counseling interviews?
Determine the current through the inductive element : Write the mesh equations for the network of given figure, and - determine the current through the inductive element.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Using the six-step critical value approach at the 005 level

you are the manager of a restaurant that delivers pizza to collect dormitory rooms. you have just changed your

  Summary points to the middle point

Equation of the line passing through the summary points. Calculate distance from line through the outer summary points to the middle point. Equation of the median-median line of best fit for data.

  Determine average life expectancy of female bostonians

Conduct a hypothesis test to determine whether the average life expectancy of female Bostonians is higher than that of male Bostonians. Test at the 10% significance level. Run both a critical value and p-value test

  Explain chi-square test of independence

Describe the chi-square test of independence. How is this test different from the chi-square goodness-of-fit test? Are the assumptions underlying each of these tests the same?

  Determining the number of employees

The state office of Australia Post believed that mail throughput at suburban post offices increases with the number of employees. From a random sample of ten major suburban post offices, the following regression equation was estimated: Y = -11.76 ..

  Question regarding the binomial distribution

It is given that 60% of the employees at a large company are women. A simple random sample of 15 employees is selected from this company.

  Estimating the expectationa measurement follows the normal

estimating the expectationa measurement follows the normal distribution with standard deviation that is equal to 15 and

  Determining single sample test

Think about a real world situation or problem that could be addressed using a single-sample t test. For this situation or problem, clearly identify the independent and dependent variables you would study

  Statistics-five-step hypothesis test

A clinical investigator is interested in understanding the relationship between demographic factors and congestive heart failure, measured by a cardiac index (CI).

  How to select the target parameters

A study reported in "Inside Higher Education News" (May, 2006) found that financial incentives can improve low-income college students' grades and retention.

  Proportion of customers-make a valid warranty claim

If there is one-year warranty on the equipment, what proportion of the customers will actually make a valid warranty claim?

  A group of 5 statistics students receive the following

a group of 5 statistics students receive the following grades on a test 81 74 62 92 and 76. an intervention plan is

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd