Calculate the mean of each variable for treatment

Reference no: EM131967969

Economics of Global Poverty Problem Set

After your report from Problem Set 2, the Minister of Agriculture from Nicaragua was convinced that the data you previously had access to were insufficient to generate an unbiased estimate of the causal impact of the Rural Business Development (RBD) Program. Recall that the RBD provided training and credit to farmers in order to raise farm productivity and household income. The government has extended your contract for one more week in order to generate a new and improved estimate of the ATT before deciding whether to scale up the RBD.

To assist you, the Minister sent out a team to re-survey the same 1,684 households for which you had data on income and program participation in Problem Set 2. The Minister instructed the team to ask each of the households about the 2014 values of 5 key characteristics that the Minister felt (based on your previous report!) might have influenced their decision to participate in the program. Recall that the program was implemented in 2015, so the values of these 5 additional variables are pre-program values. This data set is available on the course website under the name "ps3_Nicaragua.dta". The first two variables -- treat and income -- are the same variables you used in Problem Set 2. The full data set contains these two variables plus 5 additional pre-program variables as follows:

Variable Name - Description

treat - 1 = HH participated in RBD program in 2015; 0 = HH did NOT participate in RBD in 2015

income - 2015 per capita income from main program activity ($ US)

job - 1 = HH main activity was cattle in 2014; 2 = HH main activity was grain in 2014; 3 = HH main activity was yuca (cassava root) in 2014

age - Age of head of household in 2014

education - Number of years of education obtained by head of household in 2014

capital - Total value of mobile capital (tools, tractors, equipment, etc.) used on the farm in 2014

land - Farm size (in manzanas; 1 manzana = 1.7 acres) in 2014

1. In the last problem set, we tried to estimate the ATT of the RBD program using a simple bivariate regression. However, we know that omitting variables can bias our estimates. Let's start by exploring this potential bias in more detail. Specifically, let's look at the implication of omitting the household's farm size (the variable "land" in your data set).

a. Why might we be concerned about omitting land? In your answer, please refer to the two conditions we discussed in lecture that need to hold in order for an omitted variable to be a problem (i.e., lead to Omitted Variable Bias).

b. Let's begin by reminding ourselves of the results from the bivariate regression: $\text{INCOME}_i = \alpha + \beta_1 \, \text{TREAT}_i + \varepsilon_i$

What is $\hat{\beta}_1$, your regression coefficient from this bivariate regression? How do we interpret this coefficient?

c. Now estimate the "long" regression with both TREAT and the households' pre-program farm size (land) on the right-hand side: $\text{INCOME}_i = \alpha + \beta_1^L \, \text{TREAT}_i + \beta_2^L \, \text{LAND}_i + \varepsilon_i^L$

How do you interpret $\hat{\beta}_1^L$ and $\hat{\beta}_2^L$? Discuss both economic and statistical significance.

d. How strongly is baseline farm size related to whether farmers participate in the program? To answer this question, estimate the following regression: $\text{LAND}_i = \pi_0 + \pi_1 \, \text{TREAT}_i + \epsilon_i$

How do you interpret $\hat{\pi}_1$?

e. Use your results from parts (c) and (d) along with the OVB formula from class to calculate the magnitude of the omitted variable bias.

f. Interpret (in a short paragraph) the OVB. Should we be concerned if we fail to control for farm size? Why or why not?
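
As an illustration of the arithmetic behind parts (b) through (e), the sketch below checks the standard textbook OVB identity, $\hat{\beta}_1 = \hat{\beta}_1^L + \hat{\beta}_2^L \hat{\pi}_1$, assuming that this is the form of the formula presented in class. It is written in Python rather than Stata, the six households are made-up numbers (not values from ps3_Nicaragua.dta), and the helper names are chosen here purely for illustration.

```python
# Hypothetical toy data: six made-up households, NOT from ps3_Nicaragua.dta.
treat = [1, 1, 1, 0, 0, 0]
land = [12.0, 9.0, 15.0, 4.0, 6.0, 5.0]
income = [900.0, 700.0, 1100.0, 300.0, 450.0, 400.0]


def demean(xs):
    """Subtract the sample mean from every element."""
    m = sum(xs) / len(xs)
    return [x - m for x in xs]


def cross(xs, ys):
    """Sum of products of two already-demeaned series."""
    return sum(x * y for x, y in zip(xs, ys))


t, l, y = demean(treat), demean(land), demean(income)
s_tt, s_ll, s_tl = cross(t, t), cross(l, l), cross(t, l)
s_yt, s_yl = cross(y, t), cross(y, l)

# "Short" regression: income on treat only.
beta1_short = s_yt / s_tt

# "Long" regression: income on treat and land (two-regressor OLS solution).
det = s_tt * s_ll - s_tl ** 2
beta1_long = (s_yt * s_ll - s_yl * s_tl) / det
beta2_long = (s_yl * s_tt - s_yt * s_tl) / det

# Auxiliary regression: land on treat.
pi1 = s_tl / s_tt

# Omitted variable bias in the short-regression coefficient on treat.
ovb = beta2_long * pi1

print("short beta1:", beta1_short)
print("long beta1: ", beta1_long)
print("beta2 * pi1:", ovb)
print("long beta1 + OVB:", beta1_long + ovb)
```

In any sample the last printed line reproduces the short-regression coefficient, which is why the bias can be computed either as the difference between the two treatment coefficients or as the product from parts (c) and (d).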

2. Farm size (land) is not the only potential omitted variable. Discuss, using economic theory, whether or not you would be concerned if we fail to control for each of the other 4 baseline variables included in the data set. In your answer, make sure you refer to the two conditions required for an omitted variable to cause bias in our estimate of program impact.

3. A common way to examine whether the treatment group differs systematically from the control group is to construct a balance table.

a. Calculate the mean of each variable for both the treatment and control groups. (Note that job is a categorical variable with 3 possible values, so calculate the proportion of farmers with each value, i.e., in each of the three primary activities, for both groups.)

b. Conduct a t-test for the difference in means between the two groups for each variable. Are any of the differences statistically significant?

c. Should any of these variables be included in a regression? Discuss why or why not, with reference to your findings in the table.

d. Looking back to question 2, did you find what you expected for the systematic differences across groups?
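
To make the balance-table mechanics concrete, here is a minimal sketch of one row of such a table: group means, the share of each group in a job category, and a t statistic for the difference in means. It is written in Python rather than Stata, the eight records are invented for illustration (not taken from ps3_Nicaragua.dta), and it uses the unequal-variance (Welch) form of the statistic, which is an assumption here and may differ from the version used in section.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical toy records (treat, job, land); NOT from ps3_Nicaragua.dta.
rows = [
    (1, 1, 12.0), (1, 1, 9.0), (1, 2, 15.0), (1, 3, 10.0),
    (0, 2, 4.0), (0, 3, 6.0), (0, 3, 5.0), (0, 2, 7.0),
]


def welch_t(a, b):
    """t statistic for a difference in means, allowing unequal variances."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se


def job_share(group, job_value):
    """Proportion of households in `group` whose main activity is `job_value`."""
    jobs = [job for treat, job, _ in rows if treat == group]
    return jobs.count(job_value) / len(jobs)


land_treated = [land for treat, _, land in rows if treat == 1]
land_control = [land for treat, _, land in rows if treat == 0]
t_land = welch_t(land_treated, land_control)

print("mean land, treated:", mean(land_treated))
print("mean land, control:", mean(land_control))
print("t statistic (land):", t_land)
for job_value in (1, 2, 3):
    print("job", job_value, "share treated/control:",
          job_share(1, job_value), job_share(0, job_value))
```

The same two steps (a mean or proportion per group, then a test of the difference) are repeated for every baseline variable to fill in the full table.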

4. Let us now examine how controlling for these covariates affects our estimates of the treatment effect. In a single table (using the "outreg" commands you learned in section), report the parameter estimates from two models: a bivariate regression including only the treatment variable on the RHS and a multivariate regression including all the baseline controls in addition to the treatment variable.

a. Assuming that we have controlled for all the relevant variables, what is your estimate of the ATT using multivariate regression? Is it economically significant? Is it statistically significant?

b. Compare your estimates of the ATT from the two regressions. How important was controlling for these variables?

5. Rather than controlling for covariates in multivariate regression, we can also match observations between treatment groups based on their covariates.

a. Using a probit model, calculate the probability that each individual participates in the RBD program. Which variables seem important in predicting participation? Do the signs of the coefficients make sense? (Make sure you save the predicted propensity scores from this estimation!)

b. On a single graph, plot the kernel densities of the propensity scores for both the treatment and control groups. Do we have a common support? Do you have any concerns?

c. Using Stata's command teffects psmatch, estimate the ATT using propensity score matching (be sure to specify a probit model and the option atet to get ATT estimates). Compare your estimate of the ATT to that obtained with multiple regression.
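
The idea behind matching on the propensity score can be sketched as follows: each treated household is paired with the control household whose score is closest, and the ATT estimate is the average income gap across those pairs. This is a simplified Python illustration under stated assumptions, not a reproduction of teffects psmatch: the propensity scores and incomes are made up (not estimated from ps3_Nicaragua.dta), matching is one-to-one with replacement, and no attempt is made to handle ties or compute standard errors. The function name is hypothetical.

```python
# Hypothetical (propensity score, income) pairs; NOT from ps3_Nicaragua.dta.
treated = [(0.80, 900.0), (0.65, 700.0), (0.90, 1100.0)]
controls = [(0.30, 300.0), (0.60, 650.0), (0.85, 800.0), (0.20, 250.0)]


def att_nearest_neighbor(treated, controls):
    """Average treated-minus-matched-control outcome gap.

    Each treated unit is matched (with replacement) to the single control
    unit with the closest propensity score.
    """
    gaps = []
    for p_t, y_t in treated:
        _, y_c = min(controls, key=lambda c: abs(c[0] - p_t))
        gaps.append(y_t - y_c)
    return sum(gaps) / len(gaps)


def naive_difference(treated, controls):
    """Unmatched difference in mean outcomes, for comparison."""
    mean_t = sum(y for _, y in treated) / len(treated)
    mean_c = sum(y for _, y in controls) / len(controls)
    return mean_t - mean_c


att = att_nearest_neighbor(treated, controls)
naive = naive_difference(treated, controls)
print("matched ATT estimate:", att)
print("naive difference in means:", naive)
```

In this toy example the matched estimate is smaller than the naive difference because the low-score, low-income controls are never used as matches, which is the kind of comparison part (c) asks about.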

6. Interpreting the estimates from both multiple regression and propensity score matching as the causal impact of the RBD program requires the same key assumption.

a. What is the name of this assumption? What does it mean, "in English"?

b. How confident are you that this assumption holds with the augmented data set provided to you by the Minister of Agriculture? How confident are you that this new data set has allowed you to generate an unbiased estimate of the ATT?


Reviews

len1967969

5/3/2018 1:37:48 AM

Subject: Econ of global poverty, statistics, Stata, economics. Detailed question: Hi, I'd really appreciate it if someone could help me out with this assignment. I've attached the problem set and the data set for Stata. Please format all answers nicely in a word processing program. Attach your do-files (including the names of all who participated in writing the do-file) to the end of your document.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd