Write down the estimated regression model

Assignment Help Applied Statistics
Reference no: EM131020924

Assignment

This assignment consists of two sections: 1) a quiz with fill-in-the-blank questions; and 2) a SPSS data project.

SECTION 1: QUIZ

Regression with a Dummy Independent Variable:

1. Consider data on personal income (PI) of married and unmarried women. Suppose you find that the average PI is $50,000 for married women and $40,000 for unmarried women. Let Y=PI and X=dummy for married (that is, 1 = married, 0 = unmarried)

1) How much more do married women make compared to unmarried women, on average?__________________

2) Write down the estimated regression model Y = a + b*X (all info needed is given):

3) Interpret the intercept term: ____________________________________________

4) Interpret the slope term: _______________________________________________

Multiple Linear Regression:

2. Consider the following model of income with three independent variables from the lecture:
Y = a + b1*X1 + b2*X2 + b3*X3 = - 9,239 - 4,195 * X1 + 141 * X2 + 3,020 * X3

where X1 is a dummy variable, 0=male, 1=female

and X2 is years of work experience

and X3 is years of education

1) How much more do men earn compared to women on average?

2) How much do women with 10 years of work experience and 16 years of education earn on average?

3) How much do men with 10 years of work experience and 12 years of education earn on average?

4) What variables that could (further) mediate the effect of gender on earnings are omitted here?

Two-Way Table: Marginal Distribution and Conditional Distribution

3. Consider the two-way table below based on a four year study about the relationship between anger and heart disease among a random sample of individuals. The subjects (i.e. participants in the study) were free of heart disease at the beginning of the study when they took a test that measured how prone they were to sudden anger. Their heart health was monitored over a four year period and it was recorded whether they developed Coronary Heart Disease (CHD). In short, the study attempts to examine whether anger levels are associated with the likelihood of developing coronary heart disease. Now please answer the questions (a) to (c):

1) In the two-way table below, report the marginal distributions and the total sample size, in counts and percent.

2) In the two-way table below, report the conditional distributions of Coronary Heart Disease in percent. Note that the conditional distribution of Coronary Heart Disease refers to the distribution of Coronary Heart Disease given a certain Anger Level.

3) With reference to your calculations above, discuss whether there is potential association between anger and Coronary Heart Disease.

         #Individuals

Coronary Heart Disease

NO Coronary Heart Disease

 

Low Anger

 

530

 

3,057

 

Moderate Anger

1,100

4,621

 

High Anger

 

270

 

606

 

 

 

 

 

SECTION 2: SPSS PROJECT

1. Regression with One Independent Variable vs. Regression with Multiple Independent Variables

Use the dataset from Assignment#2 (StateData_hw2.sav) to estimate the following two models, and then answer questions 1) to 4):

Model 1: Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers. [You may have done this already in Assignment#2. If so, just repeat the estimation.]

Model 2: Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers (X1) and state median household income (X2).

[Hint: Topic about Multiple Regression was covered in Lecture Note #6.2. For the second model estimation based on two variables X1 and X2, the SPSS procedures are: Analyze Regression Linear select variable as Dependent variable and Independent variable, here you select two variables, X1 and X2, as Independent variables click "OK".]

1) Provide the regression equations for both models and the corresponding values for R2.

2) For the second model, provide interpretations of the constant term and the two slopes.

3) Explain intuitively why the effect of % smoking changed the way it did when the median income was accounted for.
[No loss of points for this question. Just give it a try. I hope to encourage you to think harder about the effect of each independent variable, as well as the interaction of the effects, in the multiple regression model. Formal discussion about such problems may come in 9172.]

4) Use the two models to predict the HDDR (heart disease death rate) for New York State, and then compare the two predicted values to the actual value of HDDR for New York State.

Reference no: EM131020924

Questions Cloud

What is most critical step in the capital budgeting process : What is the most critical step in the capital budgeting process? Why are there no "absolute" answers to capital budgeting decisions?
New heritage doll company case-harvard business review : What additional information does Harris need to complete her analyses and compare the two projects? What specific questions should she ask each of the project sponsors?
When evaluating the financial statements of a given firm : It is often said that anyone with a pencil can calculate financial ratios, but it takes a brain to interpret them. What kinds of things should the analyst keep in mind when evaluating the financial statements of a given firm?
Difficulties of obtaining accurate information : Do you think that this fraction is close to the actual proportion who cheated? Why? (Discuss the difficulties of obtaining accurate information on a question of this type.)
Write down the estimated regression model : Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers (X1) and state median household income (X2).
How long would it take you pay off the balance on new card : How many months will it take to pay off the debt if you only make the $200 minimum payment each month? All is not lost, because you just received an offer to transfer your $10,000 balance from your current credit card to a new credit card charging a ..
Two different bonds currently outstanding : Jallouk Corporation has two different bonds currently outstanding. Bond M has a face value of $20,000 and matures in 20 years. The bond makes no payments for the first 6 years, then pays $1,100 every six months over the subsequent eight years, and fi..
Which people change their behavior after they get insurance : The situation in which people change their behavior after they get insurance (illustrated by the above scenario) because the change benefits them but increases costs to the insurer is called
Has the writer followed all instructions for the assignment : Is the work honest and "relatable?" Does it employ a conversational tone and make use of narrative devices, such as dialogue, scene construction, and characterization? Does it address Lopate's notions of egotism and contrariety?

Reviews

Write a Review

 

Applied Statistics Questions & Answers

  1 find the equation of the regression line for the given

1. find the equation of the regression line for the given data.nbsp what is the predicted value of y when x -2?nbsp

  A manager of a small store wanted to discourage

A manager of a small store wanted to discourage shoplifters by putting signs around the store saying "Shoplifting is a crime!" However, he wanted to make sure this would not result in customers buying less. To test this, he displayed the signs every ..

  Mary''s interest in doing a study to see

1). In Module 4, we considered Mary's interest in doing a study to see if learning of 6th graders on a math lesson is affected by background noise level. There, she was planning to use 2 noise conditions and then analyze her outcomes using a t-test f..

  A candidate for the key pair

The cryptanalyst computes y = Dx' (c0), where D is the decipherment function corresponding to E, for each possible key x', and then checks the table to see if y is in it. If so, (x, x') is a candidate for the key pair. How should the table be ..

  What variables can be used in a pearson correlation table

Create a correlation table for the variables in our data set. Reviewing the data levels from week 1, what variables can be used in a Pearson's Correlation table (which is what Excel produces)

  What are the expected values for each alternative

What are the expected values for each alternative? What decision should be made under expected value? What is the EVPI?

  How variance and the standard deviation measure variation

Define the range, variance, and standard deviation for a population. Discuss how the variance and the standard deviation measure variation.

  Registration of securities by the sec indicates to investors

True or false Registration of securities by the SEC indicates to investors that the risk of those securities is reasonable

  Five balls are distributed between two urns

Five balls are distributed between two urns

  Business research report proposal

Identify a business research topic and define the research questions for the identified problem or opportunity

  Next two days this item is requested more than once

In an inventory analysis it was concluded that, on the average, demands for a certain item were made 2 times per day. What is the probability that on the next 2 days this item is requested more than once?

  A standard deviation of four pounds.

A survey of 50 lobster fishermen on Funafuti (an island in Tuvalu), found that they catch an average of 32 pounds of lobster per day with a standard deviation of four pounds.a) If a fisherman is selected randomly, what is the probability that hi..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd