Write down the estimated regression model

Assignment Help Applied Statistics
Reference no: EM131020924

Assignment

This assignment consists of two sections: 1) a quiz with fill-in-the-blank questions; and 2) a SPSS data project.

SECTION 1: QUIZ

Regression with a Dummy Independent Variable:

1. Consider data on personal income (PI) of married and unmarried women. Suppose you find that the average PI is $50,000 for married women and $40,000 for unmarried women. Let Y=PI and X=dummy for married (that is, 1 = married, 0 = unmarried)

1) How much more do married women make compared to unmarried women, on average?__________________

2) Write down the estimated regression model Y = a + b*X (all info needed is given):

3) Interpret the intercept term: ____________________________________________

4) Interpret the slope term: _______________________________________________

Multiple Linear Regression:

2. Consider the following model of income with three independent variables from the lecture:
Y = a + b1*X1 + b2*X2 + b3*X3 = - 9,239 - 4,195 * X1 + 141 * X2 + 3,020 * X3

where X1 is a dummy variable, 0=male, 1=female

and X2 is years of work experience

and X3 is years of education

1) How much more do men earn compared to women on average?

2) How much do women with 10 years of work experience and 16 years of education earn on average?

3) How much do men with 10 years of work experience and 12 years of education earn on average?

4) What variables that could (further) mediate the effect of gender on earnings are omitted here?

Two-Way Table: Marginal Distribution and Conditional Distribution

3. Consider the two-way table below based on a four year study about the relationship between anger and heart disease among a random sample of individuals. The subjects (i.e. participants in the study) were free of heart disease at the beginning of the study when they took a test that measured how prone they were to sudden anger. Their heart health was monitored over a four year period and it was recorded whether they developed Coronary Heart Disease (CHD). In short, the study attempts to examine whether anger levels are associated with the likelihood of developing coronary heart disease. Now please answer the questions (a) to (c):

1) In the two-way table below, report the marginal distributions and the total sample size, in counts and percent.

2) In the two-way table below, report the conditional distributions of Coronary Heart Disease in percent. Note that the conditional distribution of Coronary Heart Disease refers to the distribution of Coronary Heart Disease given a certain Anger Level.

3) With reference to your calculations above, discuss whether there is potential association between anger and Coronary Heart Disease.

         #Individuals

Coronary Heart Disease

NO Coronary Heart Disease

 

Low Anger

 

530

 

3,057

 

Moderate Anger

1,100

4,621

 

High Anger

 

270

 

606

 

 

 

 

 

SECTION 2: SPSS PROJECT

1. Regression with One Independent Variable vs. Regression with Multiple Independent Variables

Use the dataset from Assignment#2 (StateData_hw2.sav) to estimate the following two models, and then answer questions 1) to 4):

Model 1: Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers. [You may have done this already in Assignment#2. If so, just repeat the estimation.]

Model 2: Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers (X1) and state median household income (X2).

[Hint: Topic about Multiple Regression was covered in Lecture Note #6.2. For the second model estimation based on two variables X1 and X2, the SPSS procedures are: Analyze Regression Linear select variable as Dependent variable and Independent variable, here you select two variables, X1 and X2, as Independent variables click "OK".]

1) Provide the regression equations for both models and the corresponding values for R2.

2) For the second model, provide interpretations of the constant term and the two slopes.

3) Explain intuitively why the effect of % smoking changed the way it did when the median income was accounted for.
[No loss of points for this question. Just give it a try. I hope to encourage you to think harder about the effect of each independent variable, as well as the interaction of the effects, in the multiple regression model. Formal discussion about such problems may come in 9172.]

4) Use the two models to predict the HDDR (heart disease death rate) for New York State, and then compare the two predicted values to the actual value of HDDR for New York State.

Reference no: EM131020924

Questions Cloud

What is most critical step in the capital budgeting process : What is the most critical step in the capital budgeting process? Why are there no "absolute" answers to capital budgeting decisions?
New heritage doll company case-harvard business review : What additional information does Harris need to complete her analyses and compare the two projects? What specific questions should she ask each of the project sponsors?
When evaluating the financial statements of a given firm : It is often said that anyone with a pencil can calculate financial ratios, but it takes a brain to interpret them. What kinds of things should the analyst keep in mind when evaluating the financial statements of a given firm?
Difficulties of obtaining accurate information : Do you think that this fraction is close to the actual proportion who cheated? Why? (Discuss the difficulties of obtaining accurate information on a question of this type.)
Write down the estimated regression model : Estimate and write down a regression model predicting the heart disease death rate based on the percent of smokers (X1) and state median household income (X2).
How long would it take you pay off the balance on new card : How many months will it take to pay off the debt if you only make the $200 minimum payment each month? All is not lost, because you just received an offer to transfer your $10,000 balance from your current credit card to a new credit card charging a ..
Two different bonds currently outstanding : Jallouk Corporation has two different bonds currently outstanding. Bond M has a face value of $20,000 and matures in 20 years. The bond makes no payments for the first 6 years, then pays $1,100 every six months over the subsequent eight years, and fi..
Which people change their behavior after they get insurance : The situation in which people change their behavior after they get insurance (illustrated by the above scenario) because the change benefits them but increases costs to the insurer is called
Has the writer followed all instructions for the assignment : Is the work honest and "relatable?" Does it employ a conversational tone and make use of narrative devices, such as dialogue, scene construction, and characterization? Does it address Lopate's notions of egotism and contrariety?

Reviews

Write a Review

Applied Statistics Questions & Answers

  What are the variance and standard deviation

What are the variance and standard deviation?

  What is the difference in proportions in the original data

In a sample of 56 addicts, 28 were given a new drug to help them overcome their addiction, while the remaining 28 were given the current drug. The following is the output from a randomization test for the difference in the proportion of addicts suffe..

  State the null hypothesis for the effect of guided imagery

What type of analysis was conducted in this study? Was this analysis technique appropriate? Provide a rationale for your answer and how many means are being compared for the pain scores at 12 weeks?

  What is the probability that the third woman

In 2015, a survey of women from Country X found that 56 percent are married. (a) If we randomly select three women from Country X, what is the probability that the third woman is the only one of the three that is married? (Give each answer to the ne..

  Construct an ungrouped frequency distribution

Construct a graph comparing the LDL between pre menopause and post menopause women. Write a brief narrative (text) describing the graph.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  Researchers want to know whether the variation in the popul

Researchers want to know whether, the variation in the population of trap-spacing measurements, is larger than 10 m2. They will conduct a test of hypothesis using α = 0.05. a.) Note that s2 > 10. Consequently, a fisherman wants to reject the null hyp..

  The mean life span of a brand name tire

The mean life span of a brand name tire

  What is the lower and upper limit

A sample of 9 managers with over 15 years of experience has an average salary of $71,000 and a sample standard deviation of $18,000. Assuming that the salaries of managers with over 15 years experience are normally distributed, you can be 95% confide..

  What is the probability that xy will be even

If x is chosen at random from the set {1,2,3,4} and y is to be chosen at random from the set {5,6,7}, what is the probability that xy will be even?

  Cigarette smokers make lower college grades than nonsmokers

Consider the following news headline, "Cigarette Smokers Make Lower College Grades than Nonsmokers" The news article goes on to say that researchers at a university collected information on rate of cigarette smoking (total number of cigarettes smoked..

  What is the probability that she gets across the road

Cars arrive at a particular junction at a rate of 1.5 per minute. A little old lady takes 20 seconds to cross the road at this junction and sets out as soon as one car passes. What is the probability that she gets across the road before the next ca..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd