Run a proper regression analysis

Reference no: EM131762000

Assignment - Use Stata to answer the following questions. Save .do files with comments explaining your methods and logic. All answers and necessary tables should be included in the document.

Question 1 -

Suppose you are interested in the impact of some predictors on being in an honors math program (honors; 1 = in the honors math program, 0 = not in the honors math program). The first set of predictors consists of individual background variables (female, ethnicity, and socioeconomic status). The second set of predictors, which you need to enter on top of the individual background variables, consists of science and reading scores. Finally, the third block of predictors that you want to add includes school type (sctyp; 1 = public, 2 = private) and program (prog; 1 = general, 2 = academic, 3 = vocational). Data for N = 200 students are in Q1honors.csv. Answer the following questions.

a. Run a proper regression analysis (from start to finish, including data exploration, distribution, testing assumptions, testing for missing values, etc.) that is appropriate to the distribution of the outcome variable, entering the three blocks of predictors in sequence, and report the results of the three models in an APA-formatted table.

b. Formally compare the first model and the third model using a proper test statistic.

c. Using the first model, calculate the probability of being in the honors math program for a Hispanic male whose SES level is high, and test it. Report all results.

d. Interpret the regression coefficient of female in the best-fitting model in terms of the odds of being in the honors program. Provide a rationale for why you chose that model as the best-fitting model.

e. Address multicollinearity issues for the best-fitting model among the three models tested.

f. Examine outliers, leverage, and influence for the best-fitting model after you have addressed the multicollinearity problem.
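
Once Stata has reported the fitted models, parts b through d come down to a few lines of arithmetic. The sketch below is a minimal illustration in Python, not the Stata workflow the assignment requires. Every coefficient and log-likelihood in it is a made-up placeholder rather than an estimate from Q1honors.csv, and the dummy names `hispanic` and `ses_high` are assumptions about how ethnicity and SES might be coded.

```python
import math


def inv_logit(xb):
    """Convert a value on the log-odds scale into a probability."""
    return 1.0 / (1.0 + math.exp(-xb))


# Hypothetical Model 1 coefficients on the log-odds scale (placeholders only).
coefs = {"_cons": -1.5, "female": 0.6, "hispanic": -0.4, "ses_high": 0.9}

# Part c profile: a Hispanic male whose SES level is high.
profile = {"female": 0, "hispanic": 1, "ses_high": 1}
xb = coefs["_cons"] + sum(coefs[name] * value for name, value in profile.items())
prob = inv_logit(xb)

# Part d: exponentiating a logit coefficient gives an odds ratio.
or_female = math.exp(coefs["female"])

# Part b: likelihood-ratio statistic for two nested models,
# using hypothetical log-likelihoods for Model 1 and Model 3.
ll_model1 = -110.0
ll_model3 = -95.0
lr_stat = 2.0 * (ll_model3 - ll_model1)

print(f"linear predictor = {xb:.3f}, probability = {prob:.3f}")
print(f"odds ratio for female = {or_female:.3f}")
print(f"LR statistic = {lr_stat:.1f}")
```

The likelihood-ratio statistic is compared with a chi-square distribution whose degrees of freedom equal the number of parameters added between the two models. In Stata, `lrtest` carries out this comparison on stored estimates.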

Now you'll run a different model for the following questions.

g. Now conduct a proper analysis to answer the following research question: Does the SES effect on the outcome differ across school type while controlling for female, ethnicity, science, and reading scores?

h. Conduct a proper analysis to evaluate whether reading score mediates the effect of science score on the outcome.
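
For part g, one way to write the moderation model is sketched below. It assumes a logit link for the binary outcome, dummy coding for ethnicity and SES (with index sets j and k running over the non-reference categories), and a single indicator for private schools. The symbols are illustrative choices, not notation supplied by the assignment.

```latex
\operatorname{logit}\,\Pr(\text{honors}_i = 1)
  = \beta_0
  + \beta_1\,\text{female}_i
  + \sum_{j} \gamma_j\,\text{eth}_{ij}
  + \beta_2\,\text{science}_i
  + \beta_3\,\text{read}_i
  + \sum_{k} \delta_k\,\text{ses}_{ik}
  + \theta\,\text{private}_i
  + \sum_{k} \lambda_k\,(\text{ses}_{ik} \times \text{private}_i)
```

Under this parameterization, the research question corresponds to the joint null hypothesis that every interaction coefficient \lambda_k equals zero.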

Question 2 -

Long (1990, 1997) investigates factors affecting the research productivity of doctoral students in biochemistry. The response variable in this investigation, art, is the number of articles published by the student during the last three years of his or her PhD Program. The explanatory variables are as follows:

fem: gender; dummy variable, 1 if female, 0 if male

mar: marital status; dummy variable, 1 if married, 0 if not

kid5: number of children five years old or younger

phd: prestige rating of PhD department

ment: number of articles published by mentor during the last three years

Long's data (on 915 biochemists) are in the file Q2phd.csv.

a. Examine the distribution of the response variable. Based on this distribution, does it appear promising to model these data by linear least-squares regression? Perhaps after transforming the response? Explain your answer.

b. Following Long, perform a Poisson regression of art on the explanatory variables. What do you conclude from the results of this regression?

c. Perform regression diagnostics on the model fit in the previous question. If you identify any problems, try to deal with them. Are the conclusions of the research altered?

d. Refit Long's model allowing for overdispersion (using a quasi-Poisson or negative binomial model). Does this make a difference to the results?
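
Parts a and d both rest on the fact that a Poisson distribution has equal mean and variance. A rough first check, sketched below in Python with made-up counts (not values from Q2phd.csv), compares the sample variance with the sample mean and compares the observed share of zeros with the share a Poisson distribution with that mean would imply. This is an unconditional check only; the assignment's diagnostics would be run on the fitted model in Stata.

```python
import math
from statistics import mean, variance

# Hypothetical article counts for 15 students (placeholders, not Long's data).
art = [0, 0, 0, 1, 1, 2, 0, 3, 0, 7, 1, 0, 2, 0, 5]

m = float(mean(art))
v = float(variance(art))  # sample variance with n - 1 in the denominator

# A ratio well above 1 suggests overdispersion relative to a Poisson model.
dispersion_ratio = v / m

# Share of zeros observed versus the share implied by Poisson(m).
observed_zero_share = art.count(0) / len(art)
poisson_zero_share = math.exp(-m)

print(f"mean = {m:.3f}, variance = {v:.3f}, ratio = {dispersion_ratio:.3f}")
print(f"zeros: observed {observed_zero_share:.3f}, Poisson {poisson_zero_share:.3f}")
```

When the variance clearly exceeds the mean, Poisson standard errors tend to be too small, which is the motivation for the quasi-Poisson or negative binomial refit in part d.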

Attachment:- Assignment Files.rar

Reviews

len1762000

12/12/2017 5:02:26 AM

Stats, Stata: nonlinear, Poisson, GLM, and logit regression; missing-data analysis; assumptions; etc. Use Stata to answer the following questions. Save .do files with comments explaining your methods and logic. All answers and necessary tables should be included in the document. Be sure to insert equations for models, hypothesis tests, etc. You are expected to run any model tests necessary to formally conduct an analysis, even if a test is not explicitly required by the questions. Report all findings in APA format.

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd