Plot an appropriately labelled graph with age

Assignment Help Applied Statistics
Reference no: EM132367541

Assignment - STATA Questions

Please provide a full record of your software code and software output in an appendix.

The attached dataset contains survey responses from 2500 women aged over 70.

This dataset was created to assess selected risk factors for depression. A summary of the dataset is provided in Table 1.

Table 1: Depression dataset

| Variable Name | Description | Key |
|---|---|---|
| studyno | Unique identifier | |
| Age | Age in years at the time of survey completion | |
| social_support_tertiles | Tertiles of the social support scale | 1=In the lowest tertile of social support; 2=In the middle tertile of social support; 3=In the highest tertile of social support (e.g. those with social_support_tertile=3 are in the third who have the highest level of social support) |
| depression | In the last 3 years have you been told by a doctor that you have Depression | 0=No; 1=Yes |

Q1. Using a chi-squared test and a t-test, assess the association between age and depression, and social support and depression. Present your results in a table which would be suitable for inclusion in a scientific paper. Under the table describe and interpret these results.

[Note. For this question do not fit a statistical model, and look at each exposure variable one by one].
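As a rough illustration of the chi-squared part of Q1, the Python sketch below computes the Pearson statistic for a 3x2 table of social support tertile by depression. It is not STATA output, the function name `chi2_test_3x2` is made up for this example, and the counts are placeholders rather than values from the assignment dataset.

```python
import math


def chi2_test_3x2(table):
    """Pearson chi-squared test for a 3x2 table given as rows of [no, yes] counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    # A 3x2 table has (3 - 1) * (2 - 1) = 2 degrees of freedom. For 2 df the
    # chi-squared upper-tail probability has the closed form exp(-x / 2).
    p_value = math.exp(-stat / 2)
    return stat, p_value


# Placeholder counts (NOT from the dataset): rows are tertiles 1 to 3,
# columns are [not depressed, depressed].
example_table = [[70, 30], [80, 20], [90, 10]]
stat, p_value = chi2_test_3x2(example_table)
print(f"chi2 = {stat:.2f}, p = {p_value:.4f}")
```

The closed-form p-value only holds for 2 degrees of freedom, which is why the helper is restricted to a 3x2 table.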

Q2a. Create a 'collapsed' dataset which records the number of depression records in each category of social support. [For this task you can temporarily ignore the age variable. I am not assessing the procedure you used to create the data here, as long as the numbers are correct].

Using this grouped version of the data, use software to run a logistic regression model which assesses the association between social support [as a categorical variable] and depression. Carefully interpret your results.
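A logistic model whose only covariate is a three-level categorical exposure is saturated for the collapsed table, so its estimated odds ratios should match the crude odds ratios computed directly from the grouped counts. The Python sketch below shows that hand calculation as a cross-check on software output. The helper name `grouped_odds_ratios` and the counts are assumptions for illustration only, not values from the dataset.

```python
import math


def grouped_odds_ratios(counts, reference=1):
    """Crude odds ratios versus a reference level.

    counts maps tertile -> (depressed, not_depressed).
    Returns tertile -> (odds_ratio, standard_error_of_log_odds_ratio).
    """
    ref_cases, ref_noncases = counts[reference]
    ref_odds = ref_cases / ref_noncases
    results = {}
    for level, (cases, noncases) in counts.items():
        if level == reference:
            continue
        odds_ratio = (cases / noncases) / ref_odds
        # Standard error of the log odds ratio for a 2x2 comparison.
        se_log_or = math.sqrt(
            1 / cases + 1 / noncases + 1 / ref_cases + 1 / ref_noncases
        )
        results[level] = (odds_ratio, se_log_or)
    return results


# Placeholder counts (NOT from the dataset).
example_counts = {1: (30, 70), 2: (20, 80), 3: (10, 90)}
for level, (odds_ratio, se) in grouped_odds_ratios(example_counts).items():
    print(f"tertile {level} vs 1: OR = {odds_ratio:.3f}, SE(log OR) = {se:.3f}")
```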

Q2b. Use software to run 'the same model' on the individual (non-collapsed) data. Present the software output and highlight that this model gives us the same estimates of association between social support and depression as the model in 2a.

Q3a. Using software, fit a logistic regression model to assess whether there is an association between age and depression in this sample [including only age and depression]. Interpret the estimated age coefficient (and confidence interval and p-value).

Q3b. Use a Wald test to test whether the log(OR) associated with a 1-unit increase in age is greater than ln(1.1).
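One way to set up the test in Q3b is as a one-sided Wald z-test of the age coefficient against the null value ln(1.1). The Python sketch below assumes the estimate and its standard error are read from the Q3a output. The function name and the numbers passed to it are placeholders, not results from the dataset.

```python
import math


def one_sided_wald(beta_hat, se, null_value=math.log(1.1)):
    """Wald z statistic and upper-tail p-value for H1: beta > null_value."""
    z = (beta_hat - null_value) / se
    # Upper-tail standard normal probability, P(Z > z).
    p_upper = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_upper


# Placeholder estimate and standard error (NOT from the dataset).
z, p_upper = one_sided_wald(beta_hat=0.12, se=0.02)
print(f"z = {z:.3f}, one-sided p = {p_upper:.4f}")
```

Software Wald tests are often reported as two-sided, so the one-sided p-value may need to be derived from the z statistic in this way.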

Q3c. Using the model from part 3a, plot an appropriately labelled graph with age on the x-axis and the predicted log odds of depression on the y-axis.

Q3d. Detail how the value of the log-likelihood presented in your software output in 3a was calculated.
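For Q3d, the log-likelihood of a logistic model is the sum over individuals of y*log(p) + (1 - y)*log(1 - p), where p is the fitted probability of depression. The Python sketch below evaluates that sum for given coefficients. The function name, the toy ages and outcomes, and the coefficient values are all assumptions for illustration, not values from the dataset.

```python
import math


def logistic_log_likelihood(ages, outcomes, intercept, age_coef):
    """Bernoulli log-likelihood of a logistic model with age as the only covariate."""
    total = 0.0
    for age, y in zip(ages, outcomes):
        # Fitted probability from the inverse logit of the linear predictor.
        p = 1.0 / (1.0 + math.exp(-(intercept + age_coef * age)))
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return total


# Toy data and placeholder coefficients (NOT from the dataset).
toy_ages = [71, 75, 80, 90]
toy_outcomes = [0, 1, 0, 1]
print(logistic_log_likelihood(toy_ages, toy_outcomes, intercept=-4.0, age_coef=0.05))
```

Plugging in the fitted coefficients from the software output and the full dataset should reproduce the reported log-likelihood, up to rounding.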

Q4a. Using software, fit a single logistic regression model which assesses the association between the exposures social support [as a categorical variable] and age, and the outcome depression. Interpret the coefficients produced from this model.

Using software, run a likelihood ratio test to assess the statistical significance of adding social support (as a categorical variable) to a more basic model which just includes the exposure age.

Q4b. What is the null and alternative hypothesis for this likelihood ratio test?

Q4c. How do you interpret the results of the likelihood ratio test?

Q4d. Using the model output from the relevant separate models (i.e. the log likelihood values) calculate the chi-squared statistic for this likelihood ratio test by hand.
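The hand calculation in Q4d follows from the usual likelihood ratio statistic, 2 x (log-likelihood of the fuller model - log-likelihood of the basic model). Adding a three-level categorical variable adds two parameters, so the statistic is compared with a chi-squared distribution on 2 degrees of freedom. The Python sketch below shows the arithmetic. The function name and the log-likelihood values are placeholders, not output from the dataset.

```python
import math


def lr_test_two_df(ll_reduced, ll_full):
    """Likelihood ratio statistic and p-value for two added parameters."""
    stat = 2.0 * (ll_full - ll_reduced)
    # For 2 degrees of freedom the chi-squared upper tail is exp(-x / 2).
    p_value = math.exp(-stat / 2.0)
    return stat, p_value


# Placeholder log-likelihoods (NOT from the dataset).
stat, p_value = lr_test_two_df(ll_reduced=-1000.0, ll_full=-995.0)
print(f"LR chi2(2) = {stat:.2f}, p = {p_value:.4f}")
```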

Q5. Use statistical software and the Hosmer-Lemeshow method to assess how your model from Q4a (that includes age and social support) fits the data. Interpret the output produced. Briefly comment on possible limitations of the Hosmer-Lemeshow technique.
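To make the Hosmer-Lemeshow output in Q5 easier to interpret, the Python sketch below shows one common form of the statistic: observations are sorted by predicted probability, split into groups, and observed and expected counts of depression are compared within each group. The function name is made up, and the grouping rule here (equal-sized slices of the sorted data) is an assumption. Statistical packages may handle ties and group boundaries differently, so the result need not match software output exactly.

```python
def hosmer_lemeshow_statistic(probabilities, outcomes, groups=10):
    """Hosmer-Lemeshow statistic; compare with chi-squared on (groups - 2) df.

    Assumes every predicted probability lies strictly between 0 and 1.
    """
    # Sort by predicted probability only, keeping tied observations in input order.
    pairs = sorted(zip(probabilities, outcomes), key=lambda pair: pair[0])
    n = len(pairs)
    stat = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        size = len(chunk)
        if size == 0:
            continue
        observed = sum(y for _, y in chunk)
        expected = sum(p for p, _ in chunk)
        stat += (observed - expected) ** 2 / (expected * (1 - expected / size))
    return stat


# Toy predictions and outcomes (NOT from the dataset), split into 2 groups.
print(hosmer_lemeshow_statistic([0.2, 0.2, 0.8, 0.8], [0, 1, 1, 1], groups=2))
```

Because the statistic depends on how many groups are used and where the cut points fall, different choices can give different conclusions for the same model, which is one of the limitations usually raised for this method.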

Q6a. Fit a logistic regression model with depression as the outcome, which includes age and social support as independent variables. This time include social support as a linear (trend) term as opposed to a categorical variable.

Interpret the results from your model. Explain whether you would prefer to present the results of the model from Q6a or Q4a.

Explain why we would not use the likelihood ratio test to compare the models from Q6a and Q4a.

Q6b. From this model in Q6a what is the predicted probability of depression for someone aged 75.25 and in the highest social support tertile?
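The prediction in Q6b comes from applying the inverse logit to the linear predictor: intercept + (age coefficient x 75.25) + (trend coefficient x 3). This assumes the trend term uses the 1, 2, 3 coding from Table 1. The Python sketch below shows the arithmetic. The function name and the coefficient values are placeholders, not estimates from the dataset.

```python
import math


def predicted_probability(intercept, age_coef, trend_coef, age=75.25, tertile=3):
    """Predicted probability of depression from a logistic model with a linear trend term."""
    linear_predictor = intercept + age_coef * age + trend_coef * tertile
    # Inverse logit converts log odds to a probability.
    return 1.0 / (1.0 + math.exp(-linear_predictor))


# Placeholder coefficients (NOT from the dataset).
print(predicted_probability(intercept=-3.0, age_coef=0.03, trend_coef=-0.4))
```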

Note - The attached data file is to be used to solve the above questions. The questions should be solved using STATA software.

Attachment:- Data File.rar
