Reference no: EM132851656
A research study was conducted to predict the incidence of cirrhosis in a population using a few descriptor variables from that population. The variables are:
urbanpop - the size of the urban population
lowbirth - the reciprocal of the number of births to women aged 45 to 49, times 100
wine - consumption of wine per capita
liquor - consumption of hard liquor per capita
cirrhosis - the death rate from cirrhosis
(a) Write a SAS program to read the data file "drinkingdat.txt" into SAS.
(b) Generate the correlation matrix of all the variables.
(c) Is there multicollinearity among the covariates? Please justify your answer.
(d) Please run a multiple linear regression with cirrhosis as the dependent variable and the other variables as independent variables, using (1) forward model selection, (2) backward model selection, and (3) stepwise model selection.
(e) Please write out the estimated multiple linear regression model you chose in (d).
(f) Please examine the goodness of fit for the final model.
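
As a starting point for parts (a) through (d), the following SAS sketch shows one possible setup. It rests on several assumptions that the problem statement does not confirm: that "drinkingdat.txt" is a whitespace-delimited text file with no header row, that it sits in the SAS working directory, and that its columns appear in the order in which the variables are listed above. The dataset name drinking, the model labels, and the 0.05 entry and stay thresholds are illustrative choices only.

```sas
/* (a) Read the raw text file into a SAS dataset.                      */
/* Assumed layout: five whitespace-delimited numeric columns, no       */
/* header row. If the file has a header line, add FIRSTOBS=2 to the    */
/* INFILE statement. Adjust the path to where the file actually lives. */
data drinking;
    infile "drinkingdat.txt";
    input urbanpop lowbirth wine liquor cirrhosis;
run;

/* (b) Pearson correlation matrix of all five variables. */
proc corr data=drinking;
    var urbanpop lowbirth wine liquor cirrhosis;
run;

/* (c) Collinearity diagnostics for the full model:                    */
/* VIF = variance inflation factors, TOL = tolerances,                 */
/* COLLIN = condition indices and variance proportions.                */
proc reg data=drinking;
    model cirrhosis = urbanpop lowbirth wine liquor / vif tol collin;
run;
quit;

/* (d) Forward, backward, and stepwise selection.                      */
/* SLENTRY and SLSTAY are set to 0.05 here as an assumption; without   */
/* them PROC REG falls back on its own defaults for each method.       */
proc reg data=drinking;
    fwd:  model cirrhosis = urbanpop lowbirth wine liquor
          / selection=forward slentry=0.05;
    bwd:  model cirrhosis = urbanpop lowbirth wine liquor
          / selection=backward slstay=0.05;
    step: model cirrhosis = urbanpop lowbirth wine liquor
          / selection=stepwise slentry=0.05 slstay=0.05;
run;
quit;
```

For part (c), high pairwise correlations among the covariates or large variance inflation factors (a common rule of thumb is a VIF above 10) point toward multicollinearity. For parts (e) and (f), the parameter estimates table from the chosen model supplies the fitted coefficients, and the R-square, adjusted R-square, and overall F test in the PROC REG output are the usual first checks of fit.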