What are significant predictors of CHD?

Reference no: EM13246805

The data are a retrospective sample of males in a heart-disease high-risk region of the Western Cape, South Africa. There are roughly two controls per case of CHD. Many of the CHD-positive men have undergone blood pressure reduction treatment and other programs to reduce their risk factors after their CHD event. In some cases the measurements were made after these treatments. These data are taken from a larger dataset, described in Rousseauw et al., 1983, South African Medical Journal.

There are 463 observations in the dataset. The variables in the dataset are:

sbp - systolic blood pressure
tobacco - cumulative tobacco (kg)
ldl - low density lipoprotein cholesterol
adiposity
famhist - family history of heart disease (Present, Absent)
typea - type-A behavior
obesity
alcohol - current alcohol consumption
age - age at onset
chd - response, coronary heart disease

If you would prefer to analyze these data using some other statistical package, you will need to export the data from R using something like a write.table command (or some variation thereof).

The following questions are of practical interest:

1. What are significant predictors of CHD? What would a final model look like, and can you provide an estimate of its predictive accuracy (i.e., do model selection and then evaluate predictive accuracy)? What functional forms are most appropriate for the various predictors in your final model?

2. Since high ldl often precedes a diagnosis of CHD, will a two-stage model, which first uses ldl as a response in stage 1 and then CHD as a response in stage 2, provide more accurate predictions of CHD than the model built in question 1 above?

3. There are often situations where finding just one obviously best submodel is difficult. There may be many good competing submodels.

However, you might decide to bring together multiple models to improve predictive performance. Develop a strategy for doing this on this dataset, being careful to clearly compare its predictive performance with that of the single-model approach. Also, make sure to clearly motivate your strategy, giving enough intuition so that I can follow things easily.
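As one possible starting point for the "evaluate predictive accuracy" part of question 1, here is a minimal sketch in Python of k-fold cross-validation around a logistic regression fitted by plain gradient descent. It is an illustration under stated assumptions, not a prescribed method: the function names (`fit_logistic`, `cv_accuracy`), the 0.5 classification cutoff, the learning rate, and the number of folds are all choices made for this sketch, and the data are synthetic stand-ins generated inside the script, not the CHD data. Predictors are assumed to be roughly standardized so that gradient descent behaves well.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict_proba(w, x):
    # w[0] is the intercept, w[1:] are the slopes.
    return sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], x)))

def fit_logistic(X, y, lr=0.5, epochs=200):
    """Fit intercept and slopes by batch gradient descent on the mean log-loss."""
    n, p = len(X), len(X[0])
    w = [0.0] * (p + 1)
    for _ in range(epochs):
        grad = [0.0] * (p + 1)
        for xi, yi in zip(X, y):
            err = predict_proba(w, xi) - yi
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w

def cv_accuracy(X, y, k=5, seed=0):
    """k-fold cross-validated classification accuracy at a 0.5 cutoff."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint folds covering all rows
    correct = 0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        # The model is refitted without the held-out fold on every pass.
        w = fit_logistic([X[i] for i in train], [y[i] for i in train])
        correct += sum((predict_proba(w, X[i]) >= 0.5) == (y[i] == 1) for i in fold)
    return correct / len(X)

# SYNTHETIC stand-in data (NOT the CHD data): two standardized predictors,
# of which only the first actually drives the outcome.
rng = random.Random(1)
X, y = [], []
for _ in range(300):
    x1, x2 = rng.gauss(0, 1), rng.gauss(0, 1)
    X.append([x1, x2])
    y.append(1 if rng.random() < sigmoid(-0.5 + 1.5 * x1) else 0)

acc_full = cv_accuracy(X, y)                  # candidate model with x1 and x2
acc_x1 = cv_accuracy([[r[0]] for r in X], y)  # candidate model with x1 only
print("CV accuracy, x1 + x2:", acc_full)
print("CV accuracy, x1 only:", acc_x1)
```

Comparing candidate models on held-out folds, as in the last lines, avoids the optimism of judging a model on the same observations used to fit it. Note that if model selection itself is done on the full dataset before cross-validating, the resulting accuracy estimate will still be optimistic; the selection step would need to be repeated inside each fold.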

Please provide complete justifications for why you chose a particular modeling strategy, including the underlying assumptions you are making. Analyze the data and provide some overall inferences with regard to the questions being posed. Write a report that details your analysis.
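For the model-combination idea in question 3, one simple strategy (among several possible ones) is to average the predicted probabilities of the competing models and compare the result with each single model on held-out data. The sketch below assumes equal weights by default and uses the Brier score (mean squared error of the predicted probabilities) as the comparison metric; both are choices made for this illustration. The outcome and probability vectors are made-up numbers for demonstration only, not results from the CHD data, and the names `brier` and `average_models` are hypothetical helpers.

```python
def brier(probs, y):
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    return sum((p - yi) ** 2 for p, yi in zip(probs, y)) / len(y)

def average_models(prob_lists, weights=None):
    """Weighted average of several models' predicted probabilities."""
    m = len(prob_lists)
    if weights is None:
        weights = [1.0] * m  # equal weights unless told otherwise
    total = sum(weights)
    n = len(prob_lists[0])
    return [
        sum(w * probs[i] for w, probs in zip(weights, prob_lists)) / total
        for i in range(n)
    ]

# MADE-UP held-out outcomes and predictions from three hypothetical models.
y_test = [1, 0, 0, 1, 1, 0]
model_a = [0.9, 0.4, 0.2, 0.6, 0.3, 0.1]
model_b = [0.6, 0.1, 0.5, 0.8, 0.7, 0.4]
model_c = [0.7, 0.3, 0.3, 0.4, 0.6, 0.2]

ensemble = average_models([model_a, model_b, model_c])

for name, probs in [("a", model_a), ("b", model_b), ("c", model_c),
                    ("ensemble", ensemble)]:
    print(name, brier(probs, y_test))
```

The intuition is that squared error is convex, so the Brier score of an equally weighted average can never be worse than the average of the individual models' Brier scores, although it can still be worse than the single best model. For a fair comparison, the probabilities fed into the average should come from observations that were not used to fit the models, for example from cross-validation folds.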

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd