Reference no: EM132397948
STAT 601
Assignment
Answer all questions specified on the problem and include a discussion on how your results answered/addressed the question.
Submit your .rmd file with the knitted PDF (or knitted Word Document saved as a PDF). If you are having trouble with .rmd, let us know and we will help you, but both the .rmd and the PDF are required.
This file can be used as a skeleton document for your code/write up. Please follow the instructions found under Content for Formatting and Guidelines. No code should be in your PDF write-up unless stated otherwise.
For any question asking for plots/graphs, please do as the question asks as well as do the same but using the respective commands in the GGPLOT2 library. (So if the question asks for one plot, your results should have two plots. One produced using the given R-function and one produced from the GGPLOT2 equivalent). This doesn't apply to questions that don't specifically ask for a plot, however I still would encourage you to produce both.
You do not need to include the above statements.
Please do the following problems from the text book R Handbook and stated.
1. An investigator collected data on survival of patients with lung cancer at Mayo Clinic. The investigator would like you, the statistician, to answer the following questions and provide some graphs. Use the cancer data located in the survival package.
a. What is the probability that someone will survive past 300 days?
b. Provide a graph, including 95% confidence limits, of the Kaplan-Meier estimate of the entire study.
c. Is there a difference in the survival rates between males and females? Provide a formal statistical test with a p-value and visual evidence.
d. Is there a difference in the survival rates for the older half of the group versus the younger half? Provide a formal statistical test with a p-value and visual evidence.
2. A healthcare group has asked you to analyse the mastectomy data from the HSAUR3 package, which is the survival times (in months) after a mastectomy of women with breast cancer. The cancers are classified as having metastasized or not based on a histochemical marker. The healthcare group requests that your report should not be longer than one page, and must only consist of one plot, one
table, and one paragraph. Do the following:
a. Plot the survivor functions of each group only using GGPlot, estimated using the Kaplan-Meier estimate.
b. Use a log-rank test to compare the survival experience of each group more formally. Only present a formal table of your results.
c. Write one paragraph summarizing your findings and conclusions.
Attachment:- Program file.rar