Define a binary outcome variable

Assignment Help Other Subject
Reference no: EM132379033

STAT 601 Homework -

Answer all questions specified on the problem and include a discussion on how your results answered/addressed the question. Submit your .rmd file with the knitted PDF.

Please do the following problems from the text book R Handbook and stated.

1. Collett (2003) argues that two outliers need to be removed from the plasma data. Try to identify those two unusual observations by means of a scatterplot.

2. (Multiple Regression) Continuing from the lecture on the hubble data from gamair library;

a) Fit a quadratic regression model, i.e., a model of the form

Model 2: velocity = β1 × distance + β2 × distance2 + ε

b) Plot the fitted curve from Model 2 on the scatterplot of the data.

c) Add the simple linear regression fit (fitted in class) on this plot - use different color and line type to differentiate the two and add a legend to your plot.

d) Which model do you consider most sensible considering the nature of the data - looking at the plot?

e) Which model is better? - provide a statistic to support you claim.

Note: The quadratic model here is still regarded as a linear regression" model since the term-linear" relates to the parameters of the model and not to the powers of the explanatory variables.

3. The leuk data from package MASS shows the survival times from diagnosis of patients suffering from leukemia and the values of two explanatory variables, the white blood cell count (wbc) and the presence or absence of a morphological characteristic of the white blood cells (ag).

a) Define a binary outcome variable according to whether or not patients lived for at least 24 weeks after diagnosis. Call it surv24.

b) Fit a logistic regression model to the data with surv24 as response. It is advisable to transform the very large white blood counts to avoid regression coefficients very close to 0 (and odds ratio close to 1). You may use log transformation.

c) Construct some graphics useful in the interpretation of the final model you fit.

d) Fit a model with an interaction term between the two predictors. Which model fits the data better? Justify your answer.

4. Load the Default dataset from ISLR library. The dataset contains information on ten thousand customers. The aim here is to predict which customers will default on their credit card debt. It is a four-dimensional dataset with 10000 observations. The question of interest is to predict individuals who will default . We want to examine how each predictor variable is related to the response (default). Do the following on this dataset

a) Perform descriptive analysis on the dataset to have an insight. Use summaries and appropriate exploratory graphics to answer the question of interest.

b) Use R to build a logistic regression model.

c) Discuss your result. Which predictor variables were important? Are there interactions?

d) How good is your model? Assess the performance of the logistic regression classifier. What is the error rate?

5. Run all the codes (additional exploration of data is allowed) and write your own version of explanation and interpretation.

Attachment:- Assignment Files.rar

Reference no: EM132379033

Questions Cloud

Explain why the reproductive system : In ten sentences or less explain why the reproductive system is among the most important systems in the entire body. Please cite your source
Description of the four-chambered heart : Read the summary. What three examples of form following function are evident in this description of the four-chambered heart?
Location in the body of each specific epithelial tissue : Discuss the structure, function, and location in the body of each specific epithelial tissue.
Are us markets becoming less competitive because of mergers : Are US markets becoming less competitive because of mergers and acquisitions? Are US markets becoming more competitive because of new technology?
Define a binary outcome variable : STAT 601 Homework - Define a binary outcome variable according to whether or not patients lived for at least 24 weeks after diagnosis
Explain how and why that affects type 1 diabetics : Another clue you gather as Alice walks into your office is her gait. What specifically would you be looking for when observing the walking gait of a 50 year old
Importance of monitoring weight in type 1 diabetics : You first note Alice's weight; did she lose or gain weight? Explain the importance of monitoring weight in type 1 diabetics?
Distinguish between the male and the female pelvis : Distinguish between the male and the female pelvis. Identify and describe the bones of the lower limb.
Name the bones that make up the coxal bone : List the bones that make up the pelvic girdle and explain why the pelvic girdle is more stable than the pectoral girdle.

Reviews

len2379033

9/30/2019 3:24:22 AM

Answer all questions specified on the problem and include a discussion on how your results answered/addressed the question. Submit your .rmd file with the knitted PDF (or knitted Word Document saved as a PDF). If you are having trouble with .rmd, let us know and we will help you, but both the .rmd and the PDF are required. This file can be used as a skeleton document for your code/write up. Please follow the instructions found under Content for Formatting and Guidelines. No code should be in your PDF write-up unless stated otherwise.

len2379033

9/30/2019 3:24:17 AM

For any question asking for plots/graphs, please do as the question asks as well as do the same but using the respective commands in the GGPLOT2 library. (So if the question asks for one plot, your results should have two plots. One produced using the given R-function and one produced from the GGPLOT2 equivalent). This doesn’t apply to questions that don’t specifically ask for a plot, however I still would encourage you to produce both. You do not need to include the above statements.

Write a Review

Other Subject Questions & Answers

  Evaluate the contributions and criticisms of psychoanalytic

Evaluate the contributions and criticisms of psychoanalytic models to the explanation of human behavior.

  What patient education needs completed for intervention

1. What would be your response? 2. If you decide to prescribe something for her, what would be the first line?

  Evaluate the appropriateness of the analysis used

A brief summary of a peer-reviewed counseling research article related to your chosen topic for your Final Project. In your summary, explain the data analysis. Evaluate the appropriateness of the analysis used. Explain the data analysis in relati..

  What are the issues and trends of the ideal health system

The purpose of this assignment is to allow students to create their "ideal health system" with respect to cost, access, and quality. Students will analyze.

  What were the opsec procedures

Describe what problems you envision the lack of an effective OPSEC program could hold for a local, county, state, tribal, or federal law enforcement agency.

  How quantitative methods differ from qualitative methods

It should be evident that outcome evaluations have historically been conducted with quantitative methods (numbers, statistics).

  Write a comment about the given post

Successful communication between healthcare providers and their patients from different cultural backgrounds depends on developing awareness of the normative cultural values of patients and how these differ from the cultural values of most western..

  Summarize four psychology of combat concepts

Identify and summarize four psychology of combat concepts (PTSD, Hot combat & fire fights, the psychology of killing, and hazing) one per page, that are featured in the film black hawk down by Mark Bowden.

  Why did the speaker refer to this city on the hill

Google the phares "city upon a hill" and find a 20th or 21th century speech which contains that phrase. Write a short paragraph in which you identify the speaker and the occasion. Why did the speaker refer to this "city on the hill"?

  Estimate production in both cases for next seven years

Based on this information, you have been tasked with preparing expansion recommendations for Nanovo (using Excel is optional but recommended). Estimate production in both cases (major and minor expansion) for next 7 years, what is average sale price

  What elements of the heros quest

What elements of the hero's quest are found in Augustine's Confessions?

  Reducing and preventing adverse drug reactions

Write a paper about the persuasive speech describing and defending these suggestions.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd