### What is the accuracy of the logistic regression classifier

##### Reference no: EM132234453

Project: Fit a Logistic Regression Model to the Thoracic Surgery Binary Dataset - Part 1

For this project, you will be working with the thoracic surgery dataset from the University of California, Irvine (UCI) Machine Learning Repository. This dataset contains information on life expectancy in lung cancer patients after surgery.

The underlying thoracic surgery data is in ARFF format, a text-based format that includes information on each of the attributes. You can load this data using an R package such as foreign or by cutting and pasting the data section into a CSV file.
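As a minimal sketch of the first loading option, the snippet below writes a tiny stand-in ARFF file to a temporary location and reads it back with `read.arff()` from the foreign package. The attribute names (`AGE`, `Smoker`, `Risk1Y`) and the four data rows are made up for illustration; for the real assignment you would pass the path of the downloaded ARFF file instead.

```r
# Sketch: read an ARFF file into a data frame with foreign::read.arff().
library(foreign)

# A tiny stand-in ARFF file. The attributes and rows are hypothetical,
# not the real thoracic surgery data.
arff_lines <- c(
  "@relation toy_surgery",
  "@attribute AGE numeric",
  "@attribute Smoker {T,F}",
  "@attribute Risk1Y {T,F}",
  "@data",
  "60,T,F",
  "51,F,F",
  "72,T,T",
  "66,F,F"
)
arff_path <- tempfile(fileext = ".arff")
writeLines(arff_lines, arff_path)

# Numeric attributes become numeric columns; nominal ones become factors.
surgery <- read.arff(arff_path)
str(surgery)
```

Checking the result with `str()` before modelling is a cheap way to confirm that the outcome column was read as a factor rather than as text.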

a. Fit a binary logistic regression model to the dataset that predicts whether or not the patient survived for one year (the Risk1Y variable) after the surgery. Use the glm() function to perform the logistic regression. See Generalized Linear Models for an example. Include a summary using the summary() function in your results.

b. According to the summary, which variables had the greatest effect on the survival rate?

c. To compute the accuracy of your model, use the dataset to predict the outcome variable. The percentage of correct predictions is the accuracy of your model. What is the accuracy of your model?
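The three steps above can be sketched as follows. This is an illustration on simulated data, not a solution for the real dataset: the predictors `age`, `fev1`, and `smoker`, the simulated coefficients, and the 0.5 classification threshold are all assumptions made for the example.

```r
# Sketch: fit a logistic regression with glm(), inspect it with summary(),
# and compute accuracy on the same data. All data here is simulated.
set.seed(1)
n <- 200
toy <- data.frame(
  age    = round(rnorm(n, mean = 62, sd = 8)),
  fev1   = rnorm(n, mean = 2.5, sd = 0.6),
  smoker = factor(sample(c("F", "T"), n, replace = TRUE))
)

# Simulate a binary outcome whose log-odds depend on age and smoking only.
log_odds <- -6 + 0.08 * toy$age + 0.9 * (toy$smoker == "T")
toy$Risk1Y <- factor(ifelse(runif(n) < plogis(log_odds), "T", "F"),
                     levels = c("F", "T"))

# family = binomial gives logistic regression; with a factor response,
# glm() models the probability of the second level ("T").
model <- glm(Risk1Y ~ ., data = toy, family = binomial)

# The coefficient table (estimates, z values, Pr(>|z|)) is the usual
# starting point for judging which variables matter.
print(summary(model))

# Predicted probabilities, converted to class labels at a 0.5 threshold.
prob <- predict(model, type = "response")
pred <- factor(ifelse(prob > 0.5, "T", "F"), levels = c("F", "T"))

# Confusion table and accuracy (share of correct predictions).
print(table(predicted = pred, actual = toy$Risk1Y))
accuracy <- mean(pred == toy$Risk1Y)
print(accuracy)
```

Because the model is scored on the same rows it was fitted to, this accuracy tends to be optimistic; a held-out split would give a more honest estimate, though the assignment as written asks only for the in-sample figure.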

Project: Fit a Logistic Regression Model to Previous Dataset - Part 2

Fit a logistic regression model to the binary-classifier-data.csv dataset from the previous assignment.

Dataset (use previous data): binary-classifier-data.csv is attached.

a. What is the accuracy of the logistic regression classifier?

b. How does the accuracy of the logistic regression classifier compare to the nearest neighbors algorithm?

c. Why is the accuracy of the logistic regression classifier different from that of the nearest neighbors?
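One way to explore questions b and c is to score both classifiers on the same data. The sketch below does not use binary-classifier-data.csv; it simulates two classes arranged as an inner disc and an outer ring, a layout chosen on purpose so that no straight line separates them. The column names `x`, `y`, and `label` and the choice of k = 5 are assumptions for the example, and `knn()` comes from the class package.

```r
# Sketch: compare logistic regression with k-nearest neighbors on
# simulated data whose class boundary is not linear.
library(class)

set.seed(3)
n <- 100
theta <- runif(2 * n, 0, 2 * pi)
# Class 0: points inside radius 1. Class 1: points in a ring, radius 2 to 3.
r <- c(runif(n, 0, 1), runif(n, 2, 3))
toy <- data.frame(
  x     = r * cos(theta),
  y     = r * sin(theta),
  label = factor(rep(c(0, 1), each = n))
)

# Logistic regression with linear terms only: a straight-line boundary.
fit <- glm(label ~ x + y, data = toy, family = binomial)
pred_glm <- ifelse(predict(fit, type = "response") > 0.5, "1", "0")
acc_glm <- mean(pred_glm == toy$label)

# k-nearest neighbors: the boundary follows the local neighborhood.
features <- toy[, c("x", "y")]
pred_knn <- knn(train = features, test = features, cl = toy$label, k = 5)
acc_knn <- mean(pred_knn == toy$label)

print(c(logistic = acc_glm, knn = acc_knn))
```

On data like this, the linear decision boundary of logistic regression is the limiting factor, while nearest neighbors adapts to the ring shape. Whether the same gap appears on the assignment dataset depends on how its classes are arranged, so plotting the points first is a reasonable check.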

Attachment: Assignment Files.rar

#### What is the difference between a Bernoulli distribution

What is the difference between a Bernoulli distribution and a Poisson distribution? How can they be applied to a standardized normal distribution, specifically when looking at

#### What inputs do you think need reevaluation

Mark realizes that queueing theory helps him only so much in determining the number of representatives needed. What inputs do you think need reevaluation? How would you go abo

#### Obtain a time-series plot and correlogram

Obtain a time-series plot and correlogram of the residuals from (9) and comment on them with regard to the violation of classical assumptions. Test for fourth-order autocorr

#### Which of the variables are quantitative

A description of different houses on the market includes the following three variables. Which of the variables are quantitative - the total number of states represented by the

#### Gathers data on fifty graduate students

A researcher wants to predict graduate-school grade point average (GPA) from GRE verbal scores (which is what the pesky GRE is supposed to do, anyway!). She gathers data on

#### Determine whose weight is closer to average

Jack weighs 160 pounds and his sister weighs 110 pounds. If the mean weight for men his age is 175 with a standard deviation of 14 pounds and the mean weight for women is 145

#### The proportion of businesses that failed is higher for women

Consider a sample of 148 small businesses. During a three-year period, 15 of 106 headed by men and 7 of the 42 headed by women failed. Let p1 be the proportion of busin

#### Describe an independent-samples t-test

As you continue to review SPSS, statistics are important in assessing development and comparisons between groups (means). In SPSS, two group means can be compared to assess