What is the accuracy of the logistic regression classifier

Assignment Help Applied Statistics
Reference no: EM132234453

Project: Fit a Logistic Regression Model to the Thoracic Surgery Binary Dataset - Part 1

For this project, you will be working with the thoracic surgery data set from the University of California Irvine machine learning repository. This dataset contains information on life expectancy in lung cancer patients after surgery.

The underlying thoracic surgery data is in ARFF format. This is a text-based format with information on each of the attributes. You can load this data using a package such as foreign or by cutting and pasting the data section into a CSV file.

Instructions: Include all of your answers in a R Markdown report.

a. Fit a binary logistic regression model to the data set that predicts whether or not the patient survived for one year (the Risk1Yvariable) after the surgery. Use the glm() function to perform the logistic regression. See Generalized Linear Models for an example. Include a summary using the summary() function in your results.

b. According to the summary, which variables had the greatest effect on the survival rate?

c. To compute the accuracy of your model, use the dataset to predict the outcome variable. The percent of correct predictions is the accuracy of your model. What is the accuracy of your model?

Project: Fit a Logistic Regression Model to Previous Dataset - Part 2

Include all of your answers in a R Markdown report.

Fit a logistic regression model to the binary-classifier-data.csv dataset from the previous assignment.

Dataset (use previous data): binary-classifier-data.csv is attached.

a. What is the accuracy of the logistic regression classifier?

b. How does the accuracy of the logistic regression classifier compare to the nearest neighbors algorithm?

c. Why is the accuracy of the logistic regression classifier different from that of the nearest neighbors?

Attachment:- Assignment Files.rar

Reference no: EM132234453

Questions Cloud

Disgruntled customer is returning damaged suit jacket : A disgruntled customer is returning a damaged suit jacket he bought the previous week that he needed for an event that night. He is extremely upset.
What factors that help decide when a team : What factors that help decide when a team should be used for a workplace task/initiative.
Prepare a statement that describes the product life cycle : In a 1 page essay prepare a statement that describes the product life cycle and provide an example used in healthcare.
Create a full crud front end in visual studio : Create a full CRUD - Create, Read, Update, and Delete front end in Visual Studio - The database must be operated on using Entity Framework, with LINQ
What is the accuracy of the logistic regression classifier : Project: Fit a Logistic Regression Model to Previous Dataset - Part 2. What is the accuracy of the logistic regression classifier
Important the master schedule covers : While there is no set time period a master schedule has to cover, it is important the master schedule covers the...
Discuss benefit plan you currently have through employer : If you are currently covered under a benefit plan, outline and discuss the benefit plan you currently have through your employer.
Accomplishment of organizational goals and values : Recommendations regarding an expansion of the benefits programs offered at the company that would further align HR with the accomplishment of organizational
Side of google being run as flexible-flat technocracy : What are the advantages and disadvantages of the creative side of Google being run as a flexible and flat “technocracy”

Reviews

Write a Review

Applied Statistics Questions & Answers

  Walker county health trends memorandum

You get a text message from your Medical Director.  As usual he is moonlighting in Huntsville.  He has decided that he is spending so much time there, he may want to buy a condo.  But before he does, he wants to check on health conditions in the area..

  An extensive study involving thousands of british children

1. In an extensive study involving thousands of British children, Arden and Plomin (2006) found significantly higher variance in the intelligence scores for males than for females. Following are hypothetical data, similar to the res..

  What was the basic goal of this article

A group of researchers set out to determine if there was a significant difference in family communication types, What was the basic goal of this article

  An airline''s goal is to fill the plane as much as possible

On any given flight, an airline's goal is to fill the plane as much as possible without overbooking. If, on average, 10% of customers cancel their tickets, all independently of each other, what is the probability that a particular flight will be o..

  What sampling method is used to select your sample data

Statistics for Business and Finance (BUS5SBF). In this assignment you will be analysing and interpreting household data. Organise your sample data in a spreadsheet as per the instructions in the Excel sheet. What sampling method is used to select you..

  What are the tax consequences of the share certificate

Advise John Jones of the tax consequences of Items 1 – 6, above. You should discuss what amounts would be included in his assessable income or, if any item is not assessable income, why that is so. Your answer should include a discussion of the follo..

  Use technology to construct the confidence intervals

In a survey of 601 males ages? 18-64, 392 say they have gone to the dentist in the past year. Construct? 90% and? 95% confidence intervals for the population proportion. Interpret the results and compare the widths of the confidence intervals. If? co..

  A sociologist cites a study showing that, in a particular

A second researcher doubts these findings, believing that the actual figure is higher. To attempt to resolve the question, a simple random sample of 60 preschool children is chosen, and their TV watching habits are measured by having their parents ke..

  Does the advertisement seem to have been effective

What proportion of the individuals surveyed who recalled seeing the advertisement had purchased Fresh? Based on your answer to part a, does the advertisement seem to have been effective? Explain.

  What things may correlate but not be causal. correlation doe

Foot size correlates positively with math skills. This is because younger children have tiny feet and do not know math. Correlation does not mean causation. Sometimes we make that error. What things may correlate, but not be causal?

  What is estimated CI of average blood vitamin D

What is the estimated 95% confidence interval (CI) of the average blood vitamin D level of US landscapers in ng/mL

  Set up a confidence interval estimate of the proportion

The labor relations manager of a large corporation wished to study the absenteeism among workers at the company's central office during the last year. A random sample of 25 workers revealed the following: an average of 9.7 days; a standard deviation ..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd