Create a pivot table for the training data

Assignment Help Applied Statistics
Reference no: EM13758259 , Length: 4

Question 1:

Create a pivot table for the training data with Online as a column variable, CC as a row variable, and Loan as a secondary row variable.

The values inside the cells should convey the count (number of records).

Complete the numbers in the table below:

 

 

online=0

online=1

CC=0

Loan=0



CC=0

Loan=1



CC=1

Loan=0



CC=1

Loan=1



Question 2

Consider the task of classifying a customer who owns a bank credit card and is actively using online banking services. Looking at the pivot table that you created, what is the probability that this customer will accept the loan offer?

Question 3

Create two separate pivot tables for the training data. One will have Loan (rows) as a function of Online (columns) and the other will have Loan (rows) as a function of CC.

Compute the probabilities below (report three decimals).

Note: P(A|B) means "the probability of A given B".

1. P(CC = 1|Loan = 1) = the proportion of credit card holders among the loan acceptors = 

2. P(Online = 1|Loan = 1) = 

3. P(Loan = 1) = the proportion of loan acceptors = 

4. P(CC = 1|Loan = 0) = 

5. P(Online = 1|Loan = 0) = 

6. P(Loan = 0) = 

Question 4

Compute the naive Bayes probability P(Loan = 1|CC = 1, Online = 1).

Note: Use the quantities that you computed in the previous question.

Question 5

Of the two values that you computed earlier, which is a more accurate estimate of P(Loan=1|CC=1, Online=1)?

Select one:

The value based on the separate pivot tables (one with CC and Loan, and one with Online and Loan)

The value based on the complete crossed pivot table (with Online, CC, Loan)

Question 6

In XLMiner, run naive Bayes on the data and request Detail Report for the training data. Examine the "Conditional probabilities" table. Which of the entries in this table are needed for computing P(Loan = 1|CC = 1, Online = 1)? Mark all that apply (you may get slightly different but very close probabilities due to software upgrade, use the closest ones for selecting your  s.)

Select one or more:
0.301
0.402
0.374
0.288
0.712
0.699
0.598
0.626

Question 7

In the XLMiner Naive Bayes output, locate the predicted probability for P(Loan=1 | Online = 1, CC = 1). The 4-decimal value is given by...

Reference no: EM13758259

Questions Cloud

Retaining the value of position : Provide analysis showing the net profit from (i) the covered call and (ii) the protective put on the expiration date assuming the stock price has fallen 20%. Which strategy is more effective at retaining the value of your position?
Program implements the functionality of a deck of cards : Write a complete program using "ECLIPS" that implements the functionality of a deck of cards. In writing your program, use the provided DeckDriver and Card classes shown below. Write your own Deck class so that it works in conjunction with the two..
The outpatient center regarding possible bariatric surgery : Previous medical evaluations have not indicated any metabolic diseases, but he says he has high blood pressure, which he tries to control with sodium restriction and sleep apnea. He current works at a catalog telephone center.
Assuming-size of fish population satisfies logistic equation : A biologist stocked a lake with 45 fish and estimated the carrying capacity (the maximal population for the fish of that species in that lake) to be 7,000. The number of fish tripled in the first year. Assuming that the size of the fish population sa..
Create a pivot table for the training data : Create a pivot table for the training data with Online as a column variable, CC as a row variable, and Loan as a secondary row variable - Create two separate pivot tables for the training data.
What forest/domain model should shiv llc implement : What forest/domain model should Shiv LLC implement? What is the domain name? Where should the domain controllers be place? Should RODC be part of the consideration
Writing your own educational philosophy : Writing your own Educational Philosophy. Why do you want to teach? Whom are you going to teach? How and what are you going to teach?
restore credibility and generate positive press reporting : Create a public relations campaign for a financial institution that has recently received negative exposure in the media pertaining to its lack of responsiveness to those wishing to modify existing home loans. The goal of your campaign is to influenc..
Provide the owner with a reasonable rate of return : A business should provide the owner with a reasonable rate of return based upon:

Reviews

Write a Review

Applied Statistics Questions & Answers

  What is the overall accuracy of the test

A screening test for a newly discovered disease is being evaluated. In order to determine the effectiveness of the new test, it was administered to 900 workers; 150 of the individuals diagnosed with the disease tested positive.

  Test of two means

Test of two means.

  Find the linear correlation coefficient for the systolic

Use the Excel Analysis ToolPak to find the linear correlation coefficient for the systolic and diastolic measurements.

  Business research report proposal

Identify a business research topic and define the research questions for the identified problem or opportunity

  Find thecovariance between x and y

Suppose that two students named Gwyneth and Josephine have a total of 20 CDs in their room, consisting of 5 blues CDs and 15 reggae CDs. Each of the students chooses 7 CDs at random (without replacement), with all choices equally likely.  (Thus, a..

  Prepare a report using the numerical methods

Prepare a report using the numerical methods of descriptive statistics presented in this module to learn how the variables contribute to the success of a motion picture.

  Compute the sample variance

Compute the sample mean and compute the sample variance and sample standard deviation - What is the average miles per gallon for city driving?

  Please read through the case scenario shown below to gain

please read through the case scenario shown below to gain an understanding of the background information and the

  The sample average number of strokes being

The sample average number of strokes being more than 5.0.

  Given a business situation word problem

Given a business situation word problem or case study such as one dealing with processing time or quantity of fill, use the normal probability distribution to determine a course of action.

  Question 1 a is a gradual long-term up-or-down movement of

question 1 a is a gradual long-term up-or-down movement of demand.answer seasonal pattern cycle trend

  Question 1you are a data analyst working for the australian

question 1you are a data analyst working for the australian petrol pricing commissioner and have been requested to

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd