##### Reference no: EM13918737

Assignment-

1. Review the predictor variables and guess what their role in a credit decision might be. Are there any surprises in the data?

2. Divide the data into training and validation partitions, and develop classification models using the following data mining techniques in XLMiner: logistic regression, classification trees, and neural networks.

3. Choose one model from each technique and report the confusion matrix and the cost/gain matrix for the validation data. Which technique has the most net profit?

4. Let us try and improve our performance. Rather than accept XLMiner's initial classification of all applicants' credit status, use the "predicted probability of success" in logistic regression (where success means 1) as a basis for selecting the best credit risks first, followed by poorer risk applicants.

a. Sort the validation on "predicted probability of success."

b. For each case, calculate the net profit of extending credit.

c. Add another column for cumulative net profit.

d. How far into the validation data do you go to get maximum net profit? (Often, this is specified as a percentile or rounded to deciles.)

e. If this logistic regression model is scored to future applicants, what "probability of success" cutoff should be used in extending credit?

**Attachment:-** GermanCredit.rar