Reference no: EM131116006
Question on data mining
Your task is to predict the output variable "choice" based on 16 input features: x1, x2, ....,x15, x16.The output "choice" is a categorical variable that can take 5 possible values: "M", "B", "J", P", and "O".The first 8 input features (x1, x2, ....,x8) are binary variables. The last 8 input features (x9, x10, ....,x16) are continuous variables.
1. Train a decision tree inductive learning model on the data from the CSV file "finalQ3Train.csv" that contains 1500 examples.
2. Express your trained model in the form of IF ... THEN rules. Test your trained model on the 500 examples from the CSV file "finalQ3Test.csv" and present your confusion matrix.
3. Predict values for "choice" for the 8 examples in the csv file "finalQ3newCases.csv". The examples are shown below
x1
|
x2
|
x3
|
x4
|
x5
|
x6
|
x7
|
x8
|
x9
|
x10
|
x11
|
x12
|
x13
|
x14
|
x15
|
x16
|
1
|
1
|
1
|
1
|
1
|
0
|
1
|
0
|
0.0284
|
0.2196
|
0.5259
|
0.6206
|
0.0950
|
0.3350
|
0.2470
|
0.9676
|
1
|
1
|
0
|
1
|
1
|
0
|
0
|
1
|
0.7419
|
0.9260
|
0.4711
|
0.8340
|
0.8770
|
0.1129
|
0.4805
|
0.7469
|
0
|
0
|
1
|
0
|
1
|
0
|
1
|
1
|
0.3867
|
0.9002
|
0.4240
|
0.6029
|
0.5547
|
0.6674
|
0.1499
|
0.4527
|
0
|
1
|
0
|
1
|
1
|
0
|
0
|
0
|
0.8848
|
0.0752
|
0.1195
|
0.3625
|
0.1565
|
0.1205
|
0.7666
|
0.4188
|
1
|
0
|
0
|
0
|
1
|
1
|
1
|
0
|
0.2893
|
0.0067
|
0.1855
|
0.6999
|
0.5777
|
0.5959
|
0.0324
|
0.8211
|
1
|
1
|
1
|
1
|
1
|
1
|
1
|
1
|
0.7549
|
0.3705
|
0.3349
|
0.8772
|
0.9453
|
0.2476
|
0.3782
|
0.1878
|
1
|
1
|
1
|
1
|
0
|
1
|
1
|
1
|
0.7921
|
0.1539
|
0.9011
|
0.5596
|
0.7125
|
0.1035
|
0.0587
|
0.2399
|
0
|
0
|
1
|
0
|
1
|
0
|
0
|
0
|
0.7190
|
0.8441
|
0.5841
|
0.8670
|
0.7620
|
0.8794
|
0.3351
|
0.4677
|
Indicate how unrealized holding gains and losses
: Indicate how unrealized holding gains and losses should be reported for investment securities classified as trading, available-for-sale, and held-to-maturity.
|
What is the difference between an edge act bank
: What is the difference between an Edge Act bank and an international banking facility?
|
What is an offshore center
: What is an offshore center?
|
Prepare the journal entry at december
: If the bonds in question 8 are classified as available-for sale and they have a fair value at December 31, 2010, of $3,604,000, prepare the journal entry (if any) at December 31, 2010, to record this transaction.
|
Task is to predict output variable choice based on 16 input
: Your task is to predict the output variable "choice" based on 16 input features: x1, x2, ....,x15, x16.The output "choice" is a categorical variable that can take 5 possible values: "M", "B", "J", P", and "O".The first 8 input features (x1, x2, ....,..
|
Will an mnc issuing debt in low interest rate currencies
: Will an MNC issuing debt in low-interest-rate currencies necessarily lower its cost of funds? Why?
|
Low capacity for exercise
: A study of the effects of exercise used rats bred to have high or low capacity for exercise. There were 8 high-capacity and 8 low-capacity rats.
|
What is the difference between a foreign branch
: What is the difference between a foreign branch and a subsidiary bank?
|
Measure of attachment to friends
: One of the response variables was a measure of attachment to friends (roughly, secure relationships), measured by the Inventory of Parent and Peer Attachment. The results are summarized in the table below.
|