Task is to predict output variable choice based on 16 input

Assignment Help Basic Statistics
Reference no: EM131116006

Question on data mining

Your task is to predict the output variable "choice" based on 16 input features: x1, x2, ....,x15, x16.The output "choice" is a categorical variable that can take 5 possible values: "M", "B", "J", P", and "O".The first 8 input features (x1, x2, ....,x8) are binary variables. The last 8 input features (x9, x10, ....,x16) are continuous variables.

1. Train a decision tree inductive learning model on the data from the CSV file "finalQ3Train.csv" that contains 1500 examples.

2. Express your trained model in the form of IF ... THEN rules. Test your trained model on the 500 examples from the CSV file "finalQ3Test.csv" and present your confusion matrix.

3. Predict values for "choice" for the 8 examples in the csv file "finalQ3newCases.csv". The examples are shown below

x1

x2

x3

x4

x5

x6

x7

x8

x9

x10

x11

x12

x13

x14

x15

x16

1

1

1

1

1

0

1

0

0.0284

0.2196

0.5259

0.6206

0.0950

0.3350

0.2470

0.9676

1

1

0

1

1

0

0

1

0.7419

0.9260

0.4711

0.8340

0.8770

0.1129

0.4805

0.7469

0

0

1

0

1

0

1

1

0.3867

0.9002

0.4240

0.6029

0.5547

0.6674

0.1499

0.4527

0

1

0

1

1

0

0

0

0.8848

0.0752

0.1195

0.3625

0.1565

0.1205

0.7666

0.4188

1

0

0

0

1

1

1

0

0.2893

0.0067

0.1855

0.6999

0.5777

0.5959

0.0324

0.8211

1

1

1

1

1

1

1

1

0.7549

0.3705

0.3349

0.8772

0.9453

0.2476

0.3782

0.1878

1

1

1

1

0

1

1

1

0.7921

0.1539

0.9011

0.5596

0.7125

0.1035

0.0587

0.2399

0

0

1

0

1

0

0

0

0.7190

0.8441

0.5841

0.8670

0.7620

0.8794

0.3351

0.4677

Reference no: EM131116006

Questions Cloud

Indicate how unrealized holding gains and losses : Indicate how unrealized holding gains and losses should be reported for investment securities classified as trading, available-for-sale, and held-to-maturity.
What is the difference between an edge act bank : What is the difference between an Edge Act bank and an international banking facility?
What is an offshore center : What is an offshore center?
Prepare the journal entry at december : If the bonds in question 8 are classified as available-for sale and they have a fair value at December 31, 2010, of $3,604,000, prepare the journal entry (if any) at December 31, 2010, to record this transaction.
Task is to predict output variable choice based on 16 input : Your task is to predict the output variable "choice" based on 16 input features: x1, x2, ....,x15, x16.The output "choice" is a categorical variable that can take 5 possible values: "M", "B", "J", P", and "O".The first 8 input features (x1, x2, ....,..
Will an mnc issuing debt in low interest rate currencies : Will an MNC issuing debt in low-interest-rate currencies necessarily lower its cost of funds? Why?
Low capacity for exercise : A study of the effects of exercise used rats bred to have high or low capacity for exercise. There were 8 high-capacity and 8 low-capacity rats.
What is the difference between a foreign branch : What is the difference between a foreign branch and a subsidiary bank?
Measure of attachment to friends : One of the response variables was a measure of attachment to friends (roughly, secure relationships), measured by the Inventory of Parent and Peer Attachment.  The results are summarized in the table below.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd