Provide a summary of the comparison

Assignment Help Computer Engineering
Reference no: EM131474851

Data Mining - Practical: Using Naïve Bayes Classifier for Classification

This is a combined practical and exercise session, in which we have two tasks. First, we use Weka to run two different classifiers (Naïve Bayes and J48) and compare their performances. Second, we use Weka to help us to calculate and decide whether someone is likely to get sunburned or not as Naïve Bayes Classifier does. This will help us to familiarise ourselves with the theories on Naïve Bayes Classifier we have learned in today's lecture.

Step 1:Launching Weka and Loading Training Data

Launch Weka by clicking on: RunWeka.bat

Select ‘Explorer' from the list of Applications.

Select the ‘Preprocess' tab and click on ‘Open File'. Choose the file ‘sunburn.arff' which contains the training data set.

Step 2: Exploring Naïve Bayes Classifier

Select the ‘Classify' tab and make sure that "NaiveBayes" is chosen from the classifier list and "Use training set" is clicked as the test option. Note that attribute "burned" is chosen by default as the class attribute but the class attribute can be changed if needed.

Click ‘Start'will create a classification model/classifier from the training dataset. The classifier is listed in the Result list while the details about the classifier are displayed in the ‘Classifier output' window.

Step 3: Comparing Naïve Bayes Classifier with Decision Tree-based Classifier J48 Using 10-fold Cross-validation

First, you can easily run Naïve Bayes Classifier again with a different evaluation method. Select the "cross-validation" test option with 10 folds as default and click Start again. The classifier output is quickly replaced to show how well the learned model performs on the cross-validation.

Second, you can also easily run J48 with 10-fold cross-validation. Select the ‘Classify' tab and make sure that "J48" is chosen from the classifier list and Select the "cross-validation" test option with 10 folds as default and click Start again. The classifier output is quickly replaced to show how well J48 performs on the cross-validation.

Finally, compare the two sets of evaluation results.

Q1. Provide a summary of the comparison between the classifier outputs by Naïve Bayes Classifier and J48.

Step 4: Discovering Naïve Bayes Classifier

We now focus on discovering how the Naïve Bayes Classifier produces the output. We have just covered the Naïve Bayes Classifier in today's lecture. We now want to familiarise ourselves with the theories that we have learned in today's lecture by working through an example using Weka.

Select the ‘Preprocess' tab and go back to examine the training data set. Click on attribute burned in the attributes window on the left and you will see that the class attribute has two class labels "burned" and ‘none".

Q2. Calculate the probability estimates of these two class labels, i.e., P(burned = "burned") and P(burned = "none") based on their class distributions. Click on attribute burned in the attributes window will display the class distribution, i.e., 9/16 with class label "burned" while 7/16 with class label "none"

P(burned = "burned") =
P(burned = "none") =

Click "Edit" to view the sunburn relation table that shows the value distribution of each attribute given a class label. Click on attribute burned will sort the instances in the table in alphabetic order. For example, among the 9 instances with class attribute value "burned", 2/9 has hair = "red".

Q3. Calculate the conditional probability of each attribute on each class, e.g., P(hair = "blonde"|burned = "burned"). Note that you would have 6 + 6 + 6 + 4 = 22 conditional probabilities to calculate.

P(hair = "blonde"|burned = "burned") =
P(hair = "brown"|burned = "burned") =
P(hair = "red"|burned = "burned") =
P(hair = "blonde"|burned = "none") =
P(hair = "brown"|burned = "none") =
P(hair = "red"|burned = "none") =
...

Q4. Calculate the classification probability of each class, given hair = "blonde", height = "short", weight = "average", lotion = "none".

P(burned = "burned"|hair = "blonde", height = "short", weight = "average", lotion = "none)
=

P(burned = "none"|hair = "blonde", height = "short", weight = "average", lotion = "none)
=

Q5. Is the given person likely to get sunburned based on the prediction by Naïve Bayes Classifier?

Attachment:- Practical2.rar

Reference no: EM131474851

Questions Cloud

Write a high-level program for this computation using a fork : Consider the following computation: ci = ai*bi + ci*di where, i = 1 to N. Write a high-level program for this computation using a fork/join.
Employee referrals for organizational recruitment : Employee referrals for organizational recruitment, paid or unpaid, can be an effective tool for recruitment.
Operate in its general and specific environments : identify the factors that operate in its general and specific environments.
What is distributional interest : What is distributional interest?
Provide a summary of the comparison : Provide a summary of the comparison between the classifier outputs by Naïve Bayes Classifier and J48 - Calculate the probability estimates of these two class
Marginal revenue curves of a competitive firm identical : Why does the firm in pure competition operate in the rising portion of its marginal cost curve?
Explain evaluation of alternatives and strategic choice : Identify that how Channel power & leadership can be an investment in profitability. Explain evaluation of alternatives and strategic choice?
Estimate the computation time for the program : Estimate the computation time for the program in the above problem assuming the following time characteristics.
Competitor advantages take place : How is game theory applied when competitor advantages take place? In other words, how do you implement game theory into the equation of competition?

Reviews

Write a Review

Computer Engineering Questions & Answers

  What is a data type in c

What is the difference between a constant and the variable in a program? Why would you use one or the other.

  How to create a class that player guess anumber

make a class that player guess anumber, the application generates a random number, and player determine whether they are correct. Display a message indicating whether the players guess was correct, too high or too low.

  Utilize routers for connectivity devices instead of hubs

Utilize routers for connectivity devices instead of hubs

  What is generally not tracked in change management database

What is generally not tracked in a change management database? When performing a security assessment, using a framework such as NIST 800-15, which is generally the first step?

  Find the mean height of the basketball players

Following are heights, in inches, for a sample of college basketball players. Find the mean height of the basketball players.

  What might have happened if the bug had not been caught

My first programming job in 1977 was to convert a set of application systems from one version of COBOL to another version of COBOL for the government of Prince Edward Island. What might have happened if this bug hadn't been caught and all gas tax r..

  Describe the program development cycle

Describe the importance of using a structured, modular approach when creating program requirements, design, and code.

  Utilization of authentication

In the 802.11i, authentication always utilizes the SSL/TLS, Outer, Inner, Both A and B, Neither A nor B.

  The application must calculate the average

make a Java Application that will ask the user for his or her name and to enter in three numbers. The application should calculate the average and then output the value to the user.

  Write an user defined function to print all the prime number

Write an user defined function to print all the prime numbers within a range.

  How the data gets sent from a computer or gaming system

In a 1-2 page paper, discuss how the data gets sent from a computer or gaming system to the computer/gaming system of all of the people with whom one is playing

  Title page two body pages ref pageaddress the following in

title page two body pages ref page nbspnbspaddress the following in your paperis the victim selection process different

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd