Model performance using statistical significance testing

Assignment Help Business Management
Reference no: EM133889537

Part I: Algorithm Comparison

1. Investigate the difference in model performance using statistical significance testing. We will compare three models (decision tree J48, 5-NearestNeighbor and OneR) on two different data sets (diabetes.arff and breast-cancer.arff), and perform a pairwise comparison of the models on each data set (total of six paired experiments separately or run everything at the same time).

2. Choose 10 folds cross-validation as your experiment type and repeat 5 times on each pair.

3. For both data sets, compare the performance of all three algorithms using a paired t-test. For each model, describe parameter settings/design decisions you make in acquiring your data (so that your experiments are replicable).

4. You can collect accuracy estimates using the Experimenter in WEKA, dumping the results into a CSV file and picking the appropriate column of the file. You will need to implement the paired permutation test yourself.

5. Does any one of the algorithms work significantly differently on either one of the two datasets from another algorithm? Report your findings. You should use screenshots, calculations, and analysis to support your conclusions.

6. For each pair of algorithms that you find to perform significantly differently, calculate the p-value of the paired t-test to support your finding.

Part II: Cost Analysis

In this part you will replicate my cost-analysis demonstration in class using the dataset "breastcancer.arff". First, generate the classifier output using J48. Next, make changes to the weights associated with certain types of errors based on the following rules and run your J48 classifier again.

a. Cost of "recurrence events" being wrongly classified as "no recurrence events" is 4

b. Cost of "no recurrence events" being wrongly classified as "recurrence events" remain 1

Questions:

1. Show me the total cost before and after you apply the cost weights.

2. Show me the confusion matrix before and after you apply the cost weights.

3. If you further change the cost of "recurrence events" being wrongly classified as "no recurrence events" to 10, how will the algorithm be affected? Is the algorithm still practical? Why or why not?

Reference no: EM133889537

Questions Cloud

Trash can is located in vicinity of scene of the crown : It in the trash can is located in vicinity of the scene of the crown. Jamie's mother testified that Jamie was at the home, was at home at the time of crime
Identify where various professionals might hold differing : Identify where various professionals might hold differing views about intervention and explain how you might approach advocating for the client?
Defence of duress and defence of necessity : What is the difference between the defence of duress and the defence of necessity? Briefly explain two criticisms of constructive murder.
How do these stories rework the gothic castle to fit setting : How do these stories rework the Gothic castle to fit the settings of their stories? How do Poe and Gaskell use their settings to create suspense?
Model performance using statistical significance testing : Investigate the difference in model performance using statistical significance testing. compare the performance of all three algorithms using a paired t-test.
What are the four central frames of color-blind racism : What are the four central frames of color-blind racism? Also, describe the notion of "life chances" as it is discussed in this essay.
Sign arbitration agreements as condition of employment : Under what circumstances, if any, is it appropriate for employers to require employees to sign arbitration agreements as a condition of employment?
What is trick or tool that you have found that really helps : What do you find yourself having the most problems with grammar? What is one trick or tool that you have found that really helps you write more "correctly"?
How satisfied are you with your two documents : How satisfied are you with your two documents? Do you still see any room for improvement? What did you find most challenging about this assignment?

Reviews

Write a Review

Business Management Questions & Answers

  Caselet on michael porter’s value chain management

The assignment in management is a two part assignment dealing 1.Theory of function of management. 2. Operations and Controlling.

  Mountain man brewing company

Mountain Man Brewing, a family owned business where Chris Prangel, the son of the president joins. Due to increase in the preference for light beer drinkers, Chris Prangel wants to introduce light beer version in Mountain Man. An analysis into the la..

  Mountain man brewing company

Mountain Man Brewing, a family owned business where Chris Prangel, the son of the president joins. An analysis into the launch of Mountain Man Light over the present Mountain Man Lager.

  Analysis of the case using the doing ethics technique

Analysis of the case using the Doing Ethics Technique (DET). Analysis of the ethical issue(s) from the perspective of an ICT professional, using the ACS Code of  Conduct and properly relating clauses from the ACS Code of Conduct to the ethical issue.

  Affiliations and partnerships

Affiliations and partnerships are frequently used to reach a larger local audience? Which options stand to avail for the Hotel manager and what problems do these pose.

  Innovation-friendly regulations

What influence (if any) can organizations exercise to encourage ‘innovation-friendly' regulations?

  Effect of regional and corporate cultural issues

Present your findings as a group powerpoint with an audio file. In addition individually write up your own conclusions as to the effects of regional cultural issues on the corporate organisational culture of this multinational company as it conducts ..

  Structure of business plan

This assignment shows a structure of business plan. The task is to write a business plane about a Diet Shop.

  Identify the purposes of different types of organisations

Identify the purposes of different types of organisations.

  Entrepreneur case study for analysis

Entrepreneur Case Study for Analysis. Analyze Robin Wolaner's suitability to be an entrepreneur

  Forecasting and business analysis

This problem requires you to apply your cross-sectional analysis skills to a real cross-sectional data set with the goal of answering a specific research question.

  Educational instructional leadership

Prepare a major handout on the key principles of instructional leadership

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd