Create two learning curves of the out of sample auc

Assignment Help Other Subject
Reference no: EM131408471

1) Consider again the churn dataset. Create two learning curves (using WEKA) of the out of sample AUC on the test set (churn_test.arfff) using both logistic regression and the decision tree J48 (just go with the default settings). In particular, starting from the full training set, after each iteration, reduce the training set to half until you reach less than 100 examples. Provide a plot with both curves (copy the data into EXCEL and create the charts) .

• You can cut the dataset in half easily in Weka. In the Preprocess tab, in the box marked Filter, click on Choose. Under weka->filters->unsupervised->instance you will see RemovePercentage. (Normally, it is a good idea first to run the filter Randomize, to make sure that you are removing the data randomly; real data often will be sorted based on some attribute, which can result in throwing away many data items with similar values. Don't Randomize for this assignment; the data for this assignment already will be randomized.)

• The Undo button on the preprocess tab will undo the preprocessing (like Randomizing, RemovePercentage, etc.). Keep an eye on the data statistics (like the number of instances) in the preprocess tab to verify.

2) Create a fitting curve of the generalization AUC for decision trees as a function of the MinNumObj parameter. First change the option ‘unpruned' to ‘true'. Provide a plot of the parameter and the resulting out of sample performance using either cross validation or a training/test split. What does the parameter do? What is the optimal selection for the parameter?

3) Repeat the same experiment as in step 1, but setting minnumObj=100 and unpruned=TRUE. How does the learning curve of the decision tree change? What do you infer from this result?

Attachment:- Assignment.rar

Reference no: EM131408471

Questions Cloud

What is included in nike''s balance sheet cash account : Compute the change in NIKE's current ratio and working capital from 2008 to 2009. Which accounts are the most important in explaining that change?
Implement to insure compliance : 1. Outline and briefly explain three action items you would recommend the company implement to insure compliance with both the Ontario Human Rights Code and the Employment Equity Act.
Problem regarding the cost of living : The City of St. Albans has a unionized police force that is coming up for a contract renewal. The police have one issue: the cost of living increases. During the past 10 years, police officers have received minimal cost of living increases, and th..
What trends have affected malls : This video describes the problems of suburban regional and superregional shopping centers. while malls were attractive for 50 years, they have fallen out of favor with many shoppers, leaving shopping center developers with significant challeng..
Create two learning curves of the out of sample auc : Create two learning curves of the out of sample AUC on the test set using both logistic regression and the decision tree J48 (just go with the default settings). In particular, starting from the full training set, after each iteration, reduce t..
Discuss the factors that influence internal pay structures : 1) Discuss the factors that influence internal pay structures. Based on your own experiences, which ones do you think are the most important? Why?
What would be ge’s 2008 inventory balance : What would be GE's 2008 inventory balance if it used the FIFO assumption instead? Why is the disclosure of the LIFO reserve useful to financial statement users?
Public perception of an unethical organization : How do you think unethical behavior affects employee productivity and morale? How about public perception of an unethical organization?
Compute the inventory purchases made by hp : In its 2008 annual report, Hewlett-Packard reported beginning inventory of $8.0 billion, ending inventory of $7.9 billion on the balance sheet, and cost of goods sold of $69.3 billion on the income statement. Compute the inventory purchases made b..

Reviews

Write a Review

Other Subject Questions & Answers

  Attribute samplint plans

Consider the following two attribute samplint plans

  Influence of alcohol and possession of marijuana

Tony was a 16-year-old juvenile who was picked up for driving under the influence of alcohol and possession of marijuana. He was evaluated and taken to juvenile detention.

  Summarize role that the atmosphere conditions of wind speed

Summarize the role that the atmosphere conditions of wind speed, temperature, and stability potentially impact plume modeling activities with a Gaussian model.

  Domestic terrorism report

Address specifically the human toll, economic effect, and public policy changes that have resulted from domestic terrorist events within the United States during that period.

  Delivery outline from causes of poverty

Write a one page delivery outline from an outline format based on causes of poverty

  Which of thenbspsources and forms of power do you see on

think about the sources and forms of power you see around you or may have working for you in your life. below are

  Lan architecture-lan technology-lan interconnection

This quiz covers LAN architecture, LAN technology (including Wireless) and LAN Interconnection (Chapters 9, 10 and 11). Because of this expanded scope, the quiz is about twice as long as the previous quizzes.

  The seattle longitudinal study concluded

The Seattle Longitudinal Study concluded that middle age is a time of:

  Kant believed it is possible to be motivated

According to Landau, Kant believed it is possible to be motivated ...

  Why epidemiology and disease control should be studied

On the basis of your views and current knowledge of health care and treatment in the United States and globally, discuss why epidemiology and disease control should be studied as complements to the provision of health care services.

  Discuss services available to the older person in ireland

The Learner is to complete a Project discussing the issues relating to the Care of the Older Person. Explore the needs of the care staff who work with Older People. Discuss services available to the Older Person in Ireland

  Create presentation that explain key point related to health

For Part 1 of this assessment, you will create a PowerPoint presentation that explains key points related to the health care policy you selected.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd