Describe the paths that your tree retains

Assignment Help Applied Statistics

Reference no: EM132349458

Advanced Methods For Analytics Assignment -

You have been given a starting observation number for the Excel spreadsheet Final Exam Data.xlsx. All of the data you will need will be in the 80 rows of observations that begin with your starting observation number. You submission is to be a pdf, and you are to restrict your answers to the space provided.

Challenge 1 - The data in columns B through J involve hypothetical sales of franchised auto dealerships nationwide. The variables are:

1) PRICE (the sales price, in $10⁵, of the dealership);

2) SALES (the dealership's most recent annual sales, in $10⁶);

3) AGE (the age, in months, of the dealership);

4) UNITS (the dealership's most recent unit sales);

5) ACREAGE (the footprint, in acres, of the dealership); 5) BLDG (the footprint, in 10³ ft²of the dealership's building(s));

6) COMPS (the number of franchised dealership competitors in the dealership's market;

7) COBRNDS (the number of other franchised dealerships in that market owned by the dealership's owners); and 8) WITHIN (the number of franchised dealerships located within 3 miles of the dealership).

Use the first 60 of your observations for your training sample and the next 20 as your validation sample. With the former use your regression and regression-model-building skills to estimate a "good" (by your standards) model to predict the sales price of a dealership. Evaluate that model using your validation sample. In the space provided, report the steps you took and the conclusions you arrived at, as well as your assessment of your model's performance.

Challenge 2 - A state is considering an overhaul of its restaurant health-inspection protocol. The data in columns K through O resulted from inspections done ("Pass" or "Fail" using the proposed protocol) on a large number of of small (seating cap. < 50) restaurants. These columns also include: 1) EXPER (the number of years of experience of that restaurant's general manager; 2) AGE (the number of years that particular restaurant has been in that particular location; 3) CHAIN (whether that restaurant is part of a chain; 1 = Yes); and 4) REGION (the region (A, B or C) in the state of that location).

A) Using your first 60 observations as a training sample, formulate (and summarize) a model that would allow you to predict whether a particular location will pass the inspection. How do those predictors that you use in your model influence the likelihood of a location's passing?

B) Using your fitted prediction model, estimate the likelihood of your 20 held-out restaurants passing the inspection (use this rule ... if the estimated probability of passing is less than or equal to 0.45, forecast that restaurant as a "fail"; if the estimated probability is greater than or equal to 0.55, forecast that restaurant as a "pass"). Summarize how well this fitted model works with a 2 x 2 table.

Challenge 3 - Use the data in your training sample from the previous challenge to formulate a classification tree (pruned according to the "minimum xerror" rule). Describe the "paths" that your tree retains (e.g., "If Age ≤ 12 and Exper > 4 then "PASS"). In a simple 2 x 2 table, report how well your classification tree does in predicting the pass/fail question in your validation sample.

Challenge 4 - The data in column P are a quarterly time series depicting unique visitors to a health care website over a 20-year period. For this challenge, use the first 72 periods as your training sample and the last 8 periods as your validation sample. If you had used multiplicative Loess decomposition to forecast those last 8 quarters, what would the correlation between your forecasts and the actual values have been? What would it have been had you used an ARIMA(p,d,q)(P,D,Q)₄ model?

Challenge 5 - A consultant for the mortgage lending industry has developed a new technique for assessing the risk involved in lending to those with less-than-stellar credit profiles. The technique involves using data-mining techniques to create a Personal Financial Responsibility (PFR) index. To evaluate the predictive power of this index, the consultant selected a random sample of borrowers and evaluated their risk using PFR. The consultant then compared borrowers' PFR to their mortgage payment performance (a rating based on a variety of factors with a range of 0 to 200). Data from the sample were as follows:

PFR	Payment Performance
89	188
99	140
90	108
31	29
54	82
60	49
52	20
65	64
32	41
67	31

A) Conduct an appropriate hypothesis test to evaluate whether these data provide sufficient evidence regarding PFR's usefulness as a predictor of mortgage payment performance.

B) If you conclude PFR is a useful predictor of mortgage payment performance, how effective is it?

C) Construct a 95% confidence interval for the mean Payment Performance given a PFR score of 58.

D) Construct a 95% prediction interval for an individual's Payment Performance given his/her PFR score is 58.

Attachment:- Advanced Methods For Analytics Assignment & Data File.rar

Reference no: EM132349458

Questions Cloud

Explain the differences in social norm : How do individuals become involved with social norm groups and How do sociologists explain the differences in social norm?

What did you learn in course that surprised or inspired you : What did you learn in this course that surprised or inspired you? What will you take from this course that you will use in your current and/or future profession

Practical manner to your current work environment : How the knowledge, skills, or theories of this course have been applied, or could be applied, in a practical manner to your current work environment.

Discuss the issues of credentialism : Identify true motivation for initially attending college, and discuss the issues of credentialism and the bestowal of status as institutionalized factors

Describe the paths that your tree retains : BSTAT 5325 Advanced Methods For Analytics Assignment, The University of Texas at Arlington, USA. Describe the paths that your tree retains

What are some things that could happen to a student : Will social control on at a university on campus stricter or more lenient than social control in other parts of society?

Process improvement project : Process Improvement Project-Complete the interactive "Stakeholder Analysis: Winning Support for Your Projects," located on the Mind Tools website.

What are some social norms that could be expected : What are some social norms that could be expected by an individual to conform to when a person became a student at their college or university?

Student at college or a university : What social norms were someone is expected to conform to when that individual became a student at college or a university?

Reviews

len2349458

7/31/2019 3:11:00 AM

I have attached the data set and question and guidelines. Place every observation in a separate spreadsheet and Rstudio or R is what we used in class. You have been given a starting observation number for the Excel spreadsheet Final Exam Data.xlsx (posted in Canvas). All of the data you will need will be in the 80 rows of observations that begin with your starting observation number. You submission is to be a pdf, and you are to restrict your answers to the space provided.

Write a Review

Required(*) Message

User Account

All Pages