Creating a model to predict the success of NBA teams

Assignment Help Other Subject
Reference no: EM132278178

Excel Data Analysis Capstone Project -

Overview - This project requires that you use the tools learned throughout this portion of the course to create a model for a real world situation - creating a model to predict the success of NBA teams.

A tragedy has happened and the world has lost all copies of Win-Loss data for the 2017-2018 and 2016-2017 seasons for the NBA. All other stats remain, but we can no longer trust the Win and Loss counts anymore. The only solution the world has is to use your advanced analysis skills to predict what those win totals should be; these win predictions will be written into the history books, so we need to be accurate.

This project is to be completed in groups of 3.

Part 1: Collect and Prepare Data

For this model you will need to prepare two sets of data:

  • Source Data - this will contain team statistics including wins and losses for the 2015-2016 and going backwards to earlier seasons. Note: This model must not contain any win-like statistics such as losses, winning percentage, Pythagorean wins/losses, margin of victory, SRS, etc.
  • Subject Data - this will contain the same columns as your Source Data sheet, except the wins - as we're predicting those.

Source Data - The source for our data is Basketball Reference. The exact data used to build your model is something that you must determine - the starting point is to download the Miscellaneous Stats table into Excel.

There are many other tables of team stats available on that page that you can add to your model to potentially improve its accuracy. Once you have the base of your sheet made with the Miscellaneous Stats the other sets of statistics can easily be added. Once you've assembled your sheet with the 2015-2016 data, repeat the process for at least 4 previous seasons, so you have 150+ total rows of data on which to build the model.

You may need to modify column names, or otherwise clean the data to prepare it for analysis. As part of this you will need to create a Playoff column that holds a variable denoting if the team made the playoffs or not.

Note: When you've created the sheet you must remove the Win-proxy stats, some of those are L, PW, PL, MOV, and SRS. There may be others depending on what you've added to your model. Ensure you double check this as inclusion of any win/loss statistics will ruin the accuracy of your model.

Subject Data - To prepare your subject data you must basically redo the same steps in setting up your Source Data for the two most recent full seasons - 2017-2018 and 2016-2017. Also, delete all values from the wins (W) column and the playoff columns as those are our predicted values.

Creating a Predictive Model -

Once your data is prepared you can begin creating your predictive model. Any and all of the tools we looked at in the course are available to you. You may find that as you proceed in building your model that data needs to be added or removed from your initial worksheet. You may also choose to use other techniques such as normalization and partitioning to create a more accurate model.

As you are going through this process you must take note of what method you are using, what changes you make to the model data, and why you are making those decisions. You will need to present both your model, and the reasoning of why you built it as you did and why it is superior to the alternatives that proved to be less accurate. The process of developing your model is the most important part of this process, so ensure you are making logical improvements and documenting the reasoning and impact.

Creating a Classification Model -

In this step you must create a classification model that uses your source data to predict if teams will be in the playoffs or not. Follow the same process as the prediction model, but use classification tools to split the teams into the two groups.

Comparing the Results -

Create a chart that compares your predicted results for both the prediction and classification to the real results for the two seasons that are being predicted. Calculate the margin of error for your prediction model and the successful classification percentage. Include this in your report.

Attachment:- Assignment File.rar

Reference no: EM132278178

Questions Cloud

What is the best way to present your topic : Create a Wiki Presentation page on either a topic of contemporary art or a contemporary artist not covered in class. The page must include relevant text.
What type of planning student do in courses : What type of planning student do in courses to maximize the outcome of the time?
What religious or political factors led to the emergence : Select one movement from the list below, and describe the time period, media (materials), methods, and subjects significant to that movement.
Make the required adjustments to convert table to 3nf : Assuming it is in 1NF, explain why the 2NF table is not in 3NF. Make the required adjustments to convert table to 3NF.
Creating a model to predict the success of NBA teams : Excel Data Analysis Capstone Project - create a model for a real world situation - creating a model to predict the success of NBA teams
Boolean algebra to minimize the logic function : Give the DNF of ~Z. Use laws from Boolean algebra to minimize the logic function.
Discuss issues concerning data acquisition : Discuss issues concerning data acquisition, management, and data sharing.
Why each blog is a good one for rock pic to appear on : In addition to the news media, identifying and reaching out to influential bloggers is a great way to help promote your organization. The goal of an effective.
Calculate the total force exerted on the structure : UFMFP9-15-3 - Mechanics of Materials - Global college of engineering and technology - Calculate the total force exerted on the structure due to the wind

Reviews

len2278178

4/9/2019 3:21:52 AM

Grading Deliverables - The deliverable for this project are: A paper that contains the results of your modeling exercise, the explanation of what you did, and the reasoning behind why your model is the best. Your spreadsheet. Bonus - Apply your models to the current, partially complete season to project wins and playoff inclusion. Do your predicted results align with current winning percentages and conference standings?

len2278178

4/9/2019 3:21:46 AM

Note: When you’ve created the sheet you must remove the Win-proxy stats, some of those are L, PW, PL, MOV, and SRS. There may be others depending on what you’ve added to your model. Ensure you double check this as inclusion of any win/loss statistics will ruin the accuracy of your model. Note: This model must not contain any win-like statistics such as losses, winning percentage, Pythagorean wins/losses, margin of victory, SRS, etc.

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd