Build a predictive model to classify shots as missed or made

Reference no: EM132271207

Assignment - KOBE BRYANT SHOT SELECTION

OVERVIEW: Kobe Bryant marked his retirement from basketball by scoring 60 points in his final game as a member of the Los Angeles Lakers on Wednesday, April 13, 2016. Starting to play professional basketball at the age of 17, Kobe earned the sport's highest accolades throughout his long career. Using 20 years of data on Kobe's shots made and shots missed, can you predict which shots will be successful?

DATA: The original data set contains the location and circumstances of every shot attempted by Bryant during his 20-year career. Your task is to predict whether the basket went in (shot_made_flag = 1) or missed (shot_made_flag = 0). The data for estimation is in Kobe.xlsx.

For this exercise, 5000 of the shot_made_flags have been removed from the original data set and are shown as missing values in the project2Pred.xlsx file. These are the test set shots for which you must submit a classification. You are provided a sample classification file, project2Pred.xlsx, with the shot_ids needed for your predicted classifications. Provide your predicted classifications in this file and submit both your paper and the prediction file. I have the actual values of the shot_made_flag for these missing shot_ids and will evaluate the classifications. Your goal is to provide the best predictions possible.

Each group is on the honor system to not use any information outside of the dataset to predict each of the missing shot flags.

DATA CONTINUED

The field names are given below (Data descriptions are available in Kaggle):

action_type

combined_shot_type

game_event_id

game_id

lat - court location identifier (latitude)

loc_x - court location identifier (x/y axis)

loc_y - court location identifier (x/y axis)

lon - court location identifier (longitude)

minutes_remaining - (in period)

period

playoffs

season 

seconds_remaining

attendance

avgnoisedb - avg noise in arena (decibels)

shot_distance

shot_made_flag (this is what you are predicting)

shot_type

shot_zone_area

shot_zone_basic

shot_zone_range

team_id

team_name

game_date

matchup

opponent

shot_id

arena_temp (°F)

DELIVERABLE: Submit a paper with an 8-page limit and a separate Appendix of up to 5 pages. Code should be in a second appendix and can be as long as necessary. A separate file with predicted classifications should also be submitted.

PAPER REQUIREMENTS -

Introduction

Data Description

Exploratory Data Analysis

  • Address the need for any potential transformations.
  • Address and identify outliers.
  • Address and identify any multicollinearity.

Build models to provide arguments and evidence for or against the propositions below:

  • The odds of Kobe making a shot decrease with respect to the distance he is from the hoop. If there is evidence of this, quantify this relationship. (CIs, plots, etc.).
  • The probability of Kobe making a shot decreases linearly with respect to the distance he is from the hoop. If there is evidence of this, quantify this relationship. (CIs, plots, etc.).
  • The relationship between the distance Kobe is from the basket and the odds of him making the shot is different if they are in the playoffs. Quantify your findings.
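One hedged way to formalize the three propositions above is sketched here. The symbols are assumptions introduced for illustration: d_i is shot_distance, s_i is the playoffs indicator, and p_i is the probability that shot i is made. The models shown are common choices, not necessarily the ones the instructor has in mind.

```latex
% Proposition 1: odds decrease with distance (logistic regression)
\log\frac{p_i}{1-p_i} = \beta_0 + \beta_1 d_i,
\qquad \text{odds ratio per unit of distance} = e^{\beta_1},
\qquad 95\%\ \text{Wald CI: } \exp\!\left(\hat\beta_1 \pm 1.96\,\mathrm{SE}(\hat\beta_1)\right)

% Proposition 2: probability decreases linearly with distance (linear probability model)
p_i = \alpha_0 + \alpha_1 d_i

% Proposition 3: the distance effect on the odds differs in the playoffs (interaction)
\log\frac{p_i}{1-p_i} = \beta_0 + \beta_1 d_i + \beta_2 s_i + \beta_3\, d_i s_i,
\qquad H_0:\ \beta_3 = 0
```

Under the first model, evidence for the proposition is an odds-ratio interval that lies entirely below 1. Under the third, the odds ratio per unit of distance is e^{\beta_1} in the regular season and e^{\beta_1 + \beta_3} in the playoffs, so a confidence interval for \beta_3 quantifies the difference.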

Build a predictive model to classify shots as missed or made. You should produce at least 1 of each type of model:

  • A logistic regression model.
  • A Linear Discriminant Analysis (LDA) model.
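To make the two model types concrete, here is a minimal sketch that fits both using a single predictor (distance) on made-up data. It is written in Python with the standard library only, purely to show the mechanics; the assignment itself calls for SAS, where procedures such as PROC LOGISTIC and PROC DISCRIM would normally be used. The function names and the toy data are hypothetical and not taken from Kobe.xlsx.

```python
import math

def fit_logistic_1d(x, y, iters=25):
    """Fit logit(p) = b0 + b1*x by Newton-Raphson; return (b0, b1, se_b1)."""
    b0, b1 = 0.0, 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            w = p * (1.0 - p)
            g0 += yi - p              # gradient w.r.t. b0
            g1 += (yi - p) * xi       # gradient w.r.t. b1
            h00 += w                  # information matrix entries
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    # Wald standard error of b1 from the inverse information matrix
    # (evaluated at the previous iterate, which matches the final one once converged).
    return b0, b1, math.sqrt(h00 / det)

def fit_lda_1d(x, y):
    """One-predictor LDA: class means, pooled variance, prior of class 1."""
    x0 = [xi for xi, yi in zip(x, y) if yi == 0]
    x1 = [xi for xi, yi in zip(x, y) if yi == 1]
    m0, m1 = sum(x0) / len(x0), sum(x1) / len(x1)
    ss = sum((v - m0) ** 2 for v in x0) + sum((v - m1) ** 2 for v in x1)
    return m0, m1, ss / (len(x) - 2), len(x1) / len(x)

def lda_prob_made(xi, m0, m1, var, prior1):
    """Posterior P(made | x) assuming equal-variance Gaussian classes."""
    slope = (m1 - m0) / var
    intercept = math.log(prior1 / (1.0 - prior1)) - (m1 ** 2 - m0 ** 2) / (2.0 * var)
    return 1.0 / (1.0 + math.exp(-(intercept + slope * xi)))

# Made-up toy data: 10 shots at each distance, with fewer makes farther out.
distances = [2, 8, 14, 20, 26]
made_counts = [7, 6, 5, 4, 3]
x, y = [], []
for d, k in zip(distances, made_counts):
    x += [d] * 10
    y += [1] * k + [0] * (10 - k)

b0, b1, se = fit_logistic_1d(x, y)
ci_low, ci_high = math.exp(b1 - 1.96 * se), math.exp(b1 + 1.96 * se)
print(f"odds ratio per unit distance: {math.exp(b1):.3f} (95% CI {ci_low:.3f} to {ci_high:.3f})")

lda = fit_lda_1d(x, y)
print(f"LDA P(made) at distance 2: {lda_prob_made(2, *lda):.3f}, at 26: {lda_prob_made(26, *lda):.3f}")
```

Both models produce a probability for each shot, so the same cutoff-based and probability-based evaluation measures can be applied to either one. The Newton-Raphson loop here has no safeguard against perfectly separated data, which real software handles with warnings or penalization.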

Evaluation: Compare the competing models using AUC, misclassification rate, sensitivity, specificity, and the objective / loss function. The log loss function of the model should be used to assess the model fit:

$$-\frac{1}{N}\sum_{i=1}^{N}\left[\,y_i \log p_i + (1 - y_i)\log(1 - p_i)\,\right]$$

Here N is the total number of classifications, y_i is the shot_made_flag for shot i, and p_i is the model's predicted probability that shot i was made (so 1 - p_i is the predicted probability that it was missed).
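The evaluation measures named above can be sketched as follows. This is an illustrative Python version using only the standard library (the assignment itself calls for SAS); the function names, the 0.5 cutoff, the clipping constant `eps`, and the four-shot example are assumptions made for the sketch, not part of the assignment.

```python
import math

def log_loss(y, p, eps=1e-15):
    """Mean negative log-likelihood; p[i] is the predicted probability that y[i] == 1."""
    total = 0.0
    for yi, pi in zip(y, p):
        pi = min(max(pi, eps), 1.0 - eps)  # clip so log(0) never occurs
        total += yi * math.log(pi) + (1 - yi) * math.log(1.0 - pi)
    return -total / len(y)

def confusion_rates(y, p, cutoff=0.5):
    """Misclassification rate, sensitivity, and specificity at a probability cutoff."""
    tp = sum(1 for yi, pi in zip(y, p) if yi == 1 and pi >= cutoff)
    fn = sum(1 for yi, pi in zip(y, p) if yi == 1 and pi < cutoff)
    tn = sum(1 for yi, pi in zip(y, p) if yi == 0 and pi < cutoff)
    fp = sum(1 for yi, pi in zip(y, p) if yi == 0 and pi >= cutoff)
    return {
        "misclassification": (fp + fn) / len(y),
        "sensitivity": tp / (tp + fn),   # share of made shots predicted as made
        "specificity": tn / (tn + fp),   # share of missed shots predicted as missed
    }

def auc(y, p):
    """Chance that a random made shot gets a higher score than a random missed shot."""
    pos = [pi for yi, pi in zip(y, p) if yi == 1]
    neg = [pi for yi, pi in zip(y, p) if yi == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))

# Made-up example: four shots with predicted probabilities of being made.
y_true = [1, 0, 1, 0]
p_hat = [0.8, 0.3, 0.4, 0.6]

print("log loss:", round(log_loss(y_true, p_hat), 4))
print("rates:", confusion_rates(y_true, p_hat))
print("AUC:", auc(y_true, p_hat))
```

The pairwise AUC loop is quadratic in the number of shots, which is fine for a sketch but slow on a full career of data; a rank-based formula gives the same value more efficiently. Log loss and AUC use the probabilities directly, while the other three measures depend on the chosen cutoff.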

Note - This assignment is to be done in SAS. All relevant information is in the attached archive files.

Attachment:- Kobe-data file.rar

Attachment:- Assignment Files.rar

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd