Develop a number of classification models

Assignment Help Data Structure & Algorithms
Reference no: EM13878691

A major retail chain that specialises in electronics goods has hired you as their Data Scientist. Over the past 12 months they have embarked on the rollout of a Loyalty/Affinity card for their customers.

The retail chain would like to complete the roll out of the Loyalty/Affinity card to the remainder of their stores and their customers. You can been assigned to the project and your roll is to build a predictive model that can be used to determine, from their database of customers, who are most likely to join their Loyalty/Affinity card.

You have been provided with a file containing the customers who have been involved in the Loyalty/Affinity card project so far. The data set contains both the customers who have joined the Loyalty/Affinity card and those that can not joined.

The target variable in the data set that indicates if the customer has taken up a Loyalty/Affinity card is the AFFINITY.

You will need to load the customer file, which is in CSV format, into your SAS Enterprise Miner workspace. From there you will analyse the data, develop a number of predictive models, evaluate these models to determine which one gives the best results and then to make your recommendations.

Data Description

Electronics Data 
Name   Description 
CUST_ID    Unique identifier of each customer 
CUST_GENDER    The gender of the customer  M  or  F 
AGE    The current Age of the customer. You can assume this is correct and up to date 
CUST_MARITAL_STATUS    Marital Status of the customer  
COUNTRY_NAME    The country where the customer lives 
CUST_INCOME_LEVEL    The salary range for the customer 
EDUCATION    The highest level of education the customer has completed 
OCCUPATION    The current occupation category for the customer 
HOUSEHOLD_SIZE    The number of people in the household of the customer. This number includes the customer 
YRS_RESIDENCE    How long the customer has lived at their current residence 
AFFINITY_CARD    Target Variable. 0 = no affinity card,  1 = has taken an affinity card 
BULK_PACK_DISKETTES    Indicator for this item purchased.  0 = no purchase,  1 = purchased 
FLAT_PANEL_MONITOR    Indicator for this item purchased.  0 = no purchase,  1 = purchased 
HOME_THEATER_PACKAGE    Indicator for this item purchased.  0 = no purchase,  1 = purchased 
BOOKKEEPING_APPLICATION    Indicator for this item purchased.  0 = no purchase,  1 = purchased 
PRINTER_SUPPLIES    Indicator for this item purchased.  0 = no purchase,  1 = purchased 
OS_DOC_SET_KANJI    Indicator for this item purchased.  0 = no purchase,  1 = purchased 
Y_BOX_GAMES    Indicator for this item purchased.  0 = no purchase,  1 = purchased

See the separate instructions on the Notes webpage for how you can load external data into your account on the SAS OnDemand server.

Required Tasks

You are required to produce a report (following the CRISP-DM report, as much as possible) detailing your work investigating the data and classifying the provided data.

The first task you should complete is a data investigation exercise, where you will document the characteristics and other information that you can determine about each Feature.

You will need to work through/develop a number of classification models. To do this you need to use the data mining tool used in class. In this tool you can have a number of different classification techniques and within each of these you can modify the various parameter settings.

You will need to develop a number of classification models. When you have developed all of your models (using the appropriate classification techniques available in the tool), you will have to evaluate them and identify the classification model and configuration that gives the best or most appropriate answer.

Attachment:- Assignment_Data.csv

Reference no: EM13878691

Questions Cloud

Examine the influence of budgetary processes and styles : Examine the influence of budgetary processes and styles on creativity within an organisation and explain the associated consequences for strategic management.
The coupon rate on the limbaugh bonds is equal : Liddy Products, Inc. just issued 10-year, 8% coupon bonds at par. Outstanding Limbaugh Corp. bonds, which have a maturity of 10 years, sell at a premium to par and are viewed by investors as having the same risk as the Liddy bonds. Therefore, it must..
Name the objectives of an effective accounting system : Name the objectives of an effective accounting system.
Why anyone take trouble to study the history of management : Why would anyone take the trouble to study the history of management accounting? Identify the benefits and beneficiaries of such a study, examining critically the work already carried out in this area.
Develop a number of classification models : First task you should complete is a data investigation exercise, where you will document the characteristics and other information that you can determine about each Feature.
Describe what is meant by the voltage gain : Questions 1. Describe what is meant by the voltage gain of an amplifier 2, What is the primary purpose of a coupling capacitor ub transistor amplifiers?
Describe three popular program manipulation techniques : Describe three popular program manipulation techniques.
A solar thermal plant with energy storage capability : A solar thermal plant with energy storage capability is to be constructed at about a 40° latitudeto provide the electric energy to a system whose 24-hr average load is 100 MWe during the winter.Assume that the solar thermal power plant will have a 25..
Distinguish between internal and external sources : Distinguish between internal and external sources of computer fraud.

Reviews

Write a Review

Data Structure & Algorithms Questions & Answers

  Huffmancodes

You will turn in one file: HuffmanCodes.java, which can encode and decode files using Huffman codes. The program has the following command-line interface:

  Write a c++ program to find the intersection

Write a C++ program to find the intersection, A set is a collection of distinct entities regarded as a unit, being either individually specified or (more usually) satisfying specified conditions.

  Create the shoutbox class for your virtual world

Create the ShoutBox class for your Virtual World. Your ShoutBox class will have two methods - initialize your data structures with words or have the user enter the words

  Perform an insertion sort on the file pointed

Using only the local data already supplied in FileSort, perform an insertion sort on the file pointed to by fd. Use lseeks for this; do not try to create any sort of array or list. An array-based version of insertion is supplied for your reference.

  Pseudocode program

Write a program that inputs the first name, middle initial (without the period), last name, age, salary and sex of new employee. Then displays that person's name with the first name first, middle initial followed by a period, and last name then ag..

  Create a program using c++

Create a program using C++ or Java that will automatically generate x numbers between a range provided by the user? For purposes of this project, the range can be 1-20 and store them in an array.

  Difference between a problem and an opportunity

What was the problems and/or opportunities facing Delta in late 1997? What is the difference between a problem and an opportunity

  Computing randomized quick sort-s running time

Suppose that all element values are equal. What would be randomized quick sort's running time in this case? Each element of A[p .. q-1] is less than A[q], and each element of A[t+1 .. r] is greater than A[q]

  Is an array that is in reverse sorted order a heap

What is the expectation of the index of the maximum element in the array and expectation of the index of the minimum element in the array?

  Need algorithim showing a home maintenance project

Need Algorithim showing a home Maintenance Project. Problem statementHigh-level view of the program solutionFunction and internal structure of each program module

  Design a linked list structure

Design a linked list structure Music that contains data fields Name, Artist, Number_of_Songs, and a pointer to the list. Design the structure with three members and fill in data for each member.

  Give an algorithm that returns the position

Give an algorithm that returns true if a string contains properly nested and balanced parentheses, and false if otherwise. Hint: At no time while scanning a legal string from left to right will you have encountered more right parentheses than left..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd