Perform a k-nearest neighbors prediction

Assignment Help Basic Statistics
Reference no: EM132397534

Assignment

Written Assignment #2B requires hands-on practice of XLMiner, and you are expected to use XLMiner to mine the Boston Housing data in the file BostonHousing.xls posted in Written Assignment #2B entry in Blackboard

Your task is to run k-nearest neighbors algorithm in XLMiner for both prediction and classification tasks describe below, and submit your answer with your XLMiner execution result files attached in your submission. Since the k-nearest neighbor algorithm can be used for both classification and prediction, there are two menus under XLMiner, Classify and Predict.

The file BostonHousing.xls contains information on over 500 census tracts in Boston, where for each tract 14 variable values are recorded. The last column (CAT.MEDV) was derived from MEDV, such that it obtains the value 1 if MEDV>30 and 0 otherwise. Consider the goal of predicting and classifying the median value (MEDV and CAT.MEDV) of a tract, given the information in the first 13 columns (input variables) in the column list. Partition the data into training (60%) and validation (40%) sets. (For description of the column names in BostonHousing.xls, please make reference to Table 2.2 on page 33 of the textbook)

1. Under Predict menu in XLMiner, perform a k-nearest neighbors prediction with all the predictors from column A (CRIM) to column B (LSTAT) (excluding the CAT.MEDV, the CAT.MEDV column is the outcome or decision variable for classification) for both training data set and validation data set, trying values of k from 1 to 10 to predict the value MEDV. What is the best k chosen? What does it mean? Also attach the execution result file including RMSE (Root Mean Square Errors) in your submission. (you can try run prediction with normalizing data and without normalizing data).

2. Under Classify menu in XLMiner, perform k-nearest neighbors classification with all the predictors from column A (CRIM) to column B (LSTAT) (excluding the MEDV, the MEDV column is the outcome or decision variable for prediction) for both training data set and validation data set, and find the best K for validation data set, trying values of k from 1 to 10 to classify CAT.MEDV (make sure to normalize the data). Also attach the execution result file including confusion matrix, lift chart, and ROC chart in your submission.

Attachment:- Boston Housing.rar

Reference no: EM132397534

Questions Cloud

Produce definition of data visualization : Produce a definition of data visualization. Explain how it caters to the perceptual abilities of humans.
Discussing the safe harbor provisions under hipaa : Write an essay of at least 500 words discussing the Safe Harbor provisions under HIPAA. Write in essay format not in outline, bulleted, numbered or other.
Develop metrics and measure results : In order to have a successful IG program, one of the eight (8) Information Risk Planning and Management step is to develop metrics and measure results.
Discussion pertaining to the key performance indicators : Description regarding the metrics your team will use to measure performance. discussion pertaining to the key performance indicators (KPIs).
Perform a k-nearest neighbors prediction : Perform a k-nearest neighbors prediction with all the predictors from column A to column B for both training data set and validation data set, trying values.
Cloud computing and data forensics : You have been assigned to investigate whether or not employee at local hospital has been accessing patient records.
Essay on hacking manufacturing systems : Write two page single space essay on hacking manufacturing systems. Recent hacks happened for the automotive industry. How to secure their infrastructure
Accessing patient records and setting information : You have been assigned to investigate whether or not an employee at a local hospital has been accessing patient records and setting information
Data analyst capstone course project : Build a machine learning model to test and do prediction and Build a machine learning model and test it with the Test set values dataset

Reviews

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd