Examine the various predictor variables

Assignment Help Applied Statistics
Reference no: EM132343841

Assignment -

Answer the following questions. Answers should be uploaded in a neat, easy-to-read Word document. Move all graphs, charts, and tables to the single document. Do not upload spreadsheets. Be sure to read this week's written lecture for links and other helpful information. Necessary datasets are linked. These questions ask you to explain, describe, or outline something in addition to the program output. The essay parts count for about 50 percent of the points in your answer so be sure and include well-considered, detailed explanation and discussion in your own words. Use APA style references and citations if needed. Copying and pasting or similar plagiarism/cheating will result in zero points on the entire assignment. These questions are from Chapter 6, Shmueli, Bruce, and Patel.

1. The file BostonHousing (See attached) contains census information concerning housing in Boston, MA. The dataset has information on 506 housing tracts. The dataset contains 12 predictor variables and one outcome variable, MEDV, median house price. See the text for a table containing variable descriptions or run Analytic Solver (XLMiner) or similar software to view the data description. The following questions refer to this dataset.

2. Why is the data partitioned into training and validation sets as part of the data mining process? What is the purpose of each?

3. Fit a multiple linear regression model to the median house price as a function of CRIM, CHAS and RM using Solver or SPSS Modeler. Use the coefficient table in the output to write the linear equation predicting the median house price.

4. Examine the various predictor variables. Which predictors are likely to be measuring the same thing? Discuss the relationships among INDUS, NOX, and TAX.

5. Compute the correlation table for the numerical predictors and look for highly correlated pairs. These could cause multicollinearity. Which ones should be removed?

6. Use exhaustive search to reduce the remaining predictors. Choose the top three models. Run each on the training set and compare their accuracy for the validation set. Compare RMSE, average error, and lift charts. Describe the best model.

Attachment:- Assignment & Data Files.rar

Reference no: EM132343841

Questions Cloud

What are you most proud of about your cultural heritage : What are you most proud of about your cultural heritage and why? How might media coverage affect the public's perception of your culture?
What is the market-implied growth rate : What is the market-implied growth rate, g, of a stock with the following parameters. Dividend payment forecasted for next year is $3.8.
Be sure to explain all sides of the ethical dilemma : Explain an ethical issue involving a child or adolescent in the context of the illness. Be sure to explain all sides of the ethical dilemma.
Identify an area of hr practice for investigation : Summarise the stages of the research process and compare different data collection methods.Identify an area of HR practice for investigation.
Examine the various predictor variables : Examine the various predictor variables. Which predictors are likely to be measuring the same thing? Discuss the relationships among INDUS, NOX, and TAX
What is its estimated price per share today : The required rate of return that investors demand to hold AB Corp.'s stock is 8% What is its estimated price per share today?
How much should you pay for this stock : The dividend is expected to decrease by 3.6% each year forever. How much should you pay for this stock today if your required return is 20%?
The building blocks of culture : A discussion of different building blocks and how you saw them exhibited.Reflection on your reaction towards this different culture and what helped you to adapt
What is the company price per share : The company has 20 million shares outstanding. Using this information and a WACC of 12.5%, what is the company's price per share?(in $millions).

Reviews

Write a Review

Applied Statistics Questions & Answers

  Hypothesis testing

What assumptions about the number of pedestrians passing the location in an hour are necessary for your hypothesis test to be valid?

  Calculate the maximum reduction in the standard deviation

Calculate the maximum reduction in the standard deviation

  Calculate the expected value, variance, and standard deviati

Calculate the expected value, variance, and standard deviation of the total income

  Determine the impact of social media use on student learning

Research paper examines determine the impact of social media use on student learning.

  Unemployment survey

Find a statistics study on Unemployment and explain the five-step process of the study.

  Statistical studies

Locate the original poll, summarize the poling procedure (background on how information was gathered), the sample surveyed.

  Evaluate the expected value of the total number of sales

Evaluate the expected value of the total number of sales

  Statistic project

Identify sample, population, sampling frame (if applicable), and response rate (if applicable). Describe sampling technique (if applicable) or experimental design

  Simple data analysis and comparison

Write a report on simple data analysis and comparison.

  Analyze the processed data in statistical survey

Analyze the processed data in Statistical survey.

  What is the probability

Find the probability of given case.

  Frequency distribution

Accepting Manipulation or Manipulating

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd