Build regression models for predicting house

Assignment Help Basic Statistics
Reference no: EM1374968

1. In this exercise we will be building regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.

The variables contained in the dataset are:

Y: Price. It is measured in thousands of dollars.

X1: Area. It is the floor area of the house measured in thousands of square feet.
X2: Bed. The number of bedrooms of the house.
X3: Bath. The number of bathrooms of the house.
X4: Pool. Indicates whether the house has a swimming pool.

Questions:
(a) Exploratory part.

i) Plot each of the predictors against the response. Plot the predictors against each other. The purpose here is to get a graphical idea of the relationships in the data. Do not include these plots in your report, just provide a brief summary of what you observed.

(b) Simple linear regression.

i) Fit 3 simple linear regression models with area, bed, and bath as the only predictor in each. Report the estimated parameters from the model that you consider to be the most useful in predicting house prices, along with an explanation why you consider that model to be the most useful one.

ii) Assuming that the best single predictor model is area, provide a 99% confidence interval for the mean price for a house area = 2500 square feet.

iii) Assume your neighbors own a house with area = 2500 square feet. Obtain a 99% prediction interval for the selling price of the house if they decided to sell it.

(c) Multiple linear regression.

i) Fit a regression model using all 4 predictor variables. Report the estimated i parameters and interpret the coefficient for the variable Pool.

ii) Suppose your neighbors house actually has area = 2500 square feet, 3 bedrooms, 3 bathrooms, and a pool. What is the predicted selling price for this house? Obtain a 95% prediction interval.

iii) Conduct an ANOVA P-test and interpret the results. Conduct a test to sec if the number of bedrooms a house has is a useful predictor of its price. Interpret the results. Should we include number of bedrooms in a model with the other 3 variables in it?

iv) Return to the model in (1) and use that as the full model. Fit a model without the variables pool and bath and use that as your reduced model. Conduct the F-test to see whether or not pool and bath are useful predictors using the full and reduced model. Interpret.

2. Let X1,. .. , Xn. denote a random sample from a normal distribution with mean μ and variance σ2. The probability density function (pdf) of Xi, i = 1, .. . , n, is given by

1097_Simple linear regression.png

(a) Derive Derive the likelihood and log-likelihood functions.

(b) Show that the arithmetic mean, X', is the maximum likelihood estimator of the unknown mean μ.

(c) Show that the arithmetic mean, X', is a sufficient statistic for the unknown mean μ.
(d) Show that the sufficient statistic from part 2c is distributed as X'~N(μ,σ2/n).

(e) Use the pdf from part 2d to show that the arithmetic mean, X' is the maximum likelihood estimator of the unknown mean μ.

Reference no: EM1374968

Questions Cloud

Term paper for management of strategic operation : You are required to complete a course project that reveals mastery in application of the operations management concepts emphasised in the course. This involves reporting on a specific operation or process for an organisations and the operations ma..
Design and synthesis of continuous time controllers : Design and synthesis of continuous time controllers - A graphical user interface (GUI) is a human-computer interface that uses menus, dialogue box, and button which can be manipulated by a mouse
Design a c# windows phone 8 application : Design and implement a C# Windows Phone 8 application based on the SoundBoard app in the Windows Phone 8 Development for Absolute Beginners textbook.
What is density of this mixture of gases : A mixture of CO2 and N2 gases has an experiment density of 21922 mol/m3 at T=225K and Pressure= 68.759 MPa and Z=1.67661. What is density of this mixture of gases in mol/m3 by Peng-Robinson equation of state? (Mole fraction of Co2= 10.56%)
Build regression models for predicting house : Build regression models for predicting house prices. We will be using data collected on 91 houses in Gainesville, Florida. The dataset contains the selling price of each house and information on four other explanatory variables.
Illustrates what is maximum number of application per hours : Illustrates what is maximum number of application per hours that can be handled by present configuration of process.
Discuss competitive supply and profit maximization : A number of stores offer film expanding as a service to their consumers. Assume that each store that offers this service has a cost function C(q)=50+0.5q+0.08q2 .
Illustrate what is average inventory if costs are minimized : Illustrate what is average inventory if costs are minimized. Suppose that ordering cost is not $20.68 and Cotteler has been ordering 175 units each time an order is placed. For this order policy.
Discuss transportation and logistics management policy : Discuss transportation and logistics management policy. Illustrate what impact does policy (local, state, and/or federal) have on transportation.

Reviews

Write a Review

Basic Statistics Questions & Answers

  Calculating p-value and interpreting its meaning

By using 0.05 level of significance, is there any evidence of difference in mean life of bulbs produced by two kinds of machines?Calculate p-value and interpret its meaning.

  Student population has remained constant during this period

Develop an appropriate forecast model for the given problem whereas the student population has remained constant during this period.

  Obtaining a test statistic and p-value

Obtain a test statistic and p-value. Interpret the results at = .01.

  Finding percent of variation in starting salaries

Based on this sample information find out what percent of variation in starting salaries is described by GPA, and find out the equation that would forecast salary from GPA. Give a scatter plot.

  Determining optimum solution for the problem

Determine the optimum solution for the problem.

  Calculating probability using normally distributed with mean

Suppose the random variable X is normally distributed with mean =50 and standard deviation =7 calculate P(X).

  Illustrate a quantitative assessment of what our charts

Illustrate a quantitative assessment of what our charts would look like and what these new targets would do to our operations.

  Effective time duration of a safe dosage

Researchers at a pharmaceutical company have found that the effective time duration of a safe dosage of a pain relief drug is normally distributed with mean 2 hours and standard deviation 0.3 hour.

  Find the mean and standard deviation

Determine the mean and the standard deviation of the sample.

  Determining the probability of mean annual snowfall

Determine the probability that mean annual snowfall during 40 randomly picked years will exceed 111.8 inches?

  Value of the test statistic

Compute the value of the test statistic (to 2 decimals).

  Difference between i-chart and x-bar chart

Would the method described above be a good way to determine which team was taller (explain why or why not)?

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd