Build a multiple regression model from your data

Assignment Help Basic Statistics
Reference no: EM131068431

PROJECT-

For this Course Project you will collect data, perform preliminary data analysis, build and analyze a model, and use the results of your analysis to make predictions, draw conclusions, and support decisions.

The Project will be conducted in three phases:

Phase I:  Collect data and describe your data set. Please include: a description of what the data is, how it was collected (if known), type of variables (categorical/ continuous), unit of analysis, and business scenario.

Phase II:  Perform preliminary analysis of your data, using descriptive statistics.  Please include: central tendencies, variability, normality, and a visual representation for each variable as well as correlations between variables, and your preliminary thoughts on which variables will be included in the regression model (i.e. which are independent variables and which is the dependent variable; these can and probably will change!).  IF you would like to include regression analysis for me to look at then I will give you feedback.

The first two phases will be graded based on a satisfactory submission.  If the first submission is on-time and satisfactory, then full credit will be awarded.  In the case of an unsatisfactory submission five points may be deducted for each required re-do.

Phase III:  Build a multiple regression model from your data, and prepare a business report that includes all of your previous work, and that presents a recommendation to a decision-maker based on your model and analysis.

PROJECT DETAILS

Phase I, Data Collection

You may collect your data from (almost) any source(s).  The objective is to include a numerical response (dependent) variable that can be predicted from some number of other (independent) variables.  These data do not have to come from the same source, but should be compatible as data sets.  Data should be cross-sectional (no time-series data).

The minimum requirement is 50 observations with ten independent variables. The requirement is to include a numerical response (dependent) variable that can be predicted from some other (independent) variables. Numerical dependent variables are better, but up to 3 may be categorical (max 3 categories) or Binary. These data do not have to come from the same source, but should be compatible as data sets (i.e., if your response is a monthly result over a ten-year period, your other data should cover the same time period and increments). The minimum requirement is 50 observations (50 countries, 50 companies, 50 counties, whatever) with ten independent variables and one dependent variable (11 overall). It is best not to have one the variables at 50 points in time, unless the points in time are quite close to each other. It would be better to have one or a few points in time with lots of observations at that time. [Beware the tautology:  do not collect temperature and humidity to "predict" the heat index!]  Ensure that your data set will allow you to draw relevant conclusions about something that matters. The data may be from any field (preferably business-related) and should be collected so that you can establish relationships among your data to support some sort of a conclusion or recommendation. Please explain your planned business scenario - i.e. who would need to predict this DV and what would they use results for?

The submission will be in the form of an Excel file submitted in Canvas with a summary of what it is and where it came from.

Phase II, Preliminary Data Analysis

Apply descriptive statistics to your data set.  This can include graphical depictions as well as some basic calculated statistics.  Since you will be building a multivariate model, the correlations between your independent variables should be included.  You should, at this point, be able to make some preliminary observations about your data.  These observations (and any others you come up with later) should make their way into your business report, but will generally appear in appendices unless you determine them to be critical to the decision you are recommending. This submission should be a word document with your excel file also attached.

For each variable separately:

- Variable Name

- Description (what is it?)

- Units

- Central Tendency (mean, median, mode - use the appropriate one!)

- Variability (range, standard deviation)

- Normal distribution?  (continuous variables only)

- Outliers?  What did you decide to do with the outliers?

- Correlation with your Dependent variable

- Concerning correlations with other Independent variables (.7 or higher)

- Visual representation of variable

Overview of Data

- After running all descriptive information, do you have any thoughts on which may be better predictors of your dependent variable or thoughts overall of how things look? 

Phase III:  Model Construction and Business Report

You will build a multiple regression model from your data using the techniques we have learned in the course.  You should decide here how you intend to use your model to conduct analysis, make predictions, and support decisions. 

You will wrap all of your work up in a business report.  Remember that the target is an executive who you will ask for a decision based on your recommendation.  Perform analysis with your model, interpret your model, include your calculations and the original data (in appendices) but present the bottom line to the decision-maker up front.  The report will be submitted in paper copy at the beginning of class.  The clear plastic binder is highly discouraged.

While many organizations suggest a format for a business report, there are as many that do not, so the presentation is up to you.  However, the following page may be used as a guideline.

Business Report Format-

Cover Sheet

Title.  Indicate who the report is for, and what the report is about.  (Use this to establish the "setting" for your instructor to grade your submission.)

Your name and position. (Again to establish context for the grader.)

Executive Summary.  A single paragraph that an executive can read and immediately know what decision you are recommending and why.

Main Body

A 2-3 page report that tells the executive what decision should be made, and why the decision should be what it is.  This should reference (and may include) the model you are using to support the decision-making process, and may also describe how confident the executive should be when making this decision.  (In extreme cases the report can go up to 5 pages.  Business reports not intended for senior executives may be longer, based on the organization's needs.)

BLUF! (Bottom Line Up Front!)  The decision should be clear after the first few sentences, and definitely by the end of the first paragraph.

Include only that information that will be critical to the executive's decision-making process.

Refer to all supporting data and analyses that are included in appendices.  Appendices should appear in order of importance, and should be referenced in that order.

Appendices

(No page limit, whatever is appropriate to describe the following)

A. Model and Interpretation

Show the final model (Y=....) you developed to support the decision, and interpret it, to include discussing the effects of the ranges of your input variables.  This is where you discuss the meaning and relationship between predictors and outcome (i.e. when Y increases, what happens to X?) there does not need to be "stats language" here.  It can be very helpful to plug in values to demonstrate how the model works.

B. Model Statistical Analysis

Discuss the strength of the model in terms of how it supports the decision-making process.  Include the relevant Excel output that supports the quality of the model.

  • Correlation and multiple regression analyses were conducted to examine the relationship between Y and X(s)....
  • Discuss normality, missing data problems (if any), outliers (if any), and correlations between Y and X(s) - strength, direction, and r^2.
  • Explain the MR output - r, r^2, F, p. Explain significant beta weights - t, p, relationship
  • Include final tables hereto refer to when discussing results.

C. Model Development

Explain the process you used to turn the data into a model.  Explain predictors that you started with and did not include in your final model with rationale.  Discuss how you checked for assumptions.  Discuss variable elimination and transformation, as well as any other clever modeling techniques you used.  You do not have to include every step of your process, but you should show critical analyses that led to important modeling decisions.

D. Data Analysis

Show your descriptive and graphical analysis of the data, to include all the observations that might contribute to the modeling process.

E. Data

Describe briefly the data set and include the sources.  For very small data sets you may include them.  For other data sets (hundreds of observations) or larger, do not waste your company's paper.

Attachment:- Assignment.rar

Reference no: EM131068431

Questions Cloud

What do the companies sell or produce : When were the companies founded? What do the companies sell or produce? What are the mission and vision statements of the companies?
What is their average tax rate : If a Real Estate Professional has $100,000 in Active Income and $20,000 in Real Estate losses how much can this person write-off of their loss against their Active Income? How much will they pay in total taxes? What is their Marginal Tax Bracket? Wha..
Construction accounting & financial management : Time cards are being entered into the accounting system for four employees. The costs for Employee 1 areto be billed to job cost code 302.01.01100L. Ten hoursof Employee 2 time is to be billed to job cost code302.01.06110L and the remaining 30 hou..
Calculate consumer surplus and producer surplus : Calculate consumer surplus and producer surplus.
Build a multiple regression model from your data : INFO 2020 PROJECT- Build a multiple regression model from your data, and prepare a business report that includes all of your previous work, and that presents a recommendation to a decision-maker based on your model and analysis
The opportunity of the purchase of the land : a business is considering a cash outlay of $250,000 for the purchase of land which it could lease for $35,000 per year. If alternative investments are available which yield an 18% return, the opportunity of the purchase of the land is:
What is the opportunity cost of a bottle of root beer : what is the opportunity cost of a bottle of root beer
Provide the definition of total comprehensive income : Provide the definition of total comprehensive income. Explain the rationale for presenting additional line items, headings, and subtotals in the statement of comprehensive income.
Truck for your construction company with a sticker price : You want to purchase another truck for your construction company with a sticker price of $25,000. The car dealer offers you a $2,000 discount (lowering the price to $23,000) and a 48-month, 8.5% APR compounded monthly. Or, no discount with a 4.0% ..

Reviews

Write a Review

Basic Statistics Questions & Answers

  Find probability that in a year hurricane will strike the us

Find the probability that in a given year a) exactly one major hurricane will strike the U.S. mainland, b) at most one major hurricane will strike the U.S. mainland.

  If the standard deviation for a population as estimated

if the standard deviation for a population as estimated from a sample is s 5.6 then the standard error for a sample

  Find the probability of getting a five each time

You are dealt one card from a 52- card deck. Then the card is replaced in the deck, the deck is shuffled, and you draw again. Find the probability of getting a picture card the first time ad a spade the second time.

  Drawing the sample of church members

For a regional survey of suburban households to obtain data on television viewing habits, a statistical sample of suburban areas is first selected.  Within the chosen areas, statistical samples of whole blocks are selected; and within the selected..

  1 the residents of a housing development for senior

1. the residents of a housing development for senior citizens have completed a survey in which they indicated how

  Question on binomial distribution

Can the binomial distribution be used to model the number of rare events that occur over a given time period?

  Find the expected gain for the insurance company

If the probability that she dies in the coming year is 0.001, what is the expected gain for the insurance company?

  Assume that the given procedure yields a binomial

assume that the given procedure yields a binomial distribution with n trials and the probability of success for one

  To determine the efficiency of the current ticket operation

mike dreskin manages a large los angeles movie theater complex called cinema l ll lll lv. each of the four auditoriums

  What type of non-probability sampling would involve

A school administrator randomly selects 12 classes from your school and then selects all of the students of those classes to study a school library issue. Which type of sampling design is being used.

  Construct a suitable diagram to represent this data

Four regions of Yorkshire classify companies according to primary, manufacturing, transport, retail and service. The number of companies working in each region within each category is shown below.  Industrial Sources for Consumption and Investment..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd