Identify what types of models you used to describe the data

Assignment Help Management Information Sys
Reference no: EM131126183

General assignment: Your term projects should fall within the scope of a data analytics problem of the type you have worked with in class/ labs, or know of yourself - the bigger the data the better. This means that the work must go beyond just making lots of figures.

You should develop the project to indicate you are thinking of and exploring the relationships and distributions within your data to lead to optimized predictive models.

Start with a hypothesis, claim, or questions. Think of one or more ways to construct model(s)1, find or collect the necessary data, and do both preliminary analysis, detailed modeling, validation, summary (interpretation) and (if any) resulting decisions.

Note: You do not have to come up with a positive result, i.e. disproving the hypothesis is just as good. Please use the section numbering below for your written submission for this assignment.

Guidance: Topics, scope and general nature - please use the opportunity in Assignment 5 (project proposals) and seek feedback from the instructor and your classmates.

1. Introduction

Describe your motivation, initial hypothesis/ idea that you wanted to investigate, and if applicable any prior work, interest in the topic (like an intro for a paper, with references). Min. 1/2 page.

2. Data Description

1 NOTE: graduate students must develop at least two different types of models, not just change the number of variables for a given model.

Describe how you determined which datasets you used in this project, the criteria, source, data and information-types in detail, associated documentation and any other supporting materials. Min. 1/2 page text (+graphics if applicable).

3. Analysis

Explore the statistical aspects of your datasets. Perform any transformations, interpolations, smoothing, cleaning, etc. required on the data, to begin to explore your hypothesis/ questions. Analyze the distributions; provide summaries of the relevant statistics and plots of any fits you made. Discuss and specify or estimate possible sources of error, uncertainty or bias in the data you used (or did not use). Min. 2 pages text + graphics.

4. Model Development and Application of model(s)

Identify what types of models you used to describe the data (regression, classification, clustering, etc.), patterns/ trends you found, visual approaches that helped you choose models, and or variables (type/ number) in the model, other parameter choices or settings for the models (e.g. distance metrics, kernels, etc.). Apply the models to assess model performance (i.e. predict). Discuss the confidence in your results including any statistic measures. Discuss how you validated your models and performed any optimization (give details). Min. 6 pages text + graphics.

5. Conclusions and Discussion

Describe your conclusions; interpret the results, predictions you made, the models and their characteristics, and a give summary of what changed as you went through the project (data, analysis, model choices, etc.), what you would do next, or do differently in a subsequent exploration. Min. 1 page text + graphics (optional).

References - websites, papers, packages, data refs, etc. should be included at the end.

Include your R scripts! (e.g. in a zip file).

6. Oral presentation (5%). Suggest these slides (limit your presentation to 5 mins):

a. Title (with your name)
b. Problem area - what you wanted to explore/ solve/ predict and why, and what you wanted to predict?
c. The data - where it came from, why it was applicable and the preliminary assessments you made.
d. How you conducted your analysis: distribution, pattern/ relationship and model construction. What techniques did you use/ not use and why?
e. How did you apply the model? How did you optimize, account for uncertainties?
f. What did you predict and what decisions (prescriptions) were possible. What was the outcome?

Graphical Representations

Provide graphical representations related to each of questions 2, 3, and 4, at least.
Ensure all figures are numbered, legible, fully explained and annotated.
The final document should be a minimum of 8 pages of writing (but can be more). All graphics should be within your written assignment unless they are very large. Large graphics files should be sent as a separate attachment (e.g. in a zip file).

Reference no: EM131126183

Questions Cloud

Prepare the balance sheet as of 30 november 2010 : Using the data for Impeccable Travel Service shown in Practice Exercises 1-4A and 1-5A, prepare the balance sheet as of November 30, 2010.
Conaway company purchased a machine for cash : Conaway Company purchased a machine for cash on January 1, 1998. The price was $31,000. In addition, Conaway incurred costs of $200 in transporting the machine to the factory site and a further $800 in installing the machine.
Using the data for express travel service shown in practice : Janis Paisley invested an additional $30,000 in the business during the year and withdrew cash of $18,000 for personal use.
Did the pahler court use the same reasoning : Recall the difference between a crime and a tort. Based on these two cases, analyze and discuss whether artists should be held liable for the actions of their fans.
Identify what types of models you used to describe the data : Explore the statistical aspects of your datasets. Perform any transformations, interpolations, smoothing, cleaning, etc. required on the data, to begin to explore your hypothesis/ questions. Analyze the distributions; provide summaries of the rele..
In which state or states can the suit be brought : In Chapter 1 of the text you read about the Bailey v. Eminem defamation case where the court held Eminem's lyrics were protected by the First Amendment. Read the article and view the video (the links are listed under Week 1 Additional Learning
What are the celsius temperatures of body temperature : What are the Celsius temperatures of Body temperature, ice freezing, water boiling, room temperature?
Compare two versions of the same article by an author : Read the two (2) versions of the article titled: "The Objectification of Women. Whose Fault Is it?" by Santi DeRosa in Chapter 8. Identify the thesis statement of each version. Summarize the second or final version.
Calculate the mass of steam : A heat exchanger is used to warm apple cider using steam as the heat source. - Calculate the mass of steam required to heat 150 kg of cider.

Reviews

Write a Review

Management Information Sys Questions & Answers

  Draw a swim lane diagram showing all roles

Draw a swim lane diagram showing all roles, tasks and decisions - Create a spreadsheet to show how the information collected, during the accident of a crash, can be stored in a database.

  Handling attackers to ensure that the application behaves

handling attackers to ensure that the application behaves appropriately when being directly targeted taking suitable

  Design a query tolist all rooms

Design a query tolist all rooms costing over $100.00, format the dollar amount field to have a '$' sign, label the query.

  What are the pros and cons of patient care

Share your experiences with healthcare information systems in your clinical setting. What are the pros and cons of patient care? If you are not currently working, think about your experiences as a consumer of healthcare services, keeping in mind t..

  What procedures could you follow to minimize risk

How can information technology support a company's business processes and decision making and give it a competitive advantage? Give examples to illustrate your answer.

  Impact of global expansion on supply chain

What impact has global expansion had on supply chain practices

  Compare the definitions and them into one best definition

Find at least three definitions for object-oriented programming. Compare the definitions and them into one "best" definition.

  What is the role of an hcos strategic planning process

What is the role of healthcare marketing in an HCO's strategic planning process

  Important information about erp systems1 at what size does

important information about erp systems1. at what size does an organization consider an erp system? why does size

  System development projectswhat situations might prompt

system development projectswhat situations might prompt system development projects? select one of the situation types

  Write paper on android fragmentation-burden or benefit

Write paper on Android Fragmentation: Burden or Benefit

  Using the following data regarding hospital monthly

1. using the following data regarding hospital monthly expenditures in 000s of dollars evaluate normality. are the

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd