Applying data science techniques to a concrete problem

Reference no: EM131754893

Project

This project is open-ended. It consists of your approach to applying data science techniques to a concrete problem. Consider a data-related problem in a field that interests you, and pick a subject you like so that the project means something to you. You should use public data sources: I mentioned some during the course, and I give some suggestions at the end of this document. You can use any data you like (you may even scrape it, if you wish).

In short, ask yourself a question related to data, collect and visualize the data, then answer (or say something about) the question you asked. You can use your imagination or build on examples we used during the semester.

You need to submit a written document together with code, touching on the elements listed below. Your project must incorporate visualizations. Submit it as a PDF (for example, a document saved as PDF), an R presentation (R Markdown / Shiny), or an HTML file. If you have something else in mind, let me know before you start.

Your project should include:

- What is the question you hope to answer?

- What data are you planning to use to answer that question?

- What do you know about the data so far?

- Why did you choose this topic?

- How did you gather the data?

- How did you preprocess (clean) the data?

- What methods did you use to filter the data, if appropriate?

- What programming language did you use and why?

- How did you model (visualize) the data?

- All the relevant code and output.

- Conclusion (did you answer the question you posed at the beginning?)
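
As an illustration only, the gather, clean, filter, and visualize steps in the list above might look like the following minimal Python sketch. Everything in it is an assumption made for the example: the city names, years, and counts are made-up placeholder records (not real data), and the helper names `load_rows`, `clean`, `filter_years`, `mean_by_city`, and `text_bar_chart` are hypothetical. It uses only the standard library; in a real project you would more likely read a file from one of the public sources and plot with a library such as pandas and matplotlib, or their R equivalents.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Made-up placeholder records standing in for a downloaded public CSV file.
RAW_CSV = """city,year,complaints
CityA,2015,90
CityA,2016,120
CityA,2017,
CityB,2016,95
CityB,2017,110
CityB,2017,n/a
CityA,2017,150
"""


def load_rows(text):
    """Gather: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Preprocess: drop rows whose count is missing or not a whole number."""
    cleaned = []
    for row in rows:
        value = row["complaints"].strip()
        if value.isdigit():
            cleaned.append(
                {"city": row["city"], "year": int(row["year"]), "complaints": int(value)}
            )
    return cleaned


def filter_years(rows, first_year):
    """Filter: keep only observations from first_year onward."""
    return [row for row in rows if row["year"] >= first_year]


def mean_by_city(rows):
    """Summarize: average complaint count per city."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["city"]].append(row["complaints"])
    return {city: mean(values) for city, values in groups.items()}


def text_bar_chart(summary, scale=10):
    """Visualize: one line per city, roughly one '#' per `scale` complaints."""
    lines = []
    for city, value in sorted(summary.items()):
        lines.append(f"{city:<8} {'#' * round(value / scale)} {value:.1f}")
    return "\n".join(lines)


rows = filter_years(clean(load_rows(RAW_CSV)), first_year=2016)
summary = mean_by_city(rows)
print(text_bar_chart(summary))
```

Keeping each step in its own small function makes it easier to describe in the written report what was done at each stage and why.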

The project should be based on what we discussed and does not have to incorporate advanced statistical analysis.

Examples of public data sources (search for the names online):

- data.gov

- NYC open data, OpenData DC, DataLA

- Yelp data

- UN data

- Twitter data

- Rdatasets

- pythonapi

- Quandl

- US Census

- County Health Data

- City Portals






All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd