Which sampling method summary stats

Assignment Help Other Subject
Reference no: EM133744866 , Length: word count:1600

Data Acquisition and Management, Sampling and data mining project

Your Task
Read the Assessment Instructions and complete sections (a)

Learning Outcome 1: Create analysis-ready data sets by applying and exploring basic validation, preprocessing, filtering and cleaning techniques

Learning Outcome 2: Evaluate and apply data mining software

Assessment Description

Business Problem: Airbnb is a U.S. company which provides an online marketplace for short- term and/or holiday accommodation. Airbnb collect large volumes of data to gain insight into their clients and associated customers, such as review scores, host acceptance rate, ‘superhosts', popular accommodation types and density of listings in particular location.

Data sets: We have obtained data on Airbnb listings in Melbourne with a variety of variables. Sampled datasets, the original data and data dictionary will be available from Week 4. See sections below.

Assessment Instructions

Analysis and Report
Use Microsoft Excel or Power BI or Tableau.

Recall the sampling methods below that you have learnt about in lectures.

A data dictionary file and the following datasets (as .csv files) that contain sample data generated using quota, systematic, simple random, and stratified sampling will be available from week 4, see section c. below. You will also have to access the original population dataset cleansed_listings_dec_18.csv from the source, see section a. and section e. below.

Create a report and include your response to the following questions:
Access the data file cleansed_listings_dec_18.csv, by going to the link provided on MyKBS under the Assessment 1 tab. You will initially be downloading a zip folder from the Melbourne Airbnb Open Data project on Kaggle. Extract all the files within the folder and then choose the file cleansed_listings_dec_18.csv. Browse over the columns and comment on which variables appear to be the most useful in terms of insights into current listings. Document that in your report. (150 words)

List an advantage, possible disadvantage and limitations of each of the sampling methods. (150 words)

Access the sampled data sets on MyKBS. Choose a number of different variables, as in part (a), then for each of the sampled datasets create summary statistics for each of those variables. That is, make sure that the selected variables are the same for each of the four datasets and document them in your report. (300 words)

Interpret and compare the results of the summary stats across all four sample datasets. What conclusions can you draw from the comparison. Document your findings in your report. (500 words)

Repeat the above for the original dataset cleansed_listings_dec_18.csv. Explain with statistical examples which sampling method summary stats (across all chosen variables) were nearest in value to the original dataset summary stats.

Explain the variations in your report and include the supporting data. Explain possible ethical issues that could occur from the use of sampled data.

Briefly evaluate the software that you have used to produce the summaries. (500 words)

 

Reference no: EM133744866

Questions Cloud

What factors contribute to our identity : What factors contribute to our identity? How do we prioritize what aspects/characteristics are most important in our identity?
Explain the major sources of revenue for the state : In 1250 to 1500 words, explain the major sources of revenue for the state and local governments in Texas. How is the state's tax system regressive?
What are best practices and how are they being implemented : What are the best practices? How are they being implemented? What could your work setting improve upon based on the best practices presented in the literature?
Explain with statistical examples which sampling method : Explain with statistical examples which sampling method summary stats (across all chosen variables) were nearest in value to the original dataset summary stats
Which sampling method summary stats : which variables appear to be the most useful in terms of insights into current listings - Interpret and compare the results of the summary stats across
Create summary statistics for each of those variables : DATA4200 Data Acquisition and Management, Sampling and data mining project - Create analysis-ready data sets by applying and exploring basic validation
Analyze the arguments for government intervention : Analyze the arguments for government intervention as opposed to arguments for market-based solutions. Hint: See the information about market failures.
Demonstrate an understanding of the world around you gained : Demonstrate an understanding of the world around you gained by reading and citing current, credible, and sourced newspapers.
Provide a summary of each interview : Provide a summary of each interview. Provide a summary describing any general thoughts or conclusions you gained from your interviews.

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd