Reference no: EM133767057, Length: 3000 words
Data Analytics
Assignment
Overview
Students are required to submit a report of approximately 2500 words, along with exhibits that support their findings on the provided Airbnb dataset. The report should consist of a comprehensive review of data quality issues, detailed descriptions of data cleaning techniques, and an in-depth theoretical discussion of data preprocessing. Additionally, students are required to present their work in class during week 12 in a 15-minute presentation.
Introduction:
Airbnb has significantly disrupted the traditional hospitality industry, with more travelers opting for Airbnb as their primary accommodation provider. In this assignment, students will theoretically analyze the Barwon South West dataset obtained from Inside Airbnb to ensure data quality and prepare it for further analysis.
Dataset
Inside Airbnb - Barwon South West, Vic, Victoria, Australia
The dataset can be downloaded from the link below:
Tasks for the Report:
Task 1: Identify Common Data Quality Issues
Identify and explain four common data quality issues present in the dataset that require cleaning.
Discuss how each issue can impact data analysis and decision-making processes.
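As an illustration of how such issues might be surfaced before they are discussed, the following is a minimal Python sketch that counts four candidate problems: missing values, duplicate rows, inconsistent category labels, and prices stored as text. The sample rows, the column names (id, room_type, price, reviews_per_month) and the function name profile_quality are assumptions made for this example; they are not taken from the brief, and the real dataset may show different issues.

```python
from collections import Counter

# Hypothetical sample rows. Column names and values are assumptions for
# illustration; they are not taken from the actual dataset.
SAMPLE_ROWS = [
    {"id": "1", "room_type": "Entire home/apt", "price": "$120.00", "reviews_per_month": "1.2"},
    {"id": "2", "room_type": "entire home/apt ", "price": "$95.00", "reviews_per_month": ""},
    {"id": "2", "room_type": "entire home/apt ", "price": "$95.00", "reviews_per_month": ""},
    {"id": "3", "room_type": "Private room", "price": "$9,999.00", "reviews_per_month": "0.4"},
]


def profile_quality(rows):
    """Count four kinds of potential data quality issues in a list of dict rows."""
    # 1. Missing values: empty cells per column.
    missing = Counter()
    for row in rows:
        for column, value in row.items():
            if value.strip() == "":
                missing[column] += 1

    # 2. Duplicates: rows that are identical to an earlier row.
    row_counts = Counter(tuple(sorted(row.items())) for row in rows)
    duplicates = sum(count - 1 for count in row_counts.values())

    # 3. Inconsistent labels: spellings that differ only by case or spacing.
    raw_labels = {row["room_type"] for row in rows}
    normalised = {label.strip().lower() for label in raw_labels}
    inconsistent_labels = len(raw_labels) - len(normalised)

    # 4. Format problems: prices stored as text rather than plain numbers.
    text_prices = sum(
        1 for row in rows if not row["price"].replace(".", "", 1).isdigit()
    )

    return {
        "missing": dict(missing),
        "duplicates": duplicates,
        "inconsistent_labels": inconsistent_labels,
        "text_prices": text_prices,
    }


report = profile_quality(SAMPLE_ROWS)
print(report)
```

Each count points to a different analytical risk: missing review rates can bias averages, duplicate rows inflate listing counts, inconsistent labels split one category into several, and text prices cannot be summed or averaged until they are converted.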
Task 2: Describe Data Cleaning Techniques
Identify and describe four common techniques used for data cleaning, including handling missing values, handling outliers, dealing with inconsistencies, and removing duplicates.
Provide theoretical examples of how these techniques can be applied to the Airbnb dataset.
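The four techniques named above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not a prescribed method: the sample listings, the column names, the choice of median imputation, and the 1.5 x IQR rule for outliers are all assumptions chosen for the example, and other reasonable choices exist (for instance, removing rows with missing values, or capping outliers instead of flagging them).

```python
import statistics

# Hypothetical listings; values and column names are illustrative assumptions.
listings = [
    {"id": 1, "room_type": "Entire home/apt", "price": 120.0, "reviews_per_month": 1.2},
    {"id": 2, "room_type": " entire home/apt", "price": 95.0, "reviews_per_month": None},
    {"id": 2, "room_type": " entire home/apt", "price": 95.0, "reviews_per_month": None},
    {"id": 3, "room_type": "Private room", "price": 80.0, "reviews_per_month": 0.4},
    {"id": 4, "room_type": "PRIVATE ROOM", "price": 110.0, "reviews_per_month": 2.0},
    {"id": 5, "room_type": "Entire home/apt", "price": 9999.0, "reviews_per_month": 0.8},
]


def remove_duplicates(rows, key="id"):
    """Keep the first row seen for each key value (returns copies)."""
    seen, unique = set(), []
    for row in rows:
        if row[key] not in seen:
            seen.add(row[key])
            unique.append(dict(row))
    return unique


def standardise_labels(rows, column="room_type"):
    """Resolve inconsistencies caused by stray spaces and mixed case."""
    for row in rows:
        row[column] = row[column].strip().capitalize()
    return rows


def fill_missing_with_median(rows, column="reviews_per_month"):
    """Replace missing values with the median of the known values."""
    known = [row[column] for row in rows if row[column] is not None]
    median = statistics.median(known)
    for row in rows:
        if row[column] is None:
            row[column] = median
    return rows


def flag_price_outliers(rows, column="price"):
    """Flag values outside 1.5 x IQR of the quartiles (flag, do not delete)."""
    q1, _, q3 = statistics.quantiles(
        [row[column] for row in rows], n=4, method="inclusive"
    )
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    for row in rows:
        row["price_outlier"] = not (low <= row[column] <= high)
    return rows


cleaned = remove_duplicates(listings)
cleaned = standardise_labels(cleaned)
cleaned = fill_missing_with_median(cleaned)
cleaned = flag_price_outliers(cleaned)
outlier_ids = [row["id"] for row in cleaned if row["price_outlier"]]
```

Duplicates are removed first so that repeated rows do not distort the median or the quartiles computed in the later steps. Outliers are flagged rather than deleted, since an unusually high price may be a genuine luxury listing and deserves a manual check.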
Task 3: Data Pre-processing
Perform data pre-processing to prepare the dataset for analysis. This includes selecting relevant columns, cleaning the dataset by removing irrelevant information, handling missing values, and transforming data formats. Students can do the pre-processing phase in Excel. They are required to provide an explanation of the preprocessed data and upload it to Moodle.
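For students who want to cross-check their Excel work, the pre-processing steps above (selecting relevant columns, handling missing values, and transforming data formats) might look like the following Python sketch. The inline CSV text, the column names (id, name, price, last_review, license), the price format with a dollar sign and thousands separator, and the date format YYYY-MM-DD are all assumptions made for illustration; check them against the actual file before reusing any of this.

```python
import csv
import io
from datetime import datetime

# Hypothetical extract; the real file's columns and formats may differ.
RAW_CSV = """id,name,price,last_review,license
1,Beach cottage,"$1,250.00",2024-03-15,
2,Town studio,$95.00,,
"""

# Columns assumed relevant for the analysis; 'license' is dropped as irrelevant.
RELEVANT_COLUMNS = ["id", "name", "price", "last_review"]


def parse_price(text):
    """Convert a price string such as '$1,250.00' to a float."""
    return float(text.replace("$", "").replace(",", ""))


def parse_date(text):
    """Convert 'YYYY-MM-DD' text to a date; empty cells become None."""
    return datetime.strptime(text, "%Y-%m-%d").date() if text else None


def preprocess(csv_text):
    """Select relevant columns and convert text fields to proper types."""
    converters = {
        "id": int,
        "name": str.strip,
        "price": parse_price,
        "last_review": parse_date,
    }
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {column: converters[column](row[column]) for column in RELEVANT_COLUMNS}
        for row in reader
    ]


prepared = preprocess(RAW_CSV)
for row in prepared:
    print(row)
```

Here a missing last_review is kept as an explicit empty value (None) rather than being filled in, on the assumption that a listing with no review date simply has not been reviewed yet; the report should state and justify whichever choice is made.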
- This unit requires you to use the APA system of referencing. See Sydney International's quick reference guide, which should be used in conjunction with the online tool Academic Writer.
Report Structure:
Cover Sheet
Title Page
Table of Contents
Common Data Quality Issues
Data Cleaning Techniques
Data Pre-processing
References
Report: The expected word count is 2,500 words. Students are expected to submit their assessments via Turnitin on Moodle. The due date for submission of the Group Report is week 11, 29th of September.
Presentation: 10-minute group presentation delivered in class in week 12.
Group Formation: Students are responsible for assigning themselves to a group in Moodle. Each group consists of 4 members.