Construction of the data dictionary and programming task

Assignment Help Other Subject
Reference no: EM132487466

SIT112 Data Science Concepts

Task Description

There are two main tasks for this assignment:
• Construction of the data dictionary and
• Programming tasks to perform data analysis and descriptive analytics.

Construction of the Data Dictionary

For a data scientist, after obtaining the dataset, the first most crucial task is to obtain a good understanding of the data they are dealing with. This includes: examining the data attributes

(or, equivalently, data fields), seeing what they look like, what is the data type for each field, and, from this information, determining suitable analysis tools. A systematic approach to this process, as we have learned from the lectures and practical sessions, is to construct a data dictionary for the dataset.

You are required to prepare two sheets in your data dictionary Excel file:

• Dataset description
• Attribute dictionary

The total for this task is 35 marks. The data description sheet is worth 5 marks. The attribute dictionary is worth 30 marks, where each correct attribute specification is worth 2.5 marks. Name your solution as [YourID]_datadictionary.xls and submit this file.

Programming task

A Python Jupyter Notebook file assignment1_notebook.ipynb has been prepared for you to complete this task. Download this notebook, load it up to Jupyter and follow instructions inside the notebook to complete this task.

Attachment:- Data Science Concepts.rar

Reference no: EM132487466

Questions Cloud

Explain an organization using agile : Explain an organization using agile and how it affects strategic planning of the organization. what are other methods other than agile methodology.
Determine the discount rate assuming the present value : Bank Z offers you 6% semi-annual interest due at the end of the year. What will be the difference in the Effective Interest Rate charged by the two banks?
Massive volumes of grains and oilseeds through livestock : Why, according to Weis, is it such a big ecological problem to be cycling massive volumes of grains and oilseeds through livestock?
Prepare the journal entry on December : The employee bonuses for the current year is estimated to be $970,000. Prepare the journal entry on December 31 to record the bonuses
Construction of the data dictionary and programming task : Construction of the data dictionary and Programming tasks to perform data analysis and descriptive analytics - A systematic approach to this process
Educational systems outside of the united states : What are two examples of teacher leaders in educational systems outside of the United States?
Compare the use of r vs python : Several Big Data Visualization tools have been evaluated in this week's paper. While the focus was primarily on R and Python with GUI tools, new tools.
What are the underlying assumptions : What potential conflicts exist? What are the potential overlapping responsibilities that require a segregation of duties with access to organizational value?
How comfortable would you be using a bathroom : How comfortable would you be using a bathroom that was not specified as either for males or females?

Reviews

Write a Review

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd