CIS7031 Programming for Data Analysis Assignment

Assignment Help Computer Engineering
Reference no: EM132495967

CIS7031 Programming for Data Analysis - Cardiff Metropolitan University

This assessment is designed to demonstrate a student's completion of the following Learning Outcomes:

Learning Outcome 1: Critically analyse and evaluate various statistical and computational techniques for analysing datasets and determine the most appropriate technique for a business problem;

Learning Outcome 2: Critically evaluate, develop and implement solutions for processing datasets and solving complex problems in variousenvironments using relevant programming paradigms;

Learning Outcome 3: Evaluate and apply key steps and issues involved in data preparation, cleaning, exploring, creating, optimizing and evaluating models;

Learning Outcome 4: Evaluate and apply aspects of data science applications and their use.

Assessment Requirements

This assignment will use employment data of Wales from the StatsWales data source. This dataset provides workplace employment estimates, or estimates of total jobs, for Wales and its NUTS2 areas, along with comparable UK data disaggregated by industry section.
For this assignment students will undertake a data analysis and machine learning approach to reveal the workplace employment landscape of Wales.

Part 1. Data processing
1.1. Download the dataset for the period 2009 - 2018 and create a dataframe that concatenates Wales (total)employment value only.
1.2. Check for any null value or outlier. If found replace that with mean value.
1.3. Change the name of the industries as bellow
The final data frame should look like following

Part 2. Data analysis
For each question provide graph/chart along with your own interpretation (~ 50 words)
2.1. Which industry employed highest and lowest workers over the period?
2.2. Which industry has the highest and lowest overall growth over the period?
2.3. Which years are the best and worst performing year in relation to number of employment. (highest and lowest employment)

Part 3. Visual analysis
Create a dynamic scatter/bubble plot showing the change of workforce number over the period using Plotly express.

Part 4. Correlation
4.1. Taking average employment number for each industry over the period, show and identify the highest and lowest correlated industries.
4.2. Make a year wise correlation for each industry. Does the aforementioned industries are also correlated over the each year? Explain your answer.

Part 5. Clustering (k means&hierarchical)
5.1. Using the best and worst performing year column's employment data (2.3) undertake a K means clustering analysis (K=2 & 3) and identify industries cluster together. Writeyour own interpretation (~100 words).
5.2. Using the same dataset (best & worst performing) create a hierarchical cluster. Compare the cluster with k means clusters.

Part  6. Discussion
Provide a brief discussion (~ 300 words) on employment landscape of Wales based on the employment data analysis results.

Attachment:- employment data.rar

Reference no: EM132495967

Questions Cloud

Describe of potential boundary challenges in field education : An explanation of the use of self during your field education experience that you may have encountered or that you might encounter
Create a measurement plan to assess the phenomenon : Identify the phenomenon you would measure and explain how you conceptualize this phenomenon. Provide at least 3 questions you would use to measure
Explaining salient points of the given theoretical construct : Analyze resilience theory by explaining salient points of this theoretical construct. Using the different crises from your textbook (Chapters 10-14).
Explain the term economic restructuring : Explain the term Economic Restructuring. What were some of the consequences of the economic restructuring on these families? Make sure to use specific examples
CIS7031 Programming for Data Analysis Assignment : CIS7031 Programming for Data Analysis Assignment help and solution, Cardiff Metropolitan University - assessment writing service - develop and implement
Explain what the different zones that will passes : Explain what the different zones that Will passes. Explain how time is used to determine which social class a person belongs-provide examples from the film
Discuss what you found most interesting in the study : Choose a recently published research article related to adult development and aging and write a 1-2 page summary of the article. a. The research article.
How can values be positive and negative : In what ways could your values both benefit and impede in the work of helping others? Another way of asking this question is, "How can values
Design a small public health program : Design a small public health program that takes into account the audience that you are addressing. What are some things you should be aware

Reviews

Write a Review

Computer Engineering Questions & Answers

  Mathematics in computing

Binary search tree, and postorder and preorder traversal Determine the shortest path in Graph

  Ict governance

ICT is defined as the term of Information and communication technologies, it is diverse set of technical tools and resources used by the government agencies to communicate and produce, circulate, store, and manage all information.

  Implementation of memory management

Assignment covers the following eight topics and explore the implementation of memory management, processes and threads.

  Realize business and organizational data storage

Realize business and organizational data storage and fast access times are much more important than they have ever been. Compare and contrast magnetic tapes, magnetic disks, optical discs

  What is the protocol overhead

What are the advantages of using a compiled language over an interpreted one? Under what circumstances would you select to use an interpreted language?

  Implementation of memory management

Paper describes about memory management. How memory is used in executing programs and its critical support for applications.

  Define open and closed loop control systems

Define open and closed loop cotrol systems.Explain difference between time varying and time invariant control system wth suitable example.

  Prepare a proposal to deploy windows server

Prepare a proposal to deploy Windows Server onto an existing network based on the provided scenario.

  Security policy document project

Analyze security requirements and develop a security policy

  Write a procedure that produces independent stack objects

Write a procedure (make-stack) that produces independent stack objects, using a message-passing style, e.g.

  Define a suitable functional unit

Define a suitable functional unit for a comparative study between two different types of paint.

  Calculate yield to maturity and bond prices

Calculate yield to maturity (YTM) and bond prices

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd