Design and develop advanced big data applications

Reference no: EM132871137, Length: 2500 words

KF7032 Big Data and Cloud Computing - Northumbria University

Big Data and Cloud Computing

Aims
The aim of this assignment is to introduce a practical application of Big Data and Cloud Computing using a realistic big data problem. Students will implement a solution using an industry-leading Cloud computing provider together with appropriate distributed processing environments such as Apache Spark. This will involve provisioning and configuring appropriate Cloud Computing resources and selecting problem-appropriate algorithms and visualization methods.

Learning Outcome 1: Apply big data analytic algorithms, including those for visualization and cloud computing techniques to multi-terabyte datasets.

Learning Outcome 2: Critically assess data analytic and machine learning algorithms to identify those that satisfy given big data problem requirements.

Learning Outcome 3: Critically evaluate and select appropriate big data analytic algorithms to solve a given problem, considering the processing time available and other aspects of the problem.

Learning Outcome 4: Design and develop advanced big data applications that integrate with third party cloud computing services.

Learning Outcome 5: Critically assess and interpret primary research to identify its applicability to a given big data problem scenario.

Big Data Product: Burglary Protection

In this scenario you are a data scientist working with a marketing consultancy. Your client is an insurance company that is developing a highly segmented home insurance product. Since it is hypothesized that customers who live in an area where burglary is prevalent would be more interested in a new insurance policy, the company would like to find out whether Burglary is more frequent in particular areas of England. If that is the case, the company needs to determine whether these are areas of affluence, where a premium policy with high benefits could be sold, or areas of relative deprivation, where a low-cost economy policy with proportionately lower pay-outs would be more appropriate.

To solve this problem, you will use publicly available data sets that have been prepared for you and placed in Amazon S3. These include (but are not limited to):

1. Street Level Crime Data: Published by the UK Home Office, this dataset contains 19 million data rows, each giving a crime type together with its location as a latitude and longitude.

2. Land Registry Price Paid Data: This gives the postcode of a property, the property type from an enumeration of D (Detached), S (Semi-Detached), T (Terraced), F (Flats/Maisonettes), and the price paid.

3. English Indices of Deprivation Data: The English Indices of Deprivation 2010 data set contains the rankings of measures of deprivation at small-area level across England. The 32,000 localities are ranked from the least to the most deprived and scored on seven different dimensions of deprivation.

4. Postcode Data: This data set, provided by the Ordnance Survey, gives a latitude and longitude for every postcode. This is useful for relating the postcodes in the Land Registry Price Paid dataset to the latitude/longitude locations in the crime dataset.
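The link between a crime location and a postcode can be illustrated with a nearest-neighbour match on great-circle distance. The following is a minimal sketch in plain Python, not the method prescribed by the assignment: the function names, the tuple layout and the sample postcodes and coordinates are all hypothetical and made up for illustration. A brute-force scan like this would not scale to 19 million rows; in Spark one would more likely bucket points into grid cells or use a broadcast join, which is left as a design decision.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/long points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_postcode(lat, lon, postcodes):
    """Return the postcode whose coordinates are closest to (lat, lon).

    postcodes is an iterable of (postcode, latitude, longitude) tuples;
    this layout is an assumption, not the real file schema.
    """
    closest = min(postcodes, key=lambda p: haversine_km(lat, lon, p[1], p[2]))
    return closest[0]


# Hypothetical postcode rows with made-up labels and approximate coordinates.
POSTCODES = [
    ("XX1 1AA", 54.9770, -1.6080),
    ("YY2 2BB", 51.5010, -0.1416),
]

# A crime recorded close to the first point is matched to its postcode.
matched = nearest_postcode(54.98, -1.61, POSTCODES)
```

Once each crime row carries a postcode, it can be joined to the Land Registry Price Paid data on that key.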

Specifics
1. Process the data prepared for you using Apache Spark.
2. Filter the dataset so that crimes refer to Burglary only.
3. Using appropriate software, determine whether Burglary is more closely associated with areas of affluence, relative deprivation, or neither.
4. Select and prepare no more than three visualizations to support your analytic findings from (3).
5. Explain the reasoning behind your code so that it is clear what each block is intended to achieve, and why.
6. Report critically on the advantages, disadvantages, and limitations of the methods used.
7. Your submission will be a Jupyter Notebook containing both code (typically Python) and explanatory text (Markdown), limited to 2500 words (plus references). References from scientific literature must be used, and your discussion must be in your own words.
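The logic behind steps 2 and 3 above can be sketched at small scale in plain Python before it is moved to Spark. This is an illustration under stated assumptions, not a model answer: the field names `crime_type` and `area`, the sample rows and the deprivation ranks are hypothetical, and Spearman rank correlation is only one possible measure of association. A higher rank is taken to mean more deprived, following the ordering described for the Indices of Deprivation data.

```python
from collections import Counter


def burglary_counts(crimes):
    """Count Burglary records per area (step 2: keep Burglary rows only)."""
    return Counter(c["area"] for c in crimes if c["crime_type"] == "Burglary")


def ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(xs, ys):
    """Pearson correlation; assumes neither input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Hypothetical sample rows; real data would be read from S3 with Spark.
CRIMES = [
    {"crime_type": "Burglary", "area": "A"},
    {"crime_type": "Burglary", "area": "A"},
    {"crime_type": "Burglary", "area": "A"},
    {"crime_type": "Vehicle crime", "area": "A"},
    {"crime_type": "Burglary", "area": "B"},
    {"crime_type": "Burglary", "area": "B"},
    {"crime_type": "Burglary", "area": "C"},
    {"crime_type": "Anti-social behaviour", "area": "C"},
]
DEPRIVATION_RANK = {"A": 3, "B": 2, "C": 1}  # made-up ranks

counts = burglary_counts(CRIMES)
areas = sorted(counts)
rho = spearman([counts[a] for a in areas], [DEPRIVATION_RANK[a] for a in areas])
```

A coefficient near +1 in this setup would suggest burglary rises with deprivation, near -1 that it rises with affluence, and near 0 that there is no monotonic association. Raw counts ignore population size, so a per-household or per-capita rate may be a fairer basis for comparison, and any correlation found should be reported with its limitations (step 6).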

Harvard Referencing

Attachment:- Big Data and Cloud Computing.rar

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd