DALT7002 Data Science Foundations Assignment

Assignment Help PL-SQL Programming
Reference no: EM132495631

DALT7002 Data Science Foundations - Oxford Brookes University

Learning Outcome 1. Demonstrate the ability to identify and integrate data of various types from traditional and alternative sources, and make informed judgements about their use in data science research
Learning Outcome 2. Critically evaluate the methodologies applied in data collection, data processing, data analysis & dissemination of research findings
Learning Outcome 3. Critically assess methods and data strengths and limitations combined to application of R and/or Python

Introduction

In this coursework you will prepare a data model that combines a range of data sets. We are primarily interested in the processes you take to achieve your data model, though you will need to produce a final data set and model.

Scenario

Oxford Brookes University would like to offer a new service to staff to encourage the brightest and best staff to join us, and in recognition of the fact that Oxford itself, can be a very expensive place to live.

This new service is a town advice service that recommends towns in Oxfordshire based on a certain key characteristics, these being:

• House prices
• Broadband speed
• Crime in the area over the last month

They would also like to consider other factors such as:

• Nearby rights of way
• Distance from Oxford vs size of the road
• Availability of Allotments

There may be other factors. So you should also gather more information from a member of Oxford Brookes academic staff to find about any other key issues that might affect a person's choice of location.

Tasks

You must use datasets that are published on by the UK government, either centrally or through a public body that would be available to a member of the UK public. You should prepare a brief questionnaire about the knowledge acquisition and send it to a domain expert (in this case Dr. Younas) to gain an insight into any other data sources you may wish to query. Dr. Younas's email


Using this information, you should produce a unified data set and model that could be used to drive a recommendation system, documenting and explaining all the processes that you undertake to achieve this data set and model. You must ensure that -

• All data used is normalised to at least 3NF
• You must use the MySQL server on SOTS to store the data or another MySQL server. You should include your tables as part of the report
• Your model must use the three key characteristics
• Your model may use the additional characteristic(s) suggested above or that arise from the knowledge acquisition session
• The combined data set must be stored in a MySQL server
• You should demonstrate that you can query the data set in R
• You should have a simple recommendation system, written in R, that allows the user to specify a value in the range 0-10, for each of the three key characteristics and then produces a score for a town and displays the top 3 towns in order
• The towns used are in Oxfordshire.
• You may restrict the number of towns you look at to main towns, but you must justify your selection in your report

You should produce a report detailing

• The stages you took to identify, obtain, clean, and use the data sets associated with the three key characteristics
• The stages you took to identify, obtain, clean and use any additional data sets that you needed to either combine or fully utilise the three key characteristics
• A justification of the approaches used in identifying, cleaning, and using the datasets
• How you might obtain, clean and use any one data set associated with the optional characteristics (Note: you do not have to do the actual work, just say what the issues are with this type of data and how you might incorporate it into your system)
• The results of your knowledge acquisition questionnaire with your domain expert
• How you might obtain, clean and use any one additional data set based on your knowledge acquisition questionnaire (Note: you do not have to do the actual work, just say what the issues are with this type of data and how you might incorporate it into your system)
• A discussion of any legal or ethical issues with the proposed system and the data used
• An overview/design of your R code
• Your R code
• Names and descriptions of the MySQL database tables
• Testing of your system

Attachment:- Data Science Foundations.rar

Reference no: EM132495631

Questions Cloud

Lower average mileage than mark b : A gives a lower average mileage than mark B? Find the value of p, interpret the result. What assumptions should you take to work on this problem?
What amount of working capital is currently maintained : Your preference is to have a quick ratio of at least 0.80 and a current ratio of at least 2.00. How do the existing ratios compare with your criteria?
Discuss Slow Food Movement in relation to food consumption : Critically discuss the Slow Food Movement in relation to food consumption in the 21st century, integrating and giving examples of the concepts of provenance
Normal variable with mean : Given that x is a normal variable with mean µ = 44 and standard deviation s = 6.7, find the following probabilities
DALT7002 Data Science Foundations Assignment : DALT7002 Data Science Foundations Assignment help and solution, Oxford Brookes University - assessment writing service - Demonstrate the ability to identify
Prepare separate entries for each transaction for lima : Prepare separate entries for each transaction for Lima. The merchandise purchased by Maw on June 10 cost Lima $3000 and the goods returned cost.
Customer buying an air conditioner : A heating and cooling company advertises that any customer buying an air conditioner during the first 16 days of July will receive a 25 percent discount
Prepare separate entries for transaction on books of maw co : Prepare separate entries for each transaction on the books of Maw Co. On June 10, Maw Co. purchased $6000 of merchandise from Lima Co
What would be the net book value on january : An estimated residual value of $1,200. The company uses double-declining-balance depreciation. The net book value on January 1, 2021 would be

Reviews

Write a Review

PL-SQL Programming Questions & Answers

  Create a database model

Create a database model and Submit the table creation statements for the Database Model.

  Write pl-sql procedures and functions

Write PL/SQL procedures and functions to populate and query that database

  Sql questions

Write a query to display using the employees table the EMPLOYEE_ID, FIRST_NAME, LAST_NAME and HIRE_DATE of every employee who was hired after to 1 January, 1995.

  Run the lab_03_01.sql script

Run the lab_03_01.sql script in the attached file to create the SAL_HISTORY table. Display the structure of the SAL_HISTORY table.

  Write sql queries

Write a query to display the last name, department number, and salary of any employee whose department number and salary both match the department number and salary of any employee who earns a commission.

  Explaining sql insert statement to insert new row in cds

Write down a SQL insert statement to insert new row in "CDS" table.

  Write down name of actors in ascending order

Write down actors (or actress, your choice, but not both) who have won at least two (2) Academy Awards for best actor/actress. Provide the actor name, movie title & year. Order the result by actor name."

  What is an sql injection attack

What is an SQL injection attack? Explain how it works, and what precautions must be taken to prevent SQL injection attacks.What are two advantages of encrypting data stored in the database?

  Determine resonant frequency in series rlc resonant circuit

Given the series RLC resonant circuit in the figure, operating at variable frequency, determine: The resonant frequency ω o ,  The circuit’s quality factor Q , The cut-off frequencies, f 1  & f 2  and the bandwidth BW

  Query that uses cube operator to return lineitemsum

Write summary query which uses CUBE operator to return LineItemSum (which is the sum of InvoiceLineItemAmount) group by Account(an alias for AccountDesciption).

  Query to show customers were missing for existing orders

As DBA, your manager called a meeting and asked why there are so many orders for customers that don't exist in the customer table. Write query which would shows which customers were missing for existing orders. Use a join or a subquery.

  Sql query into a relational algebra statement

Turn this SQL query into a relational algebra statement? SELECT Request.reqfor, Ordering.invamt, Ordering.invnbr, Ordering.invdat

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd