Discuss the important features of data mining tools

Reference no: EM131236066

Part 1

There are many large datasets available at:

https://data.london.gov.uk/

https://www.microstrategy.com/cloud/personal/datasets/

Choose TWO large datasets and analyse them with pivot tables. Document the insights and trends that you find during the analysis.

Submit a Word document that includes, FOR EACH DATASET:

1. The URL of the dataset.

2. A description of the dataset.

3. A screen capture showing the first page of the Excel spreadsheet containing the dataset.

4. Screen captures of TWO DIFFERENT pivot tables built on the dataset, together with any graphs produced from them.

5. A clear analysis of your findings.

6. Focus on the insight you are trying to gain: try differentiating dimensions from facts in the dataset. You will usually have dimensions as the rows (time, location, product type) and facts in the centre (revenue, cost, etc.).
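To make the dimension/fact distinction in point 6 concrete, here is a minimal sketch in plain Python of what a pivot table with a SUM aggregation computes. The sample rows, the column names (year, borough, revenue) and the helper pivot_sum are all hypothetical and are not taken from either dataset site; the assignment itself should still be done with Excel pivot tables.

```python
from collections import defaultdict

# Hypothetical sample rows: two dimensions (year, borough) and one fact (revenue).
rows = [
    {"year": 2015, "borough": "Camden", "revenue": 120.0},
    {"year": 2015, "borough": "Hackney", "revenue": 80.0},
    {"year": 2016, "borough": "Camden", "revenue": 150.0},
    {"year": 2016, "borough": "Hackney", "revenue": 95.0},
    {"year": 2016, "borough": "Camden", "revenue": 30.0},
]


def pivot_sum(records, row_dim, col_dim, fact):
    """Sum `fact` for every (row_dim, col_dim) pair, like a pivot table using SUM."""
    table = defaultdict(lambda: defaultdict(float))
    for record in records:
        table[record[row_dim]][record[col_dim]] += record[fact]
    # Convert to plain dicts so the result prints and compares cleanly.
    return {row: dict(cols) for row, cols in table.items()}


# Dimensions go on the rows and columns; the fact fills the centre.
pivot = pivot_sum(rows, "year", "borough", "revenue")
for year in sorted(pivot):
    print(year, pivot[year])
```

Swapping the row and column arguments, or choosing a different fact, gives a different view of the same data, which is the idea behind producing two different pivot tables per dataset.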


Part 2

Write a 1200-word (I am not going to count them) technical report (in MS Word), complete with proper referencing, from the position of a professional business analyst, to address the following:

(a) Discuss the important features of data mining tools; and
(b) Discuss how data mining can realize the value of a data warehouse.

This exercise is from questions 18.13 and 18.14 on page 460 of the textbook.

Part 3

The four "V"s of big data are volume, variety, velocity and veracity, which reflect, respectively, the amount of data, the different types of data, the speed with which it is collected, and the uncertainty relating to its truth.

You are a large department store thinking about using big data to understand your customers better.

Draw an entity relationship model containing key attributes from a customer's internet browsing activity, your transaction sales database, social media activity and publicly available demographic data on your site (or any other interesting sources - CCTV, telephone). The title of the diagram should contain the purpose of the model, what you are trying to achieve.
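As a starting point for thinking about entities before drawing the diagram, the sketch below expresses a few candidate entities as Python dataclasses. Every entity name, attribute and sample value here is a hypothetical illustration, not a required or complete answer; the shared customer_id and postcode fields stand in for the relationships you would draw as lines in the model.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Customer:
    customer_id: int   # primary key (hypothetical)
    postcode: str      # links to public demographic data


@dataclass
class Transaction:
    transaction_id: int
    customer_id: int   # foreign key to Customer
    sale_date: date
    amount: float


@dataclass
class BrowsingSession:
    session_id: int
    customer_id: int   # foreign key to Customer
    page_url: str
    seconds_on_page: int


@dataclass
class DemographicArea:
    postcode: str      # joined to Customer.postcode
    median_income: float


def total_spend(transactions, customer_id):
    """Follow the Customer-to-Transaction relationship and sum the amounts."""
    return sum(t.amount for t in transactions if t.customer_id == customer_id)


# Hypothetical sample data, for illustration only.
sales = [
    Transaction(1, 101, date(2016, 3, 1), 59.90),
    Transaction(2, 101, date(2016, 3, 8), 20.10),
    Transaction(3, 102, date(2016, 3, 9), 15.00),
]
print(round(total_spend(sales, 101), 2))
```

Writing the keys out this way can help check that each data source in the diagram really has an attribute it can be joined on.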

Submission of assignment

You can put both parts 1 and 2 in ONE Word document and email it to me from an ECU email address. I will acknowledge receipt by email. If you do not receive acknowledgement then it means that I have not received it.


Course: MAN6905 - Databases and Business Intelligence




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd