Discuss the important features of data mining tools

Assignment Help Database Management System
Reference no: EM131236066

Part 1

There are many large datasets available at: https://data.london.gov.uk/

https://www.microstrategy.com/cloud/personal/datasets/

Choose TWO large datasets and analyse them with pivot tables. Document the insights and trends that you find during the analysis.

Submit a word document that includes FOR EACH DATASET:

1. The URL of the dataset.

2. A description of the dataset.

3. A screen capture showing the first page of the Excel spreadsheet containing the dataset.

4. Screen captures of TWO DIFFERENT pivot tables on the dataset utilised together with any graphs output.

5. A clear analysis of your findings.

6. Focus on the insight you are trying to gain: try differentiating dimensions from facts in the dataset. You will usually have dimensions as the rows (time, location, product type) and facts in the centre (revenue, cost etc).

1292_image.JPG

Part 2

Write a 1200 word (I am not going to count them) technical report (in MS Word), complete with proper referencing, from the position of a professional business analyst, to address the following:

(a) Discuss the important features of data mining tools; and
(b) Discuss how data mining can realize the value of a data warehouse.

This exercise is from questions 18.13 and 18.14 on page 460 of the textbook.

Part 3

The four "V"s of big data are volume, variety, velocity and veracity which reflect the amount of data, the different types, the speed with which it is collected, and the uncertainty relating to its truth.

You are a large department store thinking about using big data to understand your customers better.

Draw an entity relationship model containing key attributes from a customer's internet browsing activity, your transaction sales database, social media activity and publicly available demographic data on your site (or any other interesting sources - CCTV, telephone). The title of the diagram should contain the purpose of the model, what you are trying to achieve.

Submission of assignment

You can put both parts 1 and 2 in ONE Word document and email it to me from an ECU email address. I will acknowledge receipt by email. If you do not receive acknowledgement then it means that I have not received it.

Reference no: EM131236066

Questions Cloud

Nissan motor corporation advertisement : A Nissan Motor Corporation advertisement read, "The average man's I.Q. is 107. The average brown trout's I.Q. is 4. So why can't man catch brown trout?" Suppose you believe that the brown trout's mean I.Q.
What happens if you use larger numbers to declare thearrays : What happens if you change the NUMMONTHS and NUMYEARS de finitions to other values? Be sure to use both lower and higher values. Describe what happens if you use larger numbers to Declare thearrays.
Draw a lai artifact table to define a module : Using the notation of your choice, draw a process diagram of a software development process that prototypes three different designs and choose the best from among them.
Conduct a hypothesis test of your belief : You catch 12 brown trout. A fish psychologist determines the I.Q.s as follows: 5; 4; 7; 3; 6; 4; 5; 3; 6; 3; 8; 5. Conduct a hypothesis test of your belief.
Discuss the important features of data mining tools : MAN6905 - Databases and Business Intelligence - Choose TWO large datasets and analyse them with pivot tables. Document the insights and trends that you find during the analysis.
How should the portfolio manager immunize the portfolio : The December Treasury bond futures price is currently 91-12 and the cheapest-to-deliver bond will have a duration of 8.8 years at maturity. How should the portfolio manager immunize the portfolio against changes in interest rates over the next 2 m..
Normally distributed with a mean : The delivery times for all food orders at a fast-food restaurant during the lunch hour are normally distributed with a mean of 12.6 minutes and a standard deviation of3.9minutes. Let x¯be the mean delivery time for a random sample of 10 orders at ..
Find the mean and standard deviation of a sample mean : Consider a large population with μ=70 and σ=10. Assuming nN≤0.05, find the mean and standard deviation of a sample mean, x¯, for a sample size of 19.
How should the portfolio manager immunize the portfolio : The December Treasury bond futures price is currently 91-12 and the cheapest-to-deliver bond will have a duration of 8.8 years at maturity. How should the portfolio manager immunize the portfolio against changes in interest rates over the next 2 m..

Reviews

Write a Review

Database Management System Questions & Answers

  Imagine that you work for a finance industry-based

imagine that you work for a finance industry-based organization. your organization is looking to submit its database

  Create mock-up report to make the monthly claim

He wishes you to group data by insurance company number, with subtotals by company and grand totals for each numeric field.

  Design a collection of tables that satisfies 2nf but not 3nf

Using the FD list in problem 1, identify the FDs that violate 2NF. Using knowledge of the FDs that violate 2NF, design a collection of tables that satisfies 2NF but not 3NF.

  Determine the fact for the star schema

They also want to know if registration trends are different for female students compared to male students. Here is the schemaView in a new windowfor the operational database. Determine the fact for the star schema using the information provided a..

  Dba denormalized products database to enhance performance

The DBA denormalized some of the data in Premiere Products database to enhance performance, and one of the resulting tables is following.

  Produce a distributed data design for enterprise

Produce a distributed data design for this enterprise. Show data fragmentation/partitioning and replication for each regional database location. Indicate what attributes are in each fragment

  Do explain the process of normalization

Explain the context in which Normalization is used?

  Develop a database to house college student

Develop a database to house College student and schedule information. The data model should contain the following minimum data structures.

  Build a home inventory database based on the structure

Build one simple one page report, your choice. As an example, Contents of the Family Room. Again title the report accordingly.

  Highest average mark

Write a program to calculate and store the average obtained by 20 pupils in 7 subjects. Output the pupil that made the highest average mark in addition to those pupils making 50 marks and over.

  Create a fully attributed loagical data model diagram

Create Conceptual Schema Diagram. Create a fully attributed Loagical Data Model Diagram. Create the SQL script that will generate atleast 4 tables in the data model that you have created.

  Define the digital divide and the impact of the olpc

The Digital Divide and the impact of the OLPC initiative. Write a 1 page summary for each article and include the proper citations.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd