Draw the pie chart or line chart of data anomalies

Assignment Help Database Management System
Reference no: EM132145197

Q1. Given the following file for assignment worker.com, identify data anomalies that must be removed before data can be loaded in data warehouse.

Worker_assignment ← -----------------on course web site

File is available in SAKAI

Assignment_worker(assignment_no, assignment_date, emp_number, chg_hour,assigned_hour, charges)

Where assignment number is the number assigned to an assignment
Assignment_date is the date assignment started
Emp_number is the number of employee assigned to that assignment
Chg_hour is amount paid to that employee for that assignment
Assigned_hour is the hours assigned to that employee for that assignment
Charges are the Total charges for that employee for that assignment (this is calculated as Chg_hour*assigned_hour)

Rules:
- Assignment numbers always start with a letter followed by a 1 and are ALWAYS four characters long
ex: A123, Z178

- Emp No IS always 3 CHARCATER LONG

- An employee can not work more than 40 hour on a given project

Requirement:
Count (using EXCEL formulas -- IF, countif etc. as done in class) four types of errors:

- Missing data
- Incorrect Format
o To check length of empno--you can use LEN(cell address) to get length of item in that cell
o check for assignment number format (BONUS +1 points)
- Zero values
- Incorrect Calculations
o check for charges
charges= chg_hour*Assigned_Hour
o check for employee working more than 40 hours

Once counted

- Draw the pie chart or line chart of data anomalies and
- Discuss what errors can be corrected and how. (submit in WORD)

Must submit the EXCEL worksheet where errors are calculated and graph is drawn

Q2

Data integrity is a required feature of data warehouses. P & G is building a data warehouse and have run in data integration problems. They need to get data from 2 different users and combine them to maintain data integrity in their data warehouse.

The sources are:

Asia region
North American Region

Both region have data stored in different formats in two different files (employee_asia and emp_NA

Both tables are available in account Aggarwal as READ ONLY. You must create a copy in your account before using it.

Or

you can create your own tables.

SHOW ALL QUERIES AND OUTPUTS

1. CLEAN the data in required format (for gender, country of origin, job_class and seniority)
a. Employee gender should be standardized, i.e., male should be changed to m and female to f
b. Country should be spelled completely, i.e, USA should be spelled out as United States of America
c. Ceylon no longer exists, change the name to Sri Lanka
d. Name is one attribute in dimension table, combine name as last and first, example Bora (last) and Lakshmi (first) should be modified to Bora, Lakshmi
e. Calculate both job_class and seniority

2. Create CLEAN_ASIA table

3. Create CLEAN_NA table

4. Combine the two using UNION to create following table

EMPLOYEE_DIM (Employee Id, Employee_name, seniority, gender, country, job_class)

5. Show the contents and structure of EMPLOYEE_DIM table.
6. Give a count of male and female employees

Q3 Revise the data warehouse based on new requirements (same as what we did in class)

Attachment:- Assignment.rar

Attachment:- archive 2.zip

Verified Expert

The solution file is prepared in ms word created tables in oracle and excel file. In oracle created Assignment_worker table with following attributes assignment_no, assignment_date, emp_number, chg_hour,assigned_hour, charges and in excel found the count of Missing data, Incorrect Format, check length of empno, check for assignment number format and represented graph for number of Zero values, Incorrect Calculations,check for charges, charges and check for employee working more than 40 hours. And also discussed about errors can be corrected and how. We created two more tables in oracle are employee_Asia and employee_na and inserted data values then executed required quires and join two tables uisgn union query. Finally created new data model for two way aggregation on employee _dim table.

Reference no: EM132145197

Questions Cloud

What will be some of the challenges for you : In your role as a business psychology student, you are likely to work and study with people from many backgrounds. In 500 words (two typed pages).
What is the strategy of break talk : What is the strategy of Break Talk? Are they succeeding or failing? Why?
What are some of the risks associated with a low cost : What are some of the risks associated with a low cost leadership strategy? Provide one original example of a company that you believe employs this strategy
Create a one page goal setting plan : Goal Setting is an excellent behavioral strategy for exercise promotion and adherence. Create a one page Goal Setting Plan for yourself using information found.
Draw the pie chart or line chart of data anomalies : INSS 422 - Business Intelligence - identify data anomalies that must be removed before data can be loaded in data warehouse
What challenges will there be in researching this market : A market domain is defined as something (a product or service) that people need, but that is NOT currently available to them.
What are the characteristics of bakery industry : How competitive is the landscape, etc? Any mega trends affecting the bakery industry which will affect how the BreadTalk competes?
What are the ethical principles and implications : You have been called in to consult on cases that may require mandated treatment. After reviewing the PSY699 The ethics of mandated treatment scenarios.
Questions of the server acting as the model : Sonya, the manager of a restaurant, has just hired three servers. As part of their training program, she wants them to first learn by watching another

Reviews

inf2145197

11/23/2018 1:18:21 AM

Hi, first of all, thank you for processing my request but I need my order as soon as possible because I need to submit it so kindly complete the process and send it to me as soon as possible. Thank you. I was very happy with this assignment because it was done according to the standards set in the questions. Even if the topic was a bit difficult, it was completed with great precision. Thank you.

len2145197

10/19/2018 11:28:45 PM

INSS 422 Mid Term Exam Fall Business Intelligence You will submit two files • Word file with q1 and q2 • EXCEL file for q1 (show all work) MUST BE DONE INDIVIDUALLY READ THE INSTRUCTIONS CAREFULLY!!!!!!!! NO LATE SUBMISSION!!!!!!! DUE DATE: Oct 21st by 10pm in SAKAI Note: Please DO NOT write any comments with your submission. • This is an EXAM, I can NOT answer any questions. • Email me for any GENERAL clarifications • Exam is to be done individually • Honesty and integrity is expected, any dishonesty will result in 0 for the EXAM and further action will be taken for all involved in dishonesty. • All work should be done on the exam itself except EXCEL file. ORACLE queries and output should be embedded in the exam. Make sure to include all Oracle statements that you use (including how many rows selected etc). I will be going and checking your queries in ORACLE in your account

Write a Review

Database Management System Questions & Answers

  Knowledge and data warehousing

Design a dimensional model for analysing Purchases for Adventure Works Cycles and implement it as cubes using SQL Server Analysis Services. The AdventureWorks OLTP sample database is the data source for you BI analysis.

  Design a database schema

Design a Database schema

  Entity-relationship diagram

Create an entity-relationship diagram and design accompanying table layout using sound relational modeling practices and concepts.

  Implement a database of courses and students for a school

Implement a database of courses and students for a school.

  Prepare the e-r diagram for the movie database

Energy in the home, personal energy use and home energy efficiency and Efficient use of ‘waste' heat and renewable heat sources

  Design relation schemas for the entire database

Design relation schemas for the entire database.

  Prepare the relational schema for database

Prepare the relational schema for database

  Data modeling and normalization

Data Modeling and Normalization

  Use cases perform a requirements analysis for the case study

Use Cases Perform a requirements analysis for the Case Study

  Knowledge and data warehousing

Knowledge and Data Warehousing

  Stack and queue data structure

Identify and explain the differences between a stack and a queue data structure

  Practice on topic of normalization

Practice on topic of Normalization

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd