Create a new SSIS project

Reference no: EM133540436

Assignment: Business Intelligence and Data Mining

Title: Extraction, Transformation, Loading (ETL) with SQL Server Integration Services (SSIS)

Instructions:

I. Create a new SSIS project.

II. Create a new database called ETL_Data in SQL Server.

III. Create an SSIS package for each of the following tasks:

1. PersonBio - This package will export the data to the ETL_Data database from the source tables and columns listed below found in the AdventureWorks database:

Source Tables: Person.Person, Person.EmailAddress, Person.StateProvince, Person.PersonPhone, Person.BusinessEntityAddress, and Person.Address.

Source fields: FirstName, LastName, AddressLine1, AddressLine2, City, StateProvince.Name, EmailAddress, and PhoneNumber.

The new table will contain every employee's information from the tables above, regardless of whether they have a phone number or email address.
Name the new table PersonBio in your ETL_Data database.

Name the Source Assistant PersonBioSource, and the Destination Assistant PersonBioDestination.

Name the package PersonBio.
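The requirement to keep every person whether or not they have a phone number or email address points to outer joins rather than inner joins. The following is only a minimal sketch of that idea in Python, using an in-memory SQLite database as a stand-in for AdventureWorks. The three simplified tables, the sample rows, and the helper names make_sample_db and build_person_bio are assumptions made for illustration; the real package would join all six source tables in SQL Server.

```python
import sqlite3


def make_sample_db():
    """Build a tiny in-memory database (hypothetical sample data)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE Person (BusinessEntityID INTEGER PRIMARY KEY,
                             FirstName TEXT, LastName TEXT);
        CREATE TABLE EmailAddress (BusinessEntityID INTEGER, EmailAddress TEXT);
        CREATE TABLE PersonPhone (BusinessEntityID INTEGER, PhoneNumber TEXT);
        INSERT INTO Person VALUES (1, 'Ana', 'Lopez'), (2, 'Ben', 'Ito');
        INSERT INTO EmailAddress VALUES (1, 'ana@example.com');
        INSERT INTO PersonPhone VALUES (1, '555-0100');
        """
    )
    return conn


def build_person_bio(conn):
    """Return one row per person, keeping people with no email or phone."""
    query = """
        SELECT p.FirstName, p.LastName, e.EmailAddress, ph.PhoneNumber
        FROM Person AS p
        LEFT JOIN EmailAddress AS e
               ON e.BusinessEntityID = p.BusinessEntityID
        LEFT JOIN PersonPhone AS ph
               ON ph.BusinessEntityID = p.BusinessEntityID
        ORDER BY p.BusinessEntityID
    """
    return conn.execute(query).fetchall()


rows = build_person_bio(make_sample_db())
for row in rows:
    print(row)
```

In this sketch the second person has no email or phone row, yet still appears in the result with empty values in those columns. An inner join would have dropped that person.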

2. SplitByStateName - This package will split the data from the PersonBio table into different tables within the ETL_Data database based on the first letter of the StateProvince name.

You will create 5 new destination tables (StatesWith A, B, C, Null, and Others).

You may have to place the condition for the states with NULL value first, before specifying other conditions.
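The ordering hint above matters because conditions are evaluated top to bottom and a row goes to the first output whose condition matches. The routing logic can be sketched in Python as follows. This is an illustration of the decision order only, not SSIS expression syntax. The table names StatesWithA, StatesWithB, StatesWithC, StatesWithNull, and StatesWithOthers are one reading of the assignment, and the column key StateProvinceName and both function names are hypothetical.

```python
from collections import defaultdict


def route_by_state(state_name):
    """Return the destination table name for one row."""
    # The NULL check comes first. Treating blank strings as NULL
    # is an assumption made for this sketch.
    if state_name is None or not state_name.strip():
        return "StatesWithNull"
    first_letter = state_name.strip()[0].upper()
    if first_letter in ("A", "B", "C"):
        return "StatesWith" + first_letter
    # Everything that matched no earlier condition.
    return "StatesWithOthers"


def split_rows(rows):
    """Group row dictionaries by their destination table."""
    outputs = defaultdict(list)
    for row in rows:
        outputs[route_by_state(row.get("StateProvinceName"))].append(row)
    return dict(outputs)


sample = [
    {"StateProvinceName": "Alberta"},
    {"StateProvinceName": "California"},
    {"StateProvinceName": None},
    {"StateProvinceName": "Texas"},
]
result = split_rows(sample)
print(sorted(result))
```

Placing the NULL test first avoids evaluating a first-letter test against a missing value, which is the situation the assignment warns about.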

3. ProductSalesInfo - This package will calculate the sales amount and sales quarter for each product.

The data source for this package will be a query from the following tables: Production.Product, Production.ProductSubcategory, Production.ProductCategory, Sales.SalesOrderHeader, and Sales.SalesOrderDetail.

The query should show the following fields: Production.Product.Name, Production.ProductCategory.Name AS [CategoryName], Production.Product.ListPrice, Sales.SalesOrderHeader.OrderDate (only the orders after 2004), Sales.SalesOrderDetail.OrderQty.

Create two Derived Columns in the destination table. Name the first derived column SalesAmount. You can calculate the sales amount by multiplying ListPrice and OrderQty.

Name the second derived column SalesQtr. The data for this column should be extracted from the OrderDate field using a month function. You will need to build an "IF" statement around the month function that checks the month and assigns the quarter value. The conditions can be evaluated in this order: if the month of the date is > 9, then SalesQtr is 4th Qtr; if the month is > 6, then SalesQtr is 3rd Qtr; if the month is > 3, then SalesQtr is 2nd Qtr; for all other months, SalesQtr is 1st Qtr.

Name the output table ProductSalesInfo.
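The two derived columns described above can be sketched in Python to check the logic before building the expressions in SSIS. This is a minimal illustration rather than SSIS expression syntax. The function names and the exact label strings ("1st Qtr" through "4th Qtr") are assumptions, and Decimal is used here only to avoid floating-point rounding on prices.

```python
from datetime import date
from decimal import Decimal


def sales_amount(list_price, order_qty):
    """SalesAmount = ListPrice multiplied by OrderQty."""
    return list_price * order_qty


def sales_quarter(order_date):
    """Map the month of OrderDate to a quarter label.

    The checks run from the highest threshold down, so the first
    condition that matches decides the quarter.
    """
    month = order_date.month
    if month > 9:
        return "4th Qtr"
    if month > 6:
        return "3rd Qtr"
    if month > 3:
        return "2nd Qtr"
    return "1st Qtr"


print(sales_amount(Decimal("3.50"), 4))
print(sales_quarter(date(2005, 10, 1)))
```

The order of the checks is what makes the simple "greater than" comparisons work: a month of 11 also satisfies > 6 and > 3, but it is caught by the > 9 test first.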

4. SalesAggregate - This package will aggregate the data from the ProductSalesInfo table to show the total quantity and total sales amount for each product.

The data source for this package will be the ProductSalesInfo table.

Select these fields for the source query: Production.Product.Name, Production.ProductCategory.Name AS CategoryName, Sales.SalesOrderDetail.OrderQty, Sales.SalesOrderDetail.UnitPrice.

Create a derived column called SalesAmount by multiplying Price with Qty. After adding the derived column task, add an Aggregate task in the package. Aggregate the fields in such a way that for each product name total quantity and total sales amount are shown.

Name the output table SalesAggregate.

Using a Multicast task, export the data to a flat file and to a SQL Server table. Name the flat file SalesAggregate.txt and the SQL Server table SalesAggregate.
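The derive-then-aggregate step for this package amounts to grouping by product name and summing two values. A rough Python sketch of that grouping is shown below. The row keys Name, OrderQty, and UnitPrice follow the field list above, while the output keys TotalQty and TotalSalesAmount, the function name, and the sample rows are assumptions made for illustration.

```python
from collections import defaultdict
from decimal import Decimal


def aggregate_sales(rows):
    """Sum quantity and sales amount for each product name."""
    totals = defaultdict(
        lambda: {"TotalQty": 0, "TotalSalesAmount": Decimal("0")}
    )
    for row in rows:
        # Derived column: SalesAmount = price * quantity for this row.
        amount = row["UnitPrice"] * row["OrderQty"]
        entry = totals[row["Name"]]
        entry["TotalQty"] += row["OrderQty"]
        entry["TotalSalesAmount"] += amount
    return dict(totals)


sample_rows = [
    {"Name": "Road Bike", "OrderQty": 2, "UnitPrice": Decimal("100.00")},
    {"Name": "Road Bike", "OrderQty": 1, "UnitPrice": Decimal("100.00")},
    {"Name": "Helmet", "OrderQty": 3, "UnitPrice": Decimal("20.50")},
]
summary = aggregate_sales(sample_rows)
for name, values in summary.items():
    print(name, values["TotalQty"], values["TotalSalesAmount"])
```

In the package itself, the same aggregated rows would then be sent through the Multicast to both destinations, so the flat file and the table receive identical data.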

IV. Save all of the packages in the same project.

V. Truncate the destination tables in each package before running the package.

VI. Give the data flow tasks, data source components, destination components, and any other tasks or transformations meaningful names.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd