Process of preparing your dataset for analysis

Reference no: EM133499551

Challenge: When preparing your dataset for analysis, you will often need to define primary keys (PK) and foreign keys (FK). One such situation occurs when working with state and county data. One table may have Census info and a second table may have area (size) info. Fortunately, the government has realized that economic analyses of all kinds need unique identifiers. These identifiers are called "Federal Information Processing Series" codes, or "FIPS" codes.

Problem: Not all data sets will have this info in a single column. You may have to create a pseudo key first and THEN look up the correct column (Task 2).
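As a minimal sketch of the pseudo-key idea, the snippet below combines a county name and a state name into one normalized lookup key. The helper name `make_key`, the `|` separator, and the lowercase/strip normalization are assumptions made for illustration; the assignment does not prescribe them.

```python
def make_key(county: str, state: str) -> str:
    """Build a normalized 'county|state' pseudo key (hypothetical helper)."""
    return f"{county.strip().lower()}|{state.strip().lower()}"


# Two counties share a name, but the keys differ once the state is included.
rows = [
    {"County": "Orange County", "State": "California"},
    {"County": " Orange County", "State": "Florida "},  # stray spaces are stripped
]
keys = [make_key(r["County"], r["State"]) for r in rows]
```

Normalizing case and whitespace before building the key reduces the number of misses caused by purely cosmetic differences between the two files.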

TASK 1: (done with SQL earlier. Using a CSV source here)

You are given two data sets, both CSV. These have a 1:1 relationship. You are to create a new column called StateCode and write the result to a new table called mygateID_FIPS in YOUR database. The state code comes from "CSV_states-in-us". The state NAME is in both files.
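One possible way to approach Task 1 is sketched below using only the Python standard library. The column names `State` and `Value`, the inline sample rows, and the in-memory SQLite database are all assumptions standing in for the real CSV files and your own database; only `StateCode` and `mygateID_FIPS` come from the task text.

```python
import csv
import io
import sqlite3

# Hypothetical stand-ins for the two CSV files; real column names may differ.
states_csv = io.StringIO("State,StateCode\nAlabama,01\nAlaska,02\n")
data_csv = io.StringIO("State,Value\nAlaska,10\nAlabama,20\n")

# The relationship is 1:1 on state name, so a dict works as the lookup table.
code_by_state = {
    r["State"].strip(): r["StateCode"] for r in csv.DictReader(states_csv)
}

# Add the StateCode column; None marks a state name with no match.
rows = [
    (r["State"], r["Value"], code_by_state.get(r["State"].strip()))
    for r in csv.DictReader(data_csv)
]

conn = sqlite3.connect(":memory:")  # stand-in for your own database
conn.execute(
    "CREATE TABLE mygateID_FIPS (State TEXT, Value TEXT, StateCode TEXT)"
)
conn.executemany("INSERT INTO mygateID_FIPS VALUES (?, ?, ?)", rows)
conn.commit()

result = conn.execute(
    "SELECT State, StateCode FROM mygateID_FIPS ORDER BY State"
).fetchall()
```

The code column is stored as TEXT on purpose: if the state code is a zero-padded value such as `01`, a numeric column would drop the leading zero.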

TASK 2:

  1. You are given two CSV data sets: CSV_census and CSV_FIPS. Your job is to append the FIPS code column to the data in CSV_census and write the resulting 4 columns (County, state, MHI and FIPS_Code) to a new table in your database called mygateID_CensusFIPS. Since county names may not be unique across the US, you will have to combine the county name and state name to locate the appropriate FIPS code.
  2. You are bound to miss some. Don't worry about that for this assignment. HOWEVER, you are required to analyze the result and explain what problems you ran into: (A) Did it get all the codes correctly? (B) What got skipped? (C) How would you fix these?
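The two steps of Task 2 could be sketched as follows, again with the standard library only. The layout of both CSV files, the `make_key` helper, and the in-memory SQLite database are assumptions; the MHI values (1, 2, 3) are placeholders, not real figures. One census row deliberately omits the "County" suffix to show how a miss appears in the result.

```python
import csv
import io
import sqlite3


def make_key(county, state):
    """Normalized (county, state) pseudo key (hypothetical helper)."""
    return (county.strip().lower(), state.strip().lower())


# Hypothetical stand-ins for CSV_census and CSV_FIPS.
census_csv = io.StringIO(
    "County,state,MHI\n"
    "Orange County,California,1\n"
    "Orange County,Florida,2\n"
    "Autauga,Alabama,3\n"  # no "County" suffix, so this row will not match
)
fips_csv = io.StringIO(
    "County,state,FIPS_Code\n"
    "Orange County,California,06059\n"
    "Orange County,Florida,12095\n"
    "Autauga County,Alabama,01001\n"
)

fips_by_key = {
    make_key(r["County"], r["state"]): r["FIPS_Code"]
    for r in csv.DictReader(fips_csv)
}

# Append FIPS_Code to every census row; None marks a skipped lookup.
rows = [
    (r["County"], r["state"], r["MHI"],
     fips_by_key.get(make_key(r["County"], r["state"])))
    for r in csv.DictReader(census_csv)
]

conn = sqlite3.connect(":memory:")  # stand-in for your own database
conn.execute(
    "CREATE TABLE mygateID_CensusFIPS "
    "(County TEXT, state TEXT, MHI TEXT, FIPS_Code TEXT)"
)
conn.executemany("INSERT INTO mygateID_CensusFIPS VALUES (?, ?, ?, ?)", rows)
conn.commit()

# Part 2 of the task: list what got skipped so it can be analyzed.
missing = conn.execute(
    "SELECT County, state FROM mygateID_CensusFIPS WHERE FIPS_Code IS NULL"
).fetchall()
```

Querying for NULL codes gives a concrete list for question (B). Inspecting that list usually suggests the fix for question (C), for example stripping or adding suffixes such as "County" before building the key.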
