Task of generating some normalised data on product

Assignment Help Python Programming

Reference no: EM132277600

This assignment involves writing Python code to extract information about product details from data files and load them into a consistent SQL database. It is an example of an Extract-Transform-Load (ETL) task.

You have been given the task of generating some normalised data on product given some data files in different formats.

You are given:
• An HTML file that lists information about products, stock, and prices.
• A CSV spreadsheet containing information about the location of each product.
• A CSV spreadsheet containing detailed information about store locations. This spreadsheet relates the ID of the product with the ID of the location.

Your task is to read the data from all of these files, add it into an SQL database, and operate on the database to produce a report.
The schema for the SQL database is provided for you in the file database.py. You can run this file to create the database. Your code will then add data to it. Note that the HTML file lists the product ID as part of the HTML href attribute. This is the ID that needs to be used to populate the products table in the database.

REQUIRED OUTPUT
Your file main.py needs to generate a CSV file. The CSV file must contain the following fields:
• Description
• Price (including the currency symbol)
• Amount in stock
• Store location (all the location information in one string)

In addition, the report must be sorted in numerical ascending order by the stock price (ignore the currency symbol if different items use different currencies).

You will also submit the code you have written to solve this problem. Your code must use functions and every function must include a suitable docstring that describes what it does. Each function should implement a logical part of the overall ETL process.

The template main.py includes several functions that you must implement and they must pass unittests provided. In addition, you need to add the main code that generates the CSV file as specified above.

Reference no: EM132277600

Questions Cloud

Discuss different strategic resources : Discuss 3 different strategic resources that your assigned firm possesses.

Legitimate drug manufacturers out of business : Do you believe that counterfeit drugs and people that make them are putting legitimate drug manufacturers out of business?

Conversation about the difference between HMO and PPO : Why the abnormality was not found in his son earlier. This starts a conversation about the difference between an HMO and PPO. What are the differences?

Statement affirm an innovative organization : Explain what Einstein meant in this statement. How does this statement affirm an innovative organization? Provide specific examples.

Task of generating some normalised data on product : Python code to extract information about product details from data files and load them into a consistent SQL database

Describe the new structure that he put in place : Describe the new structure that he put in place at that time. What is this type of structure called? What are its advantages and disadvantages?

Describe three effective stress-management techniques : Describe three effective stress-management techniques. Personality Testing: Write175 to 260 wordsdescribing the main aspects of personality testing.

Commitment important in an organization : How are job satisfaction and organizational commitment important in an organization? Please explain.

Equity theory is motivational theory : Equity theory is a motivational theory that finds its roots and foundations.

User Account

All Pages