Creating sql queries relating to loan defaults

Assignment Help Other Subject
Reference no: EM133757460

Data Acquisition and Management

Assessment - Practical SQL Coding and Report

Your Task

You are required to:
Complete each of the SQL queries, both provided and self-constructed.
Take a screenshot of each of the queries and results - copy and paste these into your report.
In your report, explain and summarise the business insights found from executing the SQL queries and the data visualisations.
Submit your query script file (.sql) with all your queries using the SQL link within the assessment time limit of 2 hours, i.e. In class.

Background
The SQL Server contains data on Peer-to-Peer Lending. Connection details will be provided on the day of the assessment.

Section #1: Server Connection and Simple SQL Queries

Part 1: Updating the table to add a column
Details will be provided in class.

Part 2: Creating a SELECT Query
Details will be provided in class.

Section #2: SQL for Business Insights
Part 1: Creating and analyzing SQL Queries relating to Loan and Funded Amounts
Take screenshots of the above queries, its result and paste same in your report for interpretation.

Part 2: Creating SQL Queries relating to Loan Terms
Take screenshots of the query and its result and paste that in your report for interpretation.

Part 3: Creating SQL Queries relating to Interest Rates
Create an SQL statement that answers the following business questions:
Take screenshots of the query and its result and paste that in your report for interpretation.

Part 4: Creating SQL Queries relating to Loan Statuses
Take screenshots of the query and its result and paste that in your report for interpretation.

Part 5: Creating SQL Queries relating to Loan Grades
Take screenshots of the query and its result and paste that in your report for interpretation.

Part 6: Creating SQL Queries relating to Loan Defaults/Delinquencies
Take screenshots of the query and its result and paste that in your report for interpretation.

Section #3: SQL and Data Visualization for Business Insights
Part 1: Data Acquisition and Data Visualisations
Create a SELECT statement that returns all rows and columns of data.
Export and save the results of the query as CSV file.
Import the CSV file in either PowerBI or Tableau.
Using the above CSV file, carefully examine the columns that offer potential for creating at least three distinct visualizations.
These visualizations should aim to uncover valuable business insights to assist the manager in evaluating and approving loan applications more effectively. For instance, one visualization could analyse customer demographics to identify those most prone to default on their loans.
Take screenshots of each of the visualisations and paste that in your report for interpretation.
The Lending Club Dataset

Hands-on SQL
• During our hands-on SQL workshops, we practiced on the Lending Club 5K dataset, that is, the data was for 5000 customers.
• For Assessment #2, your lecturer will provide you in class with a similar dataset in week 8. This contains data on 10,000 customers.

Lending Club Dataset for Assessment #2
• For the assessment, the bank loan data dataset will have 10000 customers (10K).This dataset will be given to you in class in week 8 by your facilitator.

Your SQL File to Upload

For submission, your SQL script must be labelled in the following manner:
your_name_studentID_A2.sql

You must also repeat this inside your SQL script as an SQL comment.

Inside your SQL script, write the relevant SQL queries under the proper headings in the script.

This script template will be emailed/givento you by your facilitator in Week 8 on commencement of class. Please rename the downloaded file as follows: your_name_studentID_A2.sql
Lending Club Data Dictionary
The Lending Club dataset has the following features (columns of data):
• ID (id): this is the ID of the loan
• Member ID (member_id): this is the ID of the customer (the member)
• Loan Amount (loan_amnt): the amount the customer applied for
• Funded Amount (funded_amnt): the amount funded to the customer
• Term (term): the term of the loan i.e., 36 months or 60 months
• Interest Rate (int_rate):the interest rate charged on the loan
• Instalment (installment): the amount the customer regularly pays
• Grade (grade): Lending Club classification of the class of loan (type)
• Sub-Grade (sub_grade): the sub-grade classification
• Employee Length (emp_length): length of time customer has been employed
• Home Ownership (home_ownership): indicates the living situation of the customer - RENT, MORTGAGE, OWN, OTHER, and NONE
• Annual Income (annual_inc): the annual income of the customer
• Verification Status (verification_status): indicates whether the income source has been verified
• Issue Date (issue_d): when the loan was issued to the customer

• Loan Status (loan_status):
1. Current - the loan is currently active
2. Charged Off - loans which have been written-off i.e., a loss to Lending Club
3. Issued - loan has been issued to the customer
4. Fully Paid - indicates that the customer has completely paid off their loan
5. In Grace Period - customer has been granted a grace period for repayment
6. Late (31-120 days) - indicates that the customer is late with repayments
7. Late (15-30 days) - indicates that the customer is late with repayments
8. Does not meet the credit policy. Status: Charged Off
9. Does not meet the credit policy. Status: Fully Paid
10. Default - indicates that the customer has defaulted on their loan

Notes: loan_status column is a text data type so be very careful how you use SQL to query for loan statuses.

• Purpose (purpose): the purpose of the loan i.e., the reason for the loan application
• Address State (addr_state): the American state where the customer lives
• DTI (dti): A ratio calculated using the borrower's total monthly debt payments on the total debt obligations, excluding mortgage and the requested LC loan, divided by the borrower's self-reported monthly income
• Earliest Credit Line (earliest_cr_line): the month the customer earliest reported credit line was opened
• Outstanding Principal (out_prncp): remaining principal amount on the loan
• Total Payment (total_pymnt): payments received to date for the amount funded
• Recoveries (recoveries): post ‘Charged Off' amount recovered from customer
• Last Payment Date (last_paymnt_d): the last time the customer made a payment on the loan
• Last Payment Amount (last_payment_amnt): the amount the customer last paid on the loan
• Application Type (application_type): either INDIVIDUAL or JOINT

HINTS:
• For the SQL query, think carefully about the data type i.e., text or numerical?
• For visualisations: make sure that each column of data imported into PowerBI, or Tableau,have the correct data type.

• If the column data type is not correct, the visualisation may not be what you expected, and the results may influence the insights obtained. For example, the member_id is simply an ID, and so it makes no sense to add member ids together!

• For visualisations: carefully consider the chart type and combinations of columns of data to use for discovering the insights.

Attachment:- bank_loan_data.rar

Reference no: EM133757460

Questions Cloud

Hospitals post-acute care : Hospitals Post-Acute Care (LTC, Rehabilitation, or SNF) Ambulatory Care (Primary Care or Urgent Care) Behavioral Health (Rehabilitation, Outpatient
What are two examples of health care services : What are two examples of health care services covered by provincial or territorial health plans? What are two examples of health care services covered
Consider the impact of health care disparities : Discuss factors or barriers that limit access to health care or health care resources for individuals today. Consider the impact of health care disparities.
Environment and employment health : The most common health indicators are Environment and employment Health and disease Disease and disability Mortality
Creating sql queries relating to loan defaults : Carefully examine the columns that offer potential for creating at least three distinct visualizations - Complete each of the SQL queries, both provided
Shift to Value-Based Health Care Prior : Shift to Value-Based Health Care Prior to 1983, hospitals billed patients on cost-plus basis, charging for all supplies and resources used by individual patient
Reduces hospital costs : Which of the following represents a change in practice that reduces hospital costs? Same-day admissions Larger hospitals Formation of regional health authoritie
Which included neurologist-cardiologist-endocrinologist : Upon admission, the patient was evaluated by the medical team, which included a neurologist, cardiologist, and endocrinologist.
Child presents with sudden onset of barking cough : A 3-year old child presents with a sudden onset of barking cough, mild fever, and mild insperiatory stridor.

Reviews

Write a Review

 

Other Subject Questions & Answers

  Cross-cultural opportunities and conflicts in canada

Short Paper on Cross-cultural Opportunities and Conflicts in Canada.

  Sociology theory questions

Sociology are very fundamental in nature. Role strain and role constraint speak about the duties and responsibilities of the roles of people in society or in a group. A short theory about Darwin and Moths is also answered.

  A book review on unfaithful angels

This review will help the reader understand the social work profession through different concepts giving the glimpse of why the social work profession might have drifted away from its original purpose of serving the poor.

  Disorder paper: schizophrenia

Schizophrenia does not really have just one single cause. It is a possibility that this disorder could be inherited but not all doctors are sure.

  Individual assignment: two models handout and rubric

Individual Assignment : Two Models Handout and Rubric,    This paper will allow you to understand and evaluate two vastly different organizational models and to effectively communicate their differences.

  Developing strategic intent for toyota

The following report includes the description about the organization, its strategies, industry analysis in which it operates and its position in the industry.

  Gasoline powered passenger vehicles

In this study, we examine how gasoline price volatility and income of the consumers impacts consumer's demand for gasoline.

  An aspect of poverty in canada

Economics thesis undergrad 4th year paper to write. it should be about 22 pages in length, literature review, economic analysis and then data or cost benefit analysis.

  Ngn customer satisfaction qos indicator for 3g services

The paper aims to highlight the global trends in countries and regions where 3G has already been introduced and propose an implementation plan to the telecom operators of developing countries.

  Prepare a power point presentation

Prepare the power point presentation for the case: Santa Fe Independent School District

  Information literacy is important in this environment

Information literacy is critically important in this contemporary environment

  Associative property of multiplication

Write a definition for associative property of multiplication.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd