Exploratory analysis and classification

Assignment Help Accounting Basics
Reference no: EM133228055

Part 2: Exploratory analysis and classification

Background

You are required to engage in exploratory analysis and classification in relation to the dataset Diabetes (CSV 24 KB) 

compiled by the US National Institute of Diabetes and Digestive and Kidney Disease.

The object of the dataset is to diagnostically predict whether a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old and of Pima Indian heritage.

The dataset consists of several medical predictor variables and one target variable, Outcome. Predictor variables include the number of pregnancies the patient has had, her BMI, insulin level, age, and so on.

Step 1:

Complete the following

  1. Using the KNIME platform examine Summary Statistics. 
  2. Build a Decision Tree Workflow in KNIME. 
  3. Make a validation set: Split you dataset into two parts-'Train' and 'Test'. 
  4. Train and build a Decision Tree Classification model for your dataset. 
  5. Evaluate the Performance of your Decision Tree Model using the Confusion Matrix and Determine Accuracy rate. 

Step 2:

Having completed tasks 1-5 above, make a report built on your analysis and classification. The report must be completed in a Word document. The report must contain the following:

  1. The summary statistics of your dataset, including:
    1. Validation: The Confusion Matrix results for your Train decision tree model and its interpretation. 
    2. A list of rules and their explanations (e.g. if condition1 and condition2 and condition3 then outcome). 
  2. The KNIME Workflows file for your project. 

Reference no: EM133228055

Questions Cloud

Discuss the relationships among stigma : Compare the manifestations of HIV/AIDS stigma in resource-rich and resource-limited countries and regions and discuss the relationships among stigma
Discuss appropriateness of simply adding classes of assets : Discuss the appropriateness ofsimply adding all classes of assets, even though they have been measured on a number of different valuation bases?
What esg factors are in the context of a business : Discuss what ESG factors are in the context of a business. Explain why (if indeed they are) they are important considerations in the running of a business.
Describe the roles of your local and state health department : Describe the roles of your local and state health departments in the accomplishment of healthcare promotion and goals. How do their goals of health promotion
Exploratory analysis and classification : You are required to engage in exploratory analysis and classification in relation to the dataset Diabetes (CSV 24 KB)
Concept of net neutrality : Discusses the concept of net neutrality as simply that the Internet is free and open to everyone.
Explain the pathophysiology of diabetic ketoacidosis : Explain the pathophysiology of Diabetic Ketoacidosis including how the condition progresses from hyperglycaemia to life-threatening ketoacidosis and fluid
Socio-cultural-economic-technological and regulatory : Identify one item for each of the 5 environmental scan elements such as demographic, socio cultural, economic, technological and regulatory.
Provide an instance wherein you can see that clients : Provide an instance wherein you can see that clients are applying what is given or desiminated during the health teachings

Reviews

Write a Review

Accounting Basics Questions & Answers

  How much control does fed have over this longer real rate

Hubbard argues that the Fed can control the Fed funds rate, but the interest rate that is important for the economy is a longer-term real rate of interest.   How much control does the Fed have over this longer real rate?

  Coures:- fundamental accounting principles

Coures:- Fundamental Accounting Principles: - Explain the goals and uses of special journals.

  Accounting problems

Accounting problems,  Draw a detailed timeline incorporating the dividends, calculate    the exact Payback Period  b)   the discounted Payback Period. the IRR,  the NPV, the Profitability Index.

  Write a report on internal controls

Write a report on Internal Controls

  Prepare the bank reconciliation for company

Prepare the bank reconciliation for company.

  Cost-benefit analysis

Create a cost-benefit analysis to evaluate the project

  Theory of interest

Theory of Interest: NPV, IRR, Nominal and Real, Amortization, Sinking Fund, TWRR, DWRR

  Liquidity and profitability

Distinguish between liquidity and profitability.

  What is the expected risk premium on the portfolio

Your Corp, Inc. has a corporate tax rate of 35%. Please calculate their after tax cost of debt expressed as a percentage. Your Corp, Inc. has several outstanding bond issues all of which require semiannual interest payments.

  Simple interest and compound interest

Simple Interest, Compound interest, discount rate, force of interest, AV, PV

  Capm and venture capital

CAPM and Venture Capital

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd