Estimates of the percentage of body fat

Assignment Help Basic Statistics
Reference no: EM131296586

Stat Assignment -

1. The bodyfat data set contains estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for men. Accurate measurement of body fat is inconvenient and costly (see preamble of the data set) and it is desirable to have easy methods of estimating body fat that are not inconvenient or costly.

A variety of popular health books suggest that the readers assess their health at least in part by estimating their percentage of body fat In Bailey (1994), for instance, the reader can estimate body fat from tables using their age and various skin-fold measurements obtained by using a caliper. Other texts give predictive equations for body fat using body circumference measurements; e.g. abdominal circumference and/or skin-fold measurements.

The file bodyfat contains introductory information that you should delete in order to import the data in R. The variables provided in the dataset from left to right are:

-Density determined from underwater weighing

-Percent body fat from Sir's (1956) equation

-Age (years)

-Weight (lbs)

-Height (inches)

-Neck circumference (cm)

-Chest circumference (cm)

-Abdomen circumference (cm)

-Hip circumference (cm)

-Thigh circumference (cm)

-Knee circumference (cm)

-Ankle circumference (cm)

-Biceps extended circumference (cm)

-Forearm circumference (cm)

-Wrist circumference (cm)

Analyze these data to produce predictive equations for lean body weight using multiple linear regression (model selection), regression trees and additive models (model selection). Build your model using the first 143 of the 252 cases and the rest to assess the predictive ability of your models. Do the three methods use the same variables to make the prediction?

2. Classification: When a bank receives a loan application, based on the applicants profile the bank has to make a decision regarding whether to go ahead with the loan approval or not. Two types of risks are associated with the bank's decision:

-If the applicant is a good credit risk, i.e. is likely to repay the loan, then not approving the loan to the person results in a loss of business to the bank

-If the applicant is a bad credit risk, i.e. is not likely to repay the loan, then approving the loan to the person results in a financial loss to the bank

Objective of Analysis: Minimization of risk and maximization of profit on behalf of the bank.

To minimize loss from the banks perspective, the bank needs a decision rule regarding who to give approval of the loan and who not to. An applicant's demographic and socio-economic profiles are considered by loan managers before a decision is taken regarding his/her loan application.

The German Credit Data contain data on 20 variables and the classification whether an applicant is considered a Good or a Bad credit risk for 1000 loan applicants. The response is binary (Good credit risk or Bad, Creditability = 1 if credit worthy and 0 otherwise). A predictive model developed on these data is expected to provide a bank manager guidance for making a decision whether to approve a loan to a prospective applicant based on his/her profiles.

Build your classification models using the training data (Training50.csv), all the other variables as predictors and

1. Logistic regression

2. Discriminant Analysis

3. Classification Trees

Assess the predictive accuracy of your models using the test data (Test.csv). Which method results in the model with best predictive ability?

Use the following predictors in your analysis:

1. Account Balance: No account (1), None (No balance) (2), Some Balance (3)

2. Payment Status: Some Problems (1), Paid Up (2), No Problems (in this bank) (3)

3. Savings/Stock Value: None, Below 100 DM, [100, 1000] DM, Above 1000 DM

4. Employment Length: Below 1 year (including unemployed), [1, 4), [4, 7), Above 7

5. Sex/Marital Status: Male Divorced/Single, Male Married/Widowed, Female

6. No of Credits at this bank: 1, More than 1

7. Guarantor: None, Yes

8. Concurrent Credits: Other Banks or Dept Stores, None

9. Purpose of Credit: New car, Used car, Home Related, Other

Attachment:- Assignment.rar

Reference no: EM131296586

Questions Cloud

Devolope and depend more on industrial cities : Assaignment is to give new ideas in these two pages for new and strong ways for the Saudi government to dont fully depend on only oil. Devolope and depend more on industrial cities, like Aljubail industrial city.
How does understanding a countrys ability : How does understanding a country's ability to generate, transport, and sustain transportation forces contribute to understanding its national power?
How the given concept is exemplified in the news story : Thirdly, you will write a short (750-1000 words) Discussion Post that gives a definition of the concept, explains how this concept is exemplified in the news story, and includes a link or directions to view the story you've chosen.
What does elasticity of substitution illustrate : What does elasticity of substitution illustrate? What two factors affect its magnitude?
Estimates of the percentage of body fat : Stat 6242 Assignment. The bodyfat data set contains estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for men
Buying or price acceptance decisions : How does convenience impact your buying or price acceptance decisions? Please provide a detailed, in your own words explanation.
Make a dynamic web page : CO539 Assessment  - Food Hygiene Ratings - For this assessment, you are going to use JQuery and AJAX to make a dynamic web page that communicates with two server-side scripts that we have developed for you.
Write a paper about demand elasticity on transportation cost : Write a paper about demand elasticity on transportation cost.
Cash plus checking account balances : Assume you have $100 in cash, $500 in your checking account, and $2,000 in savings. According to the M1 definition (cash plus checking account balances) the amount of money you have is?

Reviews

len1296586

12/1/2016 2:25:36 AM

Please check attached file for instructions. I just want to make sure that you will be providing the full assignment details, including the original R code/R markdown file and the PDF output. The bodyfat data set contains estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for men. Accurate measurement of body fat is inconvenient and costly.

Write a Review

Basic Statistics Questions & Answers

  Statistics-probability assignment

MATH1550H: Assignment:  Question:  A word is selected at random from the following poem of Persian poet and mathematician Omar Khayyam (1048-1131), translated by English poet Edward Fitzgerald (1808-1883). Find the expected value of the length of th..

  What is the least number

MATH1550H: Assignment:  Question:     what is the least number of applicants that should be interviewed so as to have at least 50% chance of finding one such secretary?

  Determine the value of k

MATH1550H: Assignment:  Question:     Experience shows that X, the number of customers entering a post office during any period of time t, is a random variable the probability mass function of which is of the form

  What is the probability

MATH1550H: Assignment:Questions: (Genetics) What is the probability that at most two of the offspring are aa?

  Binomial distributions

MATH1550H: Assignment:  Questions:  Let’s assume the department of Mathematics of Trent University has 11 faculty members. For i = 0; 1; 2; 3; find pi, the probability that i of them were born on Canada Day using the binomial distributions.

  Caselet on mcdonald’s vs. burger king - waiting time

Caselet on McDonald’s vs. Burger King - Waiting time

  Generate descriptive statistics

Generate descriptive statistics. Create a stem-and-leaf plot of the data and box plot of the data.

  Sampling variability and standard error

Problems on Sampling Variability and Standard Error and Confidence Intervals

  Estimate the population mean

Estimate the population mean

  Conduct a marketing experiment

Conduct a marketing experiment in which students are to taste one of two different brands of soft drink

  Find out the probability

Find out the probability

  Linear programming models

LINEAR PROGRAMMING MODELS

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd