Estimates of the percentage of body fat

Reference no: EM131296586

Stat 6242 Assignment

1. The bodyfat data set contains estimates of the percentage of body fat determined by underwater weighing, together with various body circumference measurements, for men. Accurate measurement of body fat is inconvenient and costly (see the preamble of the data set), so it is desirable to have simple methods of estimating body fat that are neither inconvenient nor costly.

A variety of popular health books suggest that readers assess their health at least in part by estimating their percentage of body fat. In Bailey (1994), for instance, the reader can estimate body fat from tables using their age and various skin-fold measurements obtained with a caliper. Other texts give predictive equations for body fat using body circumference measurements (e.g. abdominal circumference) and/or skin-fold measurements.

The file bodyfat begins with introductory information that you should delete before importing the data into R. The variables provided in the data set, from left to right, are:

-Density determined from underwater weighing

-Percent body fat from Siri's (1956) equation

-Age (years)

-Weight (lbs)

-Height (inches)

-Neck circumference (cm)

-Chest circumference (cm)

-Abdomen circumference (cm)

-Hip circumference (cm)

-Thigh circumference (cm)

-Knee circumference (cm)

-Ankle circumference (cm)

-Biceps extended circumference (cm)

-Forearm circumference (cm)

-Wrist circumference (cm)
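
The first two variables are linked: Siri's (1956) equation converts body density D (in g/cm^3) to percent body fat as 495/D - 450. The assignment itself is to be done in R; the short Python sketch below is only an illustration of that conversion. The function name and the example density are hypothetical, not part of the assignment files.

```python
def siri_percent_body_fat(density):
    """Percent body fat from body density (g/cm^3) using Siri's equation.

    Siri (1956): percent fat = 495 / density - 450.
    """
    if density <= 0:
        raise ValueError("density must be positive")
    return 495.0 / density - 450.0


# Illustrative density value (an assumption, chosen only to show the call).
example_density = 1.0708
print(round(siri_percent_body_fat(example_density), 1))
```

Comparing the computed value with the second column of the file is a quick sanity check that the data were imported with the columns in the expected order.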

Analyze these data to produce predictive equations for lean body weight using multiple linear regression (with model selection), regression trees, and additive models (with model selection). Build your models using the first 143 of the 252 cases, and use the remaining 109 cases to assess the predictive ability of your models. Do the three methods use the same variables to make the prediction?
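
The data preparation implied by this task can be sketched as follows. This is a minimal Python illustration, not the required R solution. It assumes that lean body weight is defined as weight x (1 - percent fat / 100), which the assignment text does not state explicitly, and it uses root mean squared error as one possible measure of predictive ability. All function names are hypothetical.

```python
import math

N_TRAIN = 143  # number of leading cases used for model building (from the task)


def split_train_test(rows, n_train=N_TRAIN):
    """Return the first n_train rows as training data and the rest as test data."""
    return rows[:n_train], rows[n_train:]


def lean_body_weight(weight_lbs, percent_fat):
    """Assumed definition: body weight minus the weight of fat."""
    return weight_lbs * (1.0 - percent_fat / 100.0)


def rmse(actual, predicted):
    """Root mean squared prediction error on held-out cases."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Placeholder rows standing in for the 252 cases of the real file.
rows = list(range(252))
train, test = split_train_test(rows)
print(len(train), len(test))
```

The same test-set error would be computed for each of the three fitted models so that they can be compared on equal terms.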

2. Classification: When a bank receives a loan application, it has to decide, based on the applicant's profile, whether or not to approve the loan. Two types of risk are associated with the bank's decision:

-If the applicant is a good credit risk, i.e. is likely to repay the loan, then not approving the loan results in a loss of business to the bank.

-If the applicant is a bad credit risk, i.e. is not likely to repay the loan, then approving the loan results in a financial loss to the bank.

Objective of Analysis: Minimization of risk and maximization of profit on behalf of the bank.

To minimize loss from the bank's perspective, the bank needs a decision rule for whom to approve for a loan and whom to decline. Loan managers consider an applicant's demographic and socio-economic profile before making a decision on his/her loan application.
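
One way to turn the two risks into a decision rule is to compare expected losses: approve the loan only when the expected loss from approving is smaller than the expected loss from rejecting. The Python sketch below illustrates the idea. The cost values (a bad loan costing five times the profit lost on a rejected good applicant) and the function names are hypothetical assumptions, not figures given in the assignment.

```python
def expected_loss_if_approved(p_bad, loss_bad_loan):
    """Approving only loses money when the applicant turns out to be a bad risk."""
    return p_bad * loss_bad_loan


def expected_loss_if_rejected(p_bad, lost_profit_good_loan):
    """Rejecting only loses business when the applicant was a good risk."""
    return (1.0 - p_bad) * lost_profit_good_loan


def approve(p_bad, loss_bad_loan=5.0, lost_profit_good_loan=1.0):
    """Approve when approving has the smaller expected loss.

    The default costs are illustrative assumptions (5:1 ratio).
    """
    return (expected_loss_if_approved(p_bad, loss_bad_loan)
            < expected_loss_if_rejected(p_bad, lost_profit_good_loan))


print(approve(0.10))  # applicant with a 10% estimated chance of default
print(approve(0.30))  # applicant with a 30% estimated chance of default
```

With these assumed costs the rule approves only applicants whose estimated probability of being a bad risk is below 1/6, which is stricter than the usual 0.5 cut-off.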

The German Credit Data contain 20 variables for each of 1000 loan applicants, together with a classification of whether the applicant is considered a Good or a Bad credit risk. The response is binary (Creditability = 1 if the applicant is credit worthy and 0 otherwise). A predictive model developed on these data is expected to give a bank manager guidance on whether to approve a loan to a prospective applicant based on his/her profile.

Build your classification models using the training data (Training50.csv), with all the other variables as predictors, using each of the following methods:

1. Logistic regression

2. Discriminant Analysis

3. Classification Trees

Assess the predictive accuracy of your models using the test data (Test.csv). Which method results in the model with best predictive ability?
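
Predictive accuracy on the test data is usually summarized with a confusion matrix. The Python sketch below is an illustration only (the assignment expects R). It treats Creditability = 1 as the positive class, and the label vectors are made-up placeholders rather than real model output.

```python
def confusion_counts(actual, predicted):
    """Count true/false positives and negatives, with 1 = credit worthy."""
    counts = {"TP": 0, "TN": 0, "FP": 0, "FN": 0}
    for a, p in zip(actual, predicted):
        if a == 1 and p == 1:
            counts["TP"] += 1
        elif a == 0 and p == 0:
            counts["TN"] += 1
        elif a == 0 and p == 1:
            counts["FP"] += 1  # bad risk approved: financial loss
        else:
            counts["FN"] += 1  # good risk rejected: lost business
    return counts


def accuracy(counts):
    """Share of test cases classified correctly."""
    total = sum(counts.values())
    return (counts["TP"] + counts["TN"]) / total


# Placeholder labels standing in for Test.csv and a model's predictions.
actual = [1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0]
counts = confusion_counts(actual, predicted)
print(counts, accuracy(counts))
```

Reporting the false positives and false negatives separately matters here because the two kinds of error cost the bank different amounts.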

Use the following predictors in your analysis:

1. Account Balance: No account (1), None (No balance) (2), Some Balance (3)

2. Payment Status: Some Problems (1), Paid Up (2), No Problems (in this bank) (3)

3. Savings/Stock Value: None, Below 100 DM, [100, 1000] DM, Above 1000 DM

4. Employment Length: Below 1 year (including unemployed), [1, 4) years, [4, 7) years, Above 7 years

5. Sex/Marital Status: Male Divorced/Single, Male Married/Widowed, Female

6. No. of Credits at this bank: 1, More than 1

7. Guarantor: None, Yes

8. Concurrent Credits: Other Banks or Dept. Stores, None

9. Purpose of Credit: New car, Used car, Home Related, Other
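
All nine predictors are categorical, so they need to be treated as factors rather than numbers; logistic regression and discriminant analysis then work with one 0/1 indicator per non-baseline level. The Python sketch below shows this dummy coding for Purpose of Credit. It is illustrative only: the helper name and the choice of the first level as baseline are assumptions.

```python
PURPOSE_LEVELS = ["New car", "Used car", "Home Related", "Other"]


def dummy_code(value, levels, baseline=None):
    """Return 0/1 indicators for every level except the baseline level."""
    if value not in levels:
        raise ValueError(f"unknown level: {value!r}")
    if baseline is None:
        baseline = levels[0]  # assumed convention: first level is the reference
    return {f"is_{level}": int(value == level)
            for level in levels if level != baseline}


print(dummy_code("Used car", PURPOSE_LEVELS))
print(dummy_code("New car", PURPOSE_LEVELS))
```

An applicant at the baseline level gets zeros for every indicator, so each fitted coefficient is read as a contrast with that baseline.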

Attachment: Assignment.rar


Reviews

len1296586

12/1/2016 2:25:36 AM

Please check the attached file for instructions. I just want to make sure that you will be providing the full assignment details, including the original R code/R Markdown file and the PDF output.


All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd