Draw the box-plots for age and and fat

Assignment Help Engineering Mathematics
Reference no: EM131035621

Problem 1:  This problem is an example of data preprocessing needed in a data mining process.  

Suppose that a hospital tested the age and body fat data for 18 randomly selected adults with the following results:

Age

23

23

27

27

39

41

47

49

50

%fat

9.5

26.5

7.8

17.8

31.4

25.9

27.4

27.2

31.2

Age

52

54

54

56

57

58

58

60

61

%fat

34.6

42.5

28.8

33.4

30.2

34.1

32.9

41.2

35.7

a. Draw the box-plots for age and %fat.  Interpret the distribution of the data

b. Normalize the two attributes based on z-score normalization.

c. Regardless of the original ranges of the variables, normalization techniques transform the data into new ranges that allow to compare and use variables on the same scales. What are the values ranges of the following normalization methods? Explain your answer.

i. Min-max normalization

ii. Z-score normalization

iii. Normalization by decimal scaling.

d. Draw a scatter-plot based on the two variables and interpret the relationship between the two variables.

e. Calculate the correlation coefficient. Are these two attributes positively or negatively correlated? Compute the covariance matrix.

Problem 2:  This problem is an example of data preprocessing needed in a data mining process.  

Suppose a group of 12 sales price records has been sorted as follows:

5, 10, 11, 13, 15, 35, 50,55,72,92,204,215

Partition them into bins by each of the following method, smooth the data and interpret the results:

a. equal-depth partitioning with 3 values per bin

b. equal-width partitioning with 3 bins

Problem 3 a) Figure 1 illustrates the plots for some data with respect to two variables: balance and employment status. If you have to select one of these two variables to classify the data into two classes (circle class and plus class), which one would you select? Is there any approach/criterion that you can use to support your selection? Explain your answer.

822_Figure.png

Figure 1: Data Plots for Problem 3.a.

b) For the data in Figure 2 with three variables and two classes: which variable you would choose to classify the data? Show all the steps of your calculations and interpret your answer.

139_Figure1.png

Figure 2: Data for Problem 3.b

Reference no: EM131035621

Questions Cloud

Recommend for the construction of this system : Which design strategy would you recommend for the construction of this system? Why?
Should the boom be fully retracted : The front wheels are free to roll. Do an equilibrium analysis to explain your answer.
Successively higher levels of debt : If a firm goes from zero debt to successively higher levels of debt, why would you expect its stock price to rise first, then hit a peak, and then begin to decline?
Has the researcher communicated clearly and fully : Did the article make an original contribution to the existing body of knowledge? Was the theoretical framework for the study adequate and appropriate?
Draw the box-plots for age and and fat : Draw the box-plots for age and %fat.  Interpret the distribution of the data and Normalize the two attributes based on z-score normalization
Dfs-files-directories and shares : From the first e-Activity, examine the key benefits afforded to an organization that utilizes Distributed File System (DFS) technologies.
Debt level that maximizes its stock price : Is the debt level that maximizes a firm's expected EPS the same as the debt level that maximizes its stock price? Explain.
Calculate profit margin and gross profit rate for company : Calculate the Profit Margin, and Gross profit rate for the company. Be sure to provide the formula you are using, show your calculations, and discuss your findings/results.
Analyzes the pros and cons of each : Your company has decided to open up a new comprehensive resort on a tropical island. Your manager (me) is working with corporate senior managers to determine how best to structure this new enterprise. analyzes the pros and cons of each, and describes..

Reviews

Write a Review

Engineering Mathematics Questions & Answers

  Use any numerical integration method to solve

Use any numerical integration method, but please show all work in detail.

  Managing ashland multicomm services1 hint let pi 002 as

managing ashland multicomm services1. hint let pi 0.02 as shown in the table for no free premium channels.a. px lt 3

  Determining the barrel yield and per barrel cost

Incoming crude can be processed by one of three methods. The per barrel yield and per barrel cost of each processing method are shown in the following table.

  What is lp and how is an lp problem defined

Our firm makes two products: Y and Z. Suppose that each unity of Y costs $10 and sells for $40. Each unit of Z costs $5 and sells for $25. If the firm's goal were to maximize profit, what is the approproiate objective function?

  What is the definition of a bound charge

Show that for any spherical distribution of charge, the field at radius r is the same as if all the charge inside the volume of radius r were concentrated at the center, and that outside of r were removed.

  Bivariate normal distribution - find a constant

bivariate normal distribution - Find a constant a such that P(3X1 -X2

  A standard normal probabilities based problems

Are the results enough evidence to conclude that the bottles are not filled adequately at the labeled amount of 6 ounces per bottle?

  Calculate the average sublimation flux

Air at 1 atm flows at a Reynolds number of 50,000 normal to a long, circular, 1-in.-diameter cylinder made of naphthalene. Using the physical properties of Example 3.14 for a temperature of 100oC, calculate the average sublimation flux in kmol/s-m..

  Problems requiring computations

For problems requiring computations, please ensure that your Excel file includes the associated cell computations and/or statistics output; this information is needed in order to receive full credit on these problems.

  Problem regarding the integration

Problem 1: Use the integration by part to evaluate the following: 1. ? esx sin xdx, where s is some complex number

  Percentage of americans who only use cell phones

Do not reject H0; the percentage of Americans who only use cell phones does differ from 23% Do not reject H0; the percentage of Americans who only use cell phones does not differ from 23%

  Determine the optimal product mix

Formulate a linear programming model to determine the optimal product mix that will maximize profit. Transform this model into standard form.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd