Reference no: EM132398217
Assignment
Question 1
(RCODE) Write the R-code to extract various nutritional values from different food varieties in the given nutrition XML document. If someone ate all the food items at once, plot a chart to compare it to the daily value for total-fat, saturated-fat, cholesterol, sodium, carb, fiber, protein.
DATA for this can be found in nutrional value xml file aattachment.
Question 2:
Find the correlation between the cholesterol values and carbohydrate values that you extracted in Question 1. Please submit the R code and the value of Pearson's product-moment correlation (NOTE: REFER TO QUESTION 1 (RCODE) question 6: Write an R-code for box plot to show the frequency of admitted students in UCBAdmissions data across various departments.
(This data is already installed in my Rstudio write the code for this one )
> ucba
Admit Gender Dept Freq
1 Admitted Male A 512
2 Rejected Male A 313
3 Admitted Female A 89
4 Rejected Female A 19
5 Admitted Male B 353
6 Rejected Male B 207
7 Admitted Female B 17
8 Rejected Female B 8
9 Admitted Male C 120
10 Rejected Male C 205
11 Admitted Female C 202
12 Rejected Female C 391
13 Admitted Male D 138
14 Rejected Male D 279
15 Admitted Female D 131
16 Rejected Female D 244
17 Admitted Male E 53
18 Rejected Male E 138
19 Admitted Female E 94
20 Rejected Female E 299
21 Admitted Male F 22
22 Rejected Male F 351
23 Admitted Female F 24
24 Rejected Female F 317
Question 3
To understand whether age affects the predictive risk by dose levels. and associated complication risk for one of the treatments (X). a gene expression profiling was conducted after sampling whole blood from few subjects. You mayor may not consider the dose indices to have a cumulative effect i.e. you may consider these are a separate administration of the drug.
|
|
Dose Indiees
|
|
1
|
2
|
3
|
4
|
|
Age Categories
|
A
|
101
|
100
|
88
|
99
|
|
B
|
96
|
95
|
84
|
93
|
|
C
|
103
|
81
|
87
|
86
|
|
D
|
85
|
83
|
81
|
84
|
|
E
|
78
|
79
|
91
|
77
|
|
F
|
91
|
77
|
75
|
86
|
The table shows gene expression values by defined six age categories and four dose indices administered to these subjects. Identify if gene expression is affected significantly by any of the defined age categories or dose indices in the patients treated with -)C.
Explain your testing hypothesis, assumptions and interpret the results
Question 4
|
Month
|
Visits
|
|
January
|
2520
|
|
February
|
2577
|
|
March
|
2238
|
|
April
|
3200
|
|
May
|
3212
|
|
June
|
3600
|
|
July
|
2872
|
|
August
|
2452
|
|
September
|
3200
|
|
October
|
3242
|
|
November
|
3258
|
|
December
|
3408
|
From the given data:
1. Predict the number of visits in January of next year. using a five-period moving average.
2. Predict the number of visits in January. using a seven-period moving average
3. Plot the actual data and the results of five-period and seven-period moving averages. Identify the best predictor
Question 5
|
|
Voting Preferences
|
|
Rep
|
Dem
|
Ind
|
|
Male
|
200
|
150
|
50
|
|
Female
|
250
|
300
|
50
|
An opinion poll surveyed a random sample of 1000 voters and results were organized in the above table. Respondents were classified by gender (female or male) and by voting preference (Independent. Republican or Democrat).
Identify whether gender has a significant impact on voting preference by carrying out an appropriate statistical test. Provide the appropriate code for the statistical test.
Attachment:- Nutrition Data File.rar