Monitoring, Evaluation, Quality and Research Skills Assessment
INSTRUCTIONS
Read carefully before beginning!
As part of this assessment, you should have received this instruction document plus an Excel file called 'Data_Exercise_L2.xls'. In this exercise, you will explore the patterns in 4 years of diarrheal disease surveillance data from 10 regions (or 216 districts) of Ghana. The data set contains three main tabs, "Diarrhea Data", "District Data" and "Data Key", with additional tabs provided for your work. Please take a few minutes to look at each tab, particularly the "Data Key", to make sure you understand what variables are available. Then go on to answer each of the five questions.
You will have up to 8 hours to complete the assessment. After the allotted time, you should send back this Word document with your answers, including any figures and tables pasted along with the text, as well as the modified Excel file. Be sure to retain all formulas and graphs in the Excel file so that your work can be checked. You may add any extra columns to the data if you need them. If you use an external program (such as R or Stata), be sure to also submit your code.
You should complete the assessment independently; however, you may use online resources and ask clarifying questions. Please manage your time so that you are able to attempt each question; do not leave any questions blank if you can help it. All of the questions can be answered in many ways, ranging from simple exploratory methods to more sophisticated quantitative methods. Give the answer that you feel best demonstrates your skills and critical thinking ability.
Good luck!
QUESTION 1
Populate a new column called 'region2' in the "Diarrhea Data" tab by bringing in the corresponding region for each district from the "District Data" tab using any method of your choice. Validate your new variable by checking that the values in 'region' are identical to those in 'region2'.
Recommended time: 10-15 min
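If you work outside Excel, the lookup and the validation can be sketched in Python as shown here. This is only a minimal illustration, not the expected answer: the rows are made up, and the 'district' key column is an assumption about how the two tabs are linked.

```python
# Illustrative rows only; the real values live in the Excel workbook.
# The "district" column name is an assumption about the join key.
district_data = [
    {"district": "District A", "region": "Region X"},
    {"district": "District B", "region": "Region Y"},
]
diarrhea_data = [
    {"district": "District A", "region": "Region X", "dis_ct": 120},
    {"district": "District B", "region": "Region Y", "dis_ct": 85},
    {"district": "district a ", "region": "Region X", "dis_ct": 97},
]


def normalize(name):
    """Trim spaces and ignore case so near-identical names still match."""
    return name.strip().lower()


def add_region2(rows, lookup_rows):
    """Return copies of rows with 'region2' looked up by district name."""
    lookup = {normalize(r["district"]): r["region"] for r in lookup_rows}
    # Districts with no match get None, so they stand out in the check below.
    return [{**row, "region2": lookup.get(normalize(row["district"]))}
            for row in rows]


def find_mismatches(rows):
    """Rows where the original and the looked-up region disagree."""
    return [r for r in rows if r["region"] != r["region2"]]


merged = add_region2(diarrhea_data, district_data)
mismatches = find_mismatches(merged)
print(f"{len(mismatches)} of {len(merged)} rows disagree")
```

An empty list of mismatches is the validation you are asked for. Any row with a missing 'region2' points to a district name that is spelled differently in the two tabs.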
QUESTION 2
Summarize diarrheal disease counts in the "Diarrhea Data" tab by region (10 regions) and by year (4 years) in the "Region Summary" tab. After you have filled the first table, adjust the count for population by calculating the diarrheal disease rate per 1,000 people.
Write a short, simple paragraph about the pattern you observe in the disease across time and space. You may use a table or a graph (or both) to support the text in your paragraph. Is it more appropriate to use disease counts or disease rates to answer the question? Why?
Recommended time: 45-60 min
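As a reminder of the arithmetic, the rate per 1,000 people is the count divided by the population, multiplied by 1,000. The Python sketch below aggregates first and divides afterwards. It assumes each row is one district in one year with its own population; the 'year' and 'population' column names and all numbers are hypothetical.

```python
from collections import defaultdict

# Hypothetical district-year rows; "year" and "population" are assumed names.
records = [
    {"region": "Region X", "year": 2012, "dis_ct": 1500, "population": 100000},
    {"region": "Region X", "year": 2012, "dis_ct": 500, "population": 50000},
    {"region": "Region Y", "year": 2012, "dis_ct": 1500, "population": 500000},
]


def summarize(rows):
    """Sum counts and populations per (region, year), then compute the rate."""
    counts = defaultdict(int)
    pops = defaultdict(int)
    for r in rows:
        key = (r["region"], r["year"])
        counts[key] += r["dis_ct"]
        pops[key] += r["population"]
    return {
        key: {
            "dis_ct": counts[key],
            # Rate per 1,000 people = 1000 * cases / population.
            "rate_per_1000": 1000 * counts[key] / pops[key],
        }
        for key in counts
    }


summary = summarize(records)
for (region, year), row in sorted(summary.items()):
    print(region, year, row["dis_ct"], round(row["rate_per_1000"], 1))
```

Dividing the summed count by the summed population gives the regional rate directly. Averaging district rates would weight small and large districts equally, which is a different quantity.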
QUESTION 3
In the tab called "Temporal Analysis", you will find data for Adansi North district in Ashanti region (south of the country) and Bongo district in Upper East region (north of the country). Graphically explore the data using the appropriate type of chart (or charts).
Briefly describe the temporal pattern of diarrheal disease in the two districts. Is diarrhea related to rainfall in Adansi North? What about in Bongo? Briefly describe why or why not. If you use any quantitative analysis to answer the question, be sure to provide relevant methods details in your write-up.
Recommended time: 60 min
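One possible quantitative check, among many, is a Pearson correlation between monthly rainfall and monthly cases, repeated with cases shifted by one or more months in case the effect of rain is delayed. The sketch below is a hedged illustration: the two series are invented, and the choice of lags is an assumption rather than part of the exercise.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


def lagged_pearson(rain, cases, lag):
    """Correlate rainfall in month t with cases in month t + lag."""
    if lag < 0:
        raise ValueError("lag must be zero or positive")
    if lag == 0:
        return pearson(rain, cases)
    return pearson(rain[:-lag], cases[lag:])


# Invented monthly series in which cases follow rainfall by one month.
rain = [0, 10, 0, 0, 10, 0]
cases = [5, 5, 15, 5, 5, 15]

for lag in range(3):
    print(f"lag {lag}: r = {lagged_pearson(rain, cases, lag):.2f}")
```

A correlation coefficient does not replace the charts. Seasonal series can correlate simply because both follow the calendar, so state that limitation if you report a figure like this.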
QUESTION 4
Examine the water and sanitation data in the "District Data" tab. Conduct any brief data quality checks you feel necessary. Comment on the data quality of these variables, briefly describing how you went about your validation.
Recommended time: 30-60 min
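Typical checks for percentage variables include looking for missing values, values outside 0 to 100, and category percentages that do not add up to about 100 per district. A minimal Python sketch of these three checks is given below. Apart from 'w_river_pond', the column names are hypothetical, and the sum check assumes that the listed columns cover every category, which you should confirm in the "Data Key" tab.

```python
# Hypothetical water-source columns; only "w_river_pond" is named in this
# document. The sum check assumes these columns cover all categories.
WATER_COLUMNS = ["w_piped", "w_well", "w_river_pond"]


def check_percent_row(row, columns, tolerance=1.0):
    """Return a list of problems found in one district's percentages."""
    problems = []
    values = []
    for col in columns:
        value = row.get(col)
        if value is None:
            problems.append(f"{col}: missing")
            continue
        if not 0 <= value <= 100:
            problems.append(f"{col}: {value} is outside 0-100")
        values.append(value)
    # Only test the total when every category is present.
    if len(values) == len(columns):
        total = sum(values)
        if abs(total - 100) > tolerance:
            problems.append(f"sum is {total:.1f}, expected about 100")
    return problems


# Made-up districts: one clean, one that sums to 125, one with a gap.
districts = {
    "District A": {"w_piped": 60.0, "w_well": 30.0, "w_river_pond": 10.0},
    "District B": {"w_piped": 70.0, "w_well": 45.0, "w_river_pond": 10.0},
    "District C": {"w_piped": None, "w_well": 40.0, "w_river_pond": 20.0},
}

report = {name: check_percent_row(row, WATER_COLUMNS)
          for name, row in districts.items()}
for name, problems in report.items():
    print(name, problems if problems else "ok")
```

The tolerance allows for rounding in the source data. A stricter or looser value is a judgment call that you should mention in your write-up.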
QUESTION 5
In the tab called "WASH Analysis", data for 20 districts were pasted from the "District Data" tab. It is widely known that drinking contaminated water and not having access to sanitation facilities are risk factors for diarrheal disease. Two variables were retained for analysis: 'w_river_pond', the % of people who report obtaining their drinking water from a river or a pond (i.e. drinking contaminated water); and 's_field', the % of people who report using a field for defecation (i.e. not having any sanitation facilities).
Bring in the total diarrheal disease count (dis_ct) for each district from the "Diarrhea Data" tab. You may use either dis_ct or the population-adjusted dis_rate; choose the version you feel is most appropriate, and justify why you used one or the other. Explore whether, in these 20 districts, drinking contaminated water and not having access to sanitation facilities are related to diarrhea.
Write a brief paragraph about your findings, supported by any tables, figures, and/or analyses you feel are appropriate.
Recommended time: 60-80 min
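With only 20 districts, a rank-based measure such as Spearman's correlation is one reasonable option, because it is less sensitive to a single extreme district than a Pearson correlation. The sketch below computes it as the Pearson correlation of average ranks. The five data points are invented, and using 'dis_rate' rather than 'dis_ct' here is an illustrative choice, not a prescribed answer.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    if n != len(ry) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sxx = sum((a - mean_x) ** 2 for a in rx)
    syy = sum((b - mean_y) ** 2 for b in ry)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


# Invented values for five districts, for illustration only.
s_field = [5.0, 12.0, 30.0, 45.0, 60.0]
dis_rate = [8.0, 9.5, 9.0, 20.0, 35.0]
print(f"Spearman rho = {spearman(s_field, dis_rate):.2f}")
```

Whatever measure you choose, a scatter plot of each risk factor against the disease measure is a useful companion, since a single coefficient can hide outliers and non-linear patterns.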