Identify and contextualise a problem by using a large dataset

Reference no: EM132240250, Length: 3000 words

Assignment -

Introduction: The project will be based on a real-world problem using real data from a selected application.

Task: You are required to identify and contextualise a problem by using a large dataset and use SAS programming to solve this problem. You need to use a collection of real-world "raw" datasets and write multiple SAS programs that perform the steps involved in pre-processing, statistical analysis, reporting and comparison.

You need to choose a dataset that:

  • has at least 2000 instances,
  • is related to the Health or Environment domain,
  • includes multivariate data.

The full functionality of the application is up to you but a brief project proposal (maximum two-pages) should be submitted to the Unit Leader for approval before you start your project. The proposal should be included as an Appendix in the final report and is not included in the word count. Submissions without a pre-approved proposal will not be accepted.

You are required to design, build, test and document a SAS application which will have the following steps:

  • Data Collection and Preparation (download the data sets, clean and prepare the data).
  • Pre-processing (write a SAS program to pre-process the raw data and create new files).
  • Statistical Analysis (write a SAS program that calculates the results to solve the problem that you identified in your proposal and produces statistical reports for at least two comparable files).
  • Comparison (write a SAS program that compares the statistics, for example by showing the similarities and differences).
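The four steps above can be sketched in SAS roughly as follows. This is only a minimal illustration, not a model solution: the folder path, the file names (site_a.csv, site_b.csv), and the variables age and glucose are hypothetical placeholders, both variables are assumed to be numeric, and the brief asks for several separate programs whereas the sketch keeps everything in one file for brevity.

```sas
/* Hypothetical location of the raw CSV files (assumption). */
%let rawdir = /path/to/raw;

/* Steps 1-3 for one raw file: import, pre-process, summarise.       */
/* The file name and the variables age and glucose are placeholders. */
%macro prepare(site);
    /* Data collection and preparation: read the raw CSV file. */
    proc import datafile="&rawdir./&site..csv"
                out=work.raw_&site dbms=csv replace;
        guessingrows=max;
    run;

    /* Pre-processing: drop rows with missing values, derive a group. */
    data work.clean_&site;
        set work.raw_&site;
        if nmiss(age, glucose) > 0 then delete;
        length age_group $8;
        if age < 40 then age_group = 'under40';
        else age_group = '40plus';
    run;

    /* Statistical analysis: per-group summary statistics. */
    proc means data=work.clean_&site noprint nway;
        class age_group;
        var glucose;
        output out=work.stats_&site(drop=_type_ _freq_)
               n=n mean=mean_glucose std=std_glucose;
    run;
%mend prepare;

%prepare(site_a)
%prepare(site_b)

/* Comparison: report where the two sets of statistics differ. */
proc compare base=work.stats_site_a compare=work.stats_site_b;
    id age_group;
    var n mean_glucose std_glucose;
run;
```

Wrapping the first three steps in a macro keeps the processing identical for both files, so any differences reported by PROC COMPARE reflect the data rather than inconsistent preparation.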

Deliverables:

Project report - A report that explains the design and development of the project must be submitted by the deadline. The report should contain the problem definition, objectives, system design rationale, results, conclusions, etc. Please see the marking criteria for more details. You should briefly describe your code as well and it should be clear what each data/procedure step does and why it is necessary.

The report should not exceed 3000 words. You can use figures, diagrams, tables and screenshots to explain your system and these are not included in the word count. References and appendices are not included in the word count either.

SAS code - The SAS code that implements the project requirements should be attached as an appendix to your report. You should test your code with the SAS software before you submit it to make sure it works. Correct use of SAS language constructs, code quality and professionalism will be marked. Please see the marking criteria for more details. A ZIP file containing the full running code should be submitted separately.

Presentation - Students will give a presentation to demonstrate their work and show the outcome. Each presentation lasts 15 minutes, followed by a 3-5 minute question-and-answer session.

Attachment:- Assignment Files.rar



Reviews

len2240250

2/22/2019 3:26:57 AM

Total words: 3000. I need an initial proposal (sent you a sample) by tomorrow explaining what we need to do; the brief says the paper must not exceed 3000 words. The initial proposal is due in a couple of days because it needs to be approved before we continue, so the proposal is very important. The proposal must include the chosen dataset. It doesn't have to be from the given website (they are flexible as long as the proposal indicates which dataset will be used; they will check it for approval). The work must indicate a problem and a solution in the proposal.

len2240250

2/22/2019 3:26:50 AM

First get the proposal done, then the rest once the proposal is approved. If you can sort this out properly, these guys have lots to do. Also, there is a dataset about diabetes with about 100000 observations; I think you may consider that. Just an idea; a minimum of 2000 instances is needed.
